Site Reliability Manager, Core Enterprise Systems

Google is a global technology company that builds and maintains large-scale technical infrastructure and platforms.
Site Reliability
Staff Software Engineer
5,000+ Employees
5+ years of experience
Enterprise SaaS

Description For Site Reliability Manager, Core Enterprise Systems

Google's Core Enterprise Systems (CES) SRE team is seeking a Site Reliability Manager to lead a team of 6-10 engineers supporting critical enterprise services. This role sits within Corporate Engineering-Site Reliability Engineering (SRE) and supports the enterprise applications that power key verticals such as Finance, Legal, Supply Chain, and HR.

The position combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. You'll be responsible for ensuring Google Cloud's services maintain reliability and uptime while continuously improving performance and capacity. The role involves significant technical leadership, requiring 5 years of software development experience and strong expertise in algorithms, data structures, and Unix/Linux systems.

As a Site Reliability Manager, you'll develop strategic roadmaps and OKRs, lead service lifecycle management from design through deployment, and implement automation for sustainable scaling. You'll need 3 years of people management experience to effectively lead your team and 3 years of project leadership experience working with system administration or networking.

The role offers unique opportunities to tackle complex challenges at Google's scale while working in a culture that values diversity, intellectual curiosity, and problem-solving. You'll collaborate with teams across Google to transform enterprise services through standardized solutions and platforms. Preferred qualifications include experience with enterprise applications, cloud workload management, and building strategic partnerships with internal customers.

This is an excellent opportunity for an experienced technical leader who wants to impact critical enterprise systems at one of the world's leading technology companies. You'll have the support and resources to drive innovation while developing your team and advancing your career in site reliability engineering.

Last updated 2 days ago

Responsibilities For Site Reliability Manager, Core Enterprise Systems

  • Manage a team of 6-10 site reliability engineers supporting Google's enterprise services
  • Develop roadmaps, plans, and OKRs to advance the maturity of the managed services
  • Engage in and improve the whole lifecycle of services—from inception and design, through deployment, operation and refinement
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity
  • Practice sustainable incident response, ensuring that services meet their service level objectives

Requirements For Site Reliability Manager, Core Enterprise Systems

  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 5 years of experience with software development in one or more programming languages
  • 5 years of experience in algorithms, data structures, analysis, software design/development or Unix/Linux systems, IP networking, performance and application issues
  • 3 years of experience leading projects and working with system administration or networking
  • 3 years of people management experience
  • Experience in SAP or other ERP systems (preferred)
  • Experience in an engineering or operations role in Enterprise Applications (preferred)
  • Expertise in building strategic partnerships with internal customers (preferred)
  • Proficiency in navigating enterprise software and in deploying and managing workloads on Cloud (preferred)

