Software Engineering Manager II, Site Reliability Engineering, Google Cloud

Google is a global technology company that builds and maintains large-scale distributed systems and infrastructure.
$150,000 - $250,000
Site Reliability
Staff Software Engineer
In-Person
5,000+ Employees
8+ years of experience
Enterprise SaaS · Cloud

Description For Software Engineering Manager II, Site Reliability Engineering, Google Cloud

Google's Site Reliability Engineering (SRE) team is at the forefront of maintaining and optimizing the company's massive distributed systems. This role combines software and systems engineering to ensure Google's services maintain optimal reliability and performance. As a Software Engineering Manager II, you'll lead a team responsible for the uptime and efficiency of Google Cloud services.

The position offers unique challenges of scale specific to Google's infrastructure, requiring expertise in coding, algorithms, and large-scale system design. You'll work with a diverse team of professionals, managing complex distributed systems while building automation to prevent problems and optimize performance.

The role involves leading a team of Software/Systems Engineers, providing technical leadership, and being directly responsible for service uptime. You'll manage on-call rotations across continents and work on improving the availability, scalability, and efficiency of Google's services.

The Technical Infrastructure team is crucial in maintaining Google's vast architecture and developing next-generation platforms. This role offers the opportunity to work with cutting-edge technology while leading and mentoring a team of skilled engineers. The position requires a strong background in distributed systems, proven leadership experience, and the ability to solve complex technical challenges.

Working at Google provides exposure to some of the most complex and interesting technical challenges in the industry, alongside a culture that promotes diversity, intellectual curiosity, and problem-solving in a blame-free environment. The role offers significant growth opportunities and the chance to make a real impact on systems used by millions of users globally.

Last updated 2 days ago

Responsibilities For Software Engineering Manager II, Site Reliability Engineering, Google Cloud

  • Lead a team of Software/Systems Engineers on projects for users and be directly responsible for uptime
  • Own end-to-end availability and performance of key services and build automation to prevent problem recurrence
  • Lead by example, mentor the team and establish credibility through quality technical execution
  • Manage on-call rotations across continents, using a follow-the-sun model
  • Design, write and deliver software to improve the availability, scalability, latency and efficiency of Google's services

Requirements For Software Engineering Manager II, Site Reliability Engineering, Google Cloud

Linux
Kubernetes
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 8 years of experience with data structures or algorithms
  • 5 years of experience with software development in one or more programming languages
  • 3 years of people management experience
  • Experience designing, analyzing, and troubleshooting distributed systems
  • Experience working in computing, distributed systems, storage, or networking
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
  • Ability to debug, optimize code, and to automate routine tasks
  • Systematic problem-solving approach with effective communication skills

Benefits For Software Engineering Manager II, Site Reliability Engineering, Google Cloud

Medical Insurance
Parental Leave
Equity
  • Equal opportunity employer
  • Inclusive work environment
  • Global collaboration opportunities

Interested in this job?

Jobs Related To Google Software Engineering Manager II, Site Reliability Engineering, Google Cloud

Technical Program Manager, Site Reliability

Technical Program Manager position at Google, leading Site Reliability initiatives for AI, Trust and Security platforms with 8+ years of experience required.

Software Engineering Manager II, Site Reliability Engineering

Lead Site Reliability Engineering team at Google, managing distributed systems and service reliability while mentoring engineers and driving technical excellence.

Software Engineering Manager II, SRE, Cloud Logs

Lead SRE team at Google Cloud Logging, managing distributed systems and ensuring service reliability while driving technical excellence and team development.

Software Engineering Manager II, Site Reliability Engineering

Lead Site Reliability Engineering team at Google, managing distributed systems and ensuring service reliability while driving technical excellence and team development.

Software Engineering Manager II, SRE

Lead SRE role at Google focusing on managing distributed systems and infrastructure teams while ensuring service reliability and performance.