Software Engineering Manager II, Site Reliability Engineering, Google Cloud

Google is a global technology company that builds and maintains large-scale distributed systems and infrastructure.
Site Reliability
Staff Software Engineer
In-Person
5,000+ Employees
8+ years of experience
Enterprise SaaS · Cloud

Description For Software Engineering Manager II, Site Reliability Engineering, Google Cloud

Google's Site Reliability Engineering (SRE) team is seeking a Software Engineering Manager II to lead and grow a team supporting its Cloud infrastructure. This role combines software and systems engineering to build and maintain Google's large-scale, distributed systems. As an Engineering Manager, you'll lead a team responsible for keeping Google's services reliable and performant while driving continuous improvement.

The position requires a strong technical background, including 8 years of experience with data structures or algorithms and 5 years of software development experience, along with proven leadership demonstrated through 3 years of people management. You'll be responsible for managing a team of Software/Systems Engineers, overseeing critical projects, and ensuring service reliability through automation and systematic problem-solving.

In this role, you'll work with Google's Technical Infrastructure team, which forms the backbone of Google's product portfolio. You'll lead end-to-end availability and performance initiatives, manage global on-call rotations, and drive automation efforts to prevent problem recurrence. The role presents challenges of scale unique to Google and combines technical leadership with people management.

SRE at Google promotes a culture of diversity, intellectual curiosity, and blame-free problem-solving. The team brings together individuals with varied backgrounds and perspectives, encouraging collaboration and innovative thinking. You'll have the opportunity to work on meaningful projects while receiving support and mentorship for continuous learning and growth.

The ideal candidate will possess expertise in distributed systems, strong debugging and optimization skills, and excellent communication abilities. You'll be part of a team that's proud to be "engineers' engineers," working on everything from data center development to next-generation Google platforms. This role offers the chance to make a significant impact on Google's infrastructure while leading and developing a team of talented engineers.

Last updated 3 days ago

Responsibilities For Software Engineering Manager II, Site Reliability Engineering, Google Cloud

  • Lead a team of Software/Systems Engineers on projects for users and take direct responsibility for uptime
  • Own end-to-end availability and performance of key services and build automation to prevent problem recurrence
  • Lead by example, mentor the team and establish credibility through quality technical execution
  • Manage on-call rotations across continents, using a follow-the-sun model
  • Design, write and deliver software to improve the availability, scalability, latency and efficiency of Google's services

Requirements For Software Engineering Manager II, Site Reliability Engineering, Google Cloud

Linux
Kubernetes
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 8 years of experience with data structures or algorithms
  • 5 years of experience with software development in one or more programming languages
  • 3 years of people management experience
  • Experience designing, analyzing, and troubleshooting distributed systems
