Software Engineering Manager II, Site Reliability Engineering, Google Cloud

Google is a global technology company that builds and maintains large-scale distributed systems and infrastructure.
Site Reliability
Staff Software Engineer
In-Person
5,000+ Employees
8+ years of experience
Enterprise SaaS · Cloud

Description For Software Engineering Manager II, Site Reliability Engineering, Google Cloud

Google's Site Reliability Engineering (SRE) team is seeking a Software Engineering Manager II to lead and grow its Cloud infrastructure. This role combines software and systems engineering to build and maintain Google's large-scale, distributed systems. As an Engineering Manager, you'll lead a team responsible for ensuring Google's services maintain optimal reliability and performance while driving continuous improvement.

The position requires a strong technical background, including 8 years of experience with data structures or algorithms and 5 years of software development experience, along with proven leadership capabilities demonstrated through 3 years of people management. You'll be responsible for managing a team of Software/Systems Engineers, overseeing critical projects, and ensuring service reliability through automation and systematic problem-solving.

In this role, you'll work with Google's Technical Infrastructure team, which forms the backbone of Google's product portfolio. You'll lead initiatives to optimize system performance, implement automation to prevent issues, and manage global on-call rotations. The role offers unique challenges of scale specific to Google while providing opportunities to work with cutting-edge technology and diverse, talented teams.

The ideal candidate will combine technical expertise in distributed systems with strong leadership abilities. You'll need to demonstrate excellent communication skills, systematic problem-solving approaches, and the ability to mentor and develop team members. This position offers the opportunity to impact Google's infrastructure at a global scale while working in an environment that values diversity, intellectual curiosity, and innovation.

Google provides a supportive, blame-free environment that promotes self-direction and continuous learning. You'll have the chance to work on meaningful projects while receiving the support and mentorship needed to grow both technically and as a leader. The role offers exposure to complex technical challenges while allowing you to shape the future of Google's infrastructure reliability.

Last updated a month ago

Responsibilities For Software Engineering Manager II, Site Reliability Engineering, Google Cloud

  • Lead a team of Software/Systems Engineers on projects for users and be directly responsible for uptime
  • Own end-to-end availability and performance of key services and build automation to prevent problem recurrence
  • Lead by example, mentor the team and establish credibility through quality technical execution
  • Manage on-call rotations across continents, using a follow-the-sun model
  • Design, write and deliver software to improve the availability, scalability, latency and efficiency of Google's services

Requirements For Software Engineering Manager II, Site Reliability Engineering, Google Cloud

Linux · Kubernetes
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 8 years of experience with data structures or algorithms
  • 5 years of experience with software development in one or more programming languages
  • 3 years of people management experience
  • Experience designing, analyzing, and troubleshooting distributed systems
