Software Engineering Manager II, Site Reliability Engineering, Google Cloud

Google is a global technology company that builds and maintains large-scale distributed systems and infrastructure.
Site Reliability
Staff Software Engineer
In-Person
5,000+ Employees
8+ years of experience
Enterprise SaaS · Cloud

Description For Software Engineering Manager II, Site Reliability Engineering, Google Cloud

Google's Site Reliability Engineering (SRE) team is at the forefront of maintaining and optimizing the company's massive distributed systems. This role combines software and systems engineering to ensure Google's services maintain optimal reliability and performance. As a Software Engineering Manager II, you'll lead a team responsible for the uptime and efficiency of Google Cloud services.

The position offers unique challenges of scale specific to Google's infrastructure, requiring expertise in coding, algorithms, and large-scale system design. You'll be working with a diverse team of professionals who value intellectual curiosity and problem-solving in a blame-free environment. The role involves managing and mentoring a team while tackling complex technical challenges.

Your responsibilities will include leading software engineers, ensuring service reliability, implementing automation, and managing global on-call rotations. You'll be part of the Technical Infrastructure team, which forms the backbone of Google's product portfolio, from data centers to next-generation platforms.

This is an excellent opportunity for experienced engineering managers who are passionate about distributed systems, have strong technical leadership skills, and want to impact billions of users. You'll work with cutting-edge technology while building and maintaining the systems that power Google's vast array of services.

The role offers the chance to work with some of the most complex and large-scale systems in the industry, while developing your team and contributing to Google's technical infrastructure. You'll be part of an organization that values diversity, promotes self-direction, and provides the support needed for continuous learning and growth.

Last updated 2 hours ago

Responsibilities For Software Engineering Manager II, Site Reliability Engineering, Google Cloud

  • Lead a team of Software/Systems Engineers on projects for users and be directly responsible for uptime
  • Own end-to-end availability and performance of key services and build automation to prevent problem recurrence
  • Automate response to all non-exceptional service conditions
  • Lead by example, mentor the team and establish credibility through quality technical execution
  • Manage on-call rotations across continents, using a follow-the-sun model
  • Design, write and deliver software to improve the availability, scalability, latency and efficiency of Google's services

Requirements For Software Engineering Manager II, Site Reliability Engineering, Google Cloud

Linux
Kubernetes
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 8 years of experience with data structures or algorithms
  • 5 years of experience with software development in one or more programming languages
  • 3 years of people management experience
  • Experience designing, analyzing, and troubleshooting distributed systems
  • Experience working in computing, distributed systems, storage, or networking
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
  • Ability to debug, optimize code, and to automate routine tasks
  • Systematic problem-solving approach with effective verbal and written communication skills

Benefits For Software Engineering Manager II, Site Reliability Engineering, Google Cloud

Medical Insurance
Parental Leave
Visa Sponsorship
  • Equal opportunity employer
  • Accommodation for special needs
  • Global work environment

Interested in this job?

Jobs Related To Google Software Engineering Manager II, Site Reliability Engineering, Google Cloud

Technical Program Manager, Site Reliability

Technical Program Manager position at Google, leading Site Reliability initiatives for AI, Trust and Security platforms, requiring 8 years of program management experience.

Software Engineering Manager II, Site Reliability Engineering

Lead Site Reliability Engineering teams at Google, managing distributed systems and ensuring service reliability while driving technical excellence and team development.

Software Engineering Manager II, Site Reliability Engineering

Lead Site Reliability Engineering team at Google, managing distributed systems and service reliability while mentoring engineers and driving technical excellence.

Software Engineering Manager II, Site Reliability Engineering

Lead Site Reliability Engineering team at Google, managing distributed systems and ensuring service reliability while providing technical leadership and mentorship.

System Engineering Manager, Site Reliability Engineering, Google Play

Lead Google Play's Site Reliability Engineering team, managing large-scale distributed systems and driving technical excellence while developing engineering talent.