Software Engineering Manager II, Site Reliability Engineering

Google is a global technology company that builds and maintains large-scale, distributed systems and infrastructure.
Site Reliability
Staff Software Engineer
In-Person
5,000+ Employees
8+ years of experience
Enterprise SaaS

Description For Software Engineering Manager II, Site Reliability Engineering

Google's Site Reliability Engineering (SRE) team is at the forefront of maintaining and optimizing the company's massive distributed systems. This role combines software and systems engineering to ensure Google's services maintain reliability and performance at scale. As a Software Engineering Manager II in SRE, you'll lead a team responsible for the uptime and efficiency of critical systems.

The position requires a strong technical background: at least 8 years of experience with data structures or algorithms, 5 years of software development experience, 3 years of people management experience, and experience with distributed systems. The role involves leading a team, managing on-call rotations across continents, and driving automation initiatives.

You'll work within Google's Technical Infrastructure organization, which is fundamental to keeping Google's vast product portfolio running smoothly. The team takes pride in being "engineers' engineers" and focuses on building and maintaining the next generation of Google platforms.

The role combines technical leadership with people management and presents challenges of scale that are specific to Google. You'll be responsible for mentoring the team, driving technical excellence, and ensuring the reliability of services used by millions globally. The SRE team values diversity, intellectual curiosity, and problem-solving in a blame-free environment.

This position is ideal for someone who enjoys both technical challenges and leadership responsibilities, with opportunities to work on meaningful projects while receiving support and mentorship for continued growth. You'll be part of a culture that promotes self-direction and risk-taking, while maintaining the highest standards of system reliability and performance.

The role involves working with cutting-edge technology and contributing to Google's infrastructure at a global scale. You'll be responsible for building automation, improving system efficiency, and leading initiatives that directly impact Google's service quality. The position offers the chance to work with diverse teams and tackle complex technical challenges while developing leadership skills.

Last updated 4 days ago

Responsibilities For Software Engineering Manager II, Site Reliability Engineering

  • Lead a team of Software/Systems Engineers on projects for users and be directly responsible for uptime
  • Own end-to-end availability and performance of key services and build automation to prevent problem recurrence
  • Automate response to all non-exceptional service conditions
  • Lead by example, mentor the team and establish credibility through quality technical execution
  • Manage on-call rotations across continents, using a follow-the-sun model
  • Design, write and deliver software to improve the availability, scalability, latency and efficiency of Google's services

Requirements For Software Engineering Manager II, Site Reliability Engineering

  • Linux
  • Kubernetes
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 8 years of experience with data structures or algorithms
  • 5 years of experience with software development in one or more programming languages
  • 3 years of people management experience
  • Experience designing, analyzing, and troubleshooting distributed systems
  • Experience working in computing, distributed systems, storage, or networking
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
  • Ability to debug and optimize code and to automate routine tasks
  • Systematic problem-solving approach, coupled with effective communication skills

Benefits For Software Engineering Manager II, Site Reliability Engineering

  • Medical Insurance
  • Vision Insurance
  • Dental Insurance
  • Parental Leave
  • Equal opportunity employer
  • Accommodation for special needs
  • Global work environment
