Software Engineering Manager II, Site Reliability Engineering

Google is a global technology company that builds and maintains large-scale, distributed systems and infrastructure.
Site Reliability
Staff Software Engineer
In-Person
5,000+ Employees
8+ years of experience
Enterprise SaaS

Description For Software Engineering Manager II, Site Reliability Engineering

Google's Site Reliability Engineering (SRE) team is at the forefront of maintaining and optimizing the company's massive distributed systems. As a Software Engineering Manager II in SRE, you'll lead a team responsible for ensuring Google's services maintain optimal reliability and performance. The role combines software and systems engineering to build and run large-scale, fault-tolerant systems.

The position offers unique challenges of scale specific to Google's infrastructure, requiring expertise in coding, algorithms, complexity analysis, and large-scale system design. You'll be responsible for leading a team that manages critical systems, both internal and customer-facing, focusing on reliability, uptime, and continuous improvement.

The role involves significant leadership responsibilities, including mentoring team members, managing on-call rotations across different time zones, and driving technical excellence. You'll work on optimizing existing systems, building infrastructure, and creating automation solutions to eliminate manual work.

Google's Technical Infrastructure team, which includes SRE, is fundamental to making Google's product portfolio possible. The team takes pride in being "engineers' engineers" and approaches challenges with both technical depth and creativity. The culture emphasizes diversity, intellectual curiosity, and problem-solving in a blame-free environment.

This position offers the opportunity to work with cutting-edge technology at unprecedented scale, lead and develop talented engineers, and directly impact billions of users worldwide. The role requires a blend of technical expertise, leadership skills, and strategic thinking, making it ideal for experienced engineering managers who want to work on some of the most complex and impactful systems in the technology industry.

For those interested in learning more, Google has published books on Site Reliability Engineering and offers detailed career profiles about why engineers choose to join SRE. The role combines the excitement of technical challenges with the satisfaction of leadership and mentorship, all while working on systems that affect users globally.


Responsibilities For Software Engineering Manager II, Site Reliability Engineering

  • Lead a team of Software/Systems Engineers on projects for users and be directly responsible for uptime
  • Own end-to-end availability and performance of key services and build automation to prevent problem recurrence
  • Automate response to all non-exceptional service conditions
  • Lead by example, mentor the team and establish credibility through quality technical execution
  • Manage on-call rotations across continents, using a follow-the-sun model
  • Design, write and deliver software to improve the availability, scalability, latency and efficiency of Google's services

Requirements For Software Engineering Manager II, Site Reliability Engineering

  • Technologies: Linux, Python, Go, Java
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 8 years of experience with data structures or algorithms
  • 5 years of experience with software development in one or more programming languages
  • 3 years of people management experience
  • Experience designing, analyzing, and troubleshooting distributed systems
  • Experience working in computing, distributed systems, storage, or networking
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
  • Ability to debug and optimize code and to automate routine tasks
  • Systematic problem-solving approach, coupled with effective communication skills

Benefits For Software Engineering Manager II, Site Reliability Engineering

  • Medical insurance
  • Vision insurance
  • Dental insurance
  • Parental leave
  • Equal opportunity employer
  • Accommodation for special needs
  • Global work environment
