System Engineering Manager, Site Reliability Engineering, Google Play

Google is a global technology company that builds and maintains large-scale, distributed systems and infrastructure.
$180,000 - $300,000
Site Reliability
Staff Software Engineer
In-Person
5,000+ Employees
5+ years of experience
Enterprise SaaS

Description For System Engineering Manager, Site Reliability Engineering, Google Play

Site Reliability Engineering (SRE) at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As a System Engineering Manager for Google Play's SRE team, you'll lead a team that keeps Google's services reliable and appropriately available while monitoring system capacity and performance. The role involves tackling complex challenges unique to Google's scale, drawing on expertise in coding, algorithms, and large-scale system design.

The position requires strong leadership skills to guide a team of experienced engineers, focusing on optimizing existing systems, building infrastructure, and implementing automation. You'll be responsible for maintaining service availability, managing cross-continental on-call rotations, and driving technical projects to completion. The role combines technical expertise with people management, requiring both strategic thinking and hands-on technical guidance.

Google's SRE culture emphasizes diversity, intellectual curiosity, and problem-solving in a blame-free environment. You'll work with people from various backgrounds and perspectives, encouraging collaboration and innovation. The Technical Infrastructure team plays a crucial role in maintaining Google's architecture, from data centers to next-generation platforms, ensuring users have the best possible experience.

This is an opportunity to lead and grow a team while working on some of the most complex and impactful systems in technology. You'll be responsible for both the technical success of critical infrastructure and the professional development of your team members, making this role perfect for those who combine technical excellence with strong leadership capabilities.

Last updated 14 hours ago

Responsibilities For System Engineering Manager, Site Reliability Engineering, Google Play

  • Guide a team of software and systems engineers on projects that serve users, and take responsibility for service uptime
  • Own end-to-end availability and performance of services and build automation to prevent problem recurrence
  • Manage on-call rotations across continents, using a follow-the-sun model
  • Design, write and deliver software to improve the availability, scalability, latency and efficiency of Google's services
  • Develop goals and strategies for the team
  • Drive technical projects and provide leadership in a changing environment

Requirements For System Engineering Manager, Site Reliability Engineering, Google Play

Linux
Python
Go
Java
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 3 years of experience leading projects and working with administration or networking
  • 5 years of experience with programming in one or more programming languages
  • 3 years of people management experience
  • Experience in recruiting and managing a team of experienced engineers on large-scale projects
  • Ability to set and drive the 'big picture' strategy while providing technical guidance
