System Engineering Manager, Site Reliability Engineering, Google Play

Google is a global technology company that builds and maintains large-scale technical infrastructure and platforms.
$150,000 - $250,000
Site Reliability
Staff Software Engineer
In-Person
5,000+ Employees
5+ years of experience
Enterprise SaaS

Description For System Engineering Manager, Site Reliability Engineering, Google Play

Site Reliability Engineering (SRE) at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As a System Engineering Manager for Google Play's SRE team, you'll lead a team ensuring Google's services maintain reliability and appropriate uptime while monitoring system capacity and performance. The role involves managing complex challenges unique to Google's scale, utilizing expertise in coding, algorithms, and large-scale system design.

The position requires leadership in optimizing existing systems, building infrastructure, and automating processes. You'll guide a team of Software/Systems Engineers, managing on-call rotations across continents and driving technical projects to improve service availability, scalability, and efficiency. The role demands both technical expertise and people management skills, as you'll be responsible for recruiting, developing, and inspiring team members.

Working in Google's Technical Infrastructure team, you'll be part of the foundation that makes Google's product portfolio possible. The team takes pride in being the engineers' engineers, maintaining data centers, and building next-generation Google platforms. This role offers the opportunity to work in an environment that values diversity, intellectual curiosity, and problem-solving, while promoting self-direction and providing support for continuous learning and growth.

The ideal candidate will combine technical leadership with people management skills, having the ability to set and drive big-picture strategy while providing hands-on technical guidance. You'll be responsible for the overall planning, execution, and success of technical projects, ensuring your team works cohesively to deliver products on time and within budget.

Last updated 5 days ago

Responsibilities For System Engineering Manager, Site Reliability Engineering, Google Play

  • Guide a team of Software/Systems Engineers on projects for users and be responsible for uptime
  • Own end-to-end availability and performance of services and build automation to prevent problem recurrence
  • Manage on-call rotations across continents, using a follow-the-sun model
  • Design, write and deliver software to improve the availability, scalability, latency and efficiency of Google's services
  • Develop goals and strategies for the team
  • Drive technical projects and provide leadership in a changing environment

Requirements For System Engineering Manager, Site Reliability Engineering, Google Play

Linux
Python
Go
Java
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 3 years of experience leading projects and working with administration or networking
  • 5 years of experience with programming in one or more programming languages
  • 3 years of people management experience
  • Experience in recruiting and managing a team of experienced engineers on large scale projects
  • Ability to set and drive the 'big picture' strategy while providing technical guidance

Interested in this job?

Jobs Related To Google System Engineering Manager, Site Reliability Engineering, Google Play

Technical Program Manager, Site Reliability

Technical Program Manager position at Google focusing on Site Reliability Engineering, managing cross-functional projects and ensuring system reliability.

Software Engineering Manager II, Site Reliability Engineering

Lead Site Reliability Engineering teams at Google, managing distributed systems and ensuring service reliability at global scale.

Software Engineering Manager II, Site Reliability Engineering, Google Cloud

Lead Site Reliability Engineering team at Google Cloud, managing distributed systems and infrastructure while ensuring service reliability and performance.

Software Developer Manager II, Site Reliability Engineering

Lead Site Reliability Engineering team at Google, managing distributed systems and service reliability while driving technical excellence and team growth.

Software Engineering Manager II, Site Reliability Engineering

Lead Site Reliability Engineering team at Google, managing distributed systems and ensuring service reliability while providing technical leadership and team development.