Site Reliability Engineering (SRE) at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As a System Engineering Manager for Google Play's SRE team, you'll lead a team ensuring Google's services maintain reliability and appropriate uptime while monitoring system capacity and performance. The role involves managing complex challenges unique to Google's scale, utilizing expertise in coding, algorithms, and large-scale system design.
The position requires leadership in optimizing existing systems, building infrastructure, and automating processes. You'll guide a team of Software/Systems Engineers, managing on-call rotations across continents and driving technical projects to improve service availability, scalability, and efficiency. The role demands both technical expertise and people management skills, as you'll be responsible for recruiting, developing, and inspiring team members.
Working in Google's Technical Infrastructure team, you'll be part of the foundation that makes Google's product portfolio possible. The team takes pride in being the engineers' engineers, maintaining data centers, and building next-generation Google platforms. This role offers the opportunity to work in an environment that values diversity, intellectual curiosity, and problem-solving, while promoting self-direction and providing support for continuous learning and growth.
The ideal candidate will combine technical leadership with people management skills, having the ability to set and drive big-picture strategy while providing hands-on technical guidance. You'll be responsible for the overall planning, execution, and success of technical projects, ensuring your team works cohesively to deliver products on time and within budget.