Site Reliability Engineering (SRE) at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an Engineering Manager in the SRE team, you'll lead a talented group of engineers responsible for ensuring Google's services maintain optimal reliability and performance. You'll manage complex challenges unique to Google's scale while leveraging expertise in coding, algorithms, and system design.
The role involves leading projects globally, providing technical leadership, and developing teams. You'll be part of the Technical Infrastructure team, which builds and maintains Google's data centers and platforms. Your team will be responsible for keeping networks running efficiently and ensuring the best user experience.
The position requires strong leadership skills, technical expertise, and experience with distributed systems. You'll manage on-call rotations across time zones and lead initiatives to improve service availability, scalability, and efficiency. The role offers opportunities to work on meaningful projects in a blame-free environment that values diversity, intellectual curiosity, and problem-solving.
Google's SRE culture promotes self-direction while providing support and mentorship for growth. You'll join a diverse team with various backgrounds and perspectives, collaborating on complex challenges. The role combines technical leadership with people management, requiring both strong engineering skills and the ability to develop and empower team members.
Working at Google's Technical Infrastructure team means being at the forefront of technology, building and maintaining systems that power Google's vast product portfolio. You'll have the opportunity to make a significant impact on systems used by millions of users worldwide while working with cutting-edge technology and leading a team of talented engineers.