Google's Site Reliability Engineering (SRE) team is seeking a Software Engineering Manager II to lead and grow their Cloud infrastructure. This role combines software and systems engineering to build and maintain Google's large-scale, distributed systems. As an Engineering Manager, you'll lead a team responsible for ensuring Google's services maintain optimal reliability and performance while driving continuous improvement.
The position requires a strong technical background with 8 years of experience in data structures and algorithms, along with proven leadership capabilities demonstrated through 3 years of people management. You'll be responsible for managing a team of Software/Systems Engineers, overseeing critical projects, and ensuring service reliability through automation and systematic problem-solving.
In this role, you'll work with Google's Technical Infrastructure team, which forms the backbone of Google's product portfolio. You'll lead end-to-end availability and performance initiatives, manage global on-call rotations, and drive automation efforts to prevent problem recurrence. The role offers unique challenges of scale specific to Google, combining technical leadership with people management.
SRE at Google promotes a culture of diversity, intellectual curiosity, and blame-free problem-solving. The team brings together individuals with varied backgrounds and perspectives, encouraging collaboration and innovative thinking. You'll have the opportunity to work on meaningful projects while receiving support and mentorship for continuous learning and growth.
The ideal candidate will possess expertise in distributed systems, strong debugging and optimization skills, and excellent communication abilities. You'll be part of a team that's proud to be "engineers' engineers," working on everything from data center development to next-generation Google platforms. This role offers the chance to make a significant impact on Google's infrastructure while leading and developing a team of talented engineers.