Google's Site Reliability Engineering (SRE) team is at the forefront of maintaining and optimizing the company's massive distributed systems infrastructure. As a Senior SRE, you'll combine software and systems engineering expertise to ensure Google Cloud's services maintain exceptional reliability and performance. The role offers unique challenges in managing scale that are specific to Google Cloud, while leveraging your skills in coding, algorithms, and large-scale system design.
The position involves working with both internally critical and externally-visible systems, focusing on reliability, uptime, and continuous improvement. You'll be part of a team that values diversity, intellectual curiosity, and problem-solving in a blame-free environment. The role encompasses the entire service lifecycle, from design and development to deployment and optimization.
As part of the Technical Infrastructure team, you'll be instrumental in building and maintaining Google's data centers and next-generation platforms. The team takes pride in being the "engineers' engineers," with a hands-on approach to problem-solving. Your work will directly impact millions of users by ensuring optimal performance and reliability of Google's extensive product portfolio.
This is an excellent opportunity for experienced engineers who are passionate about distributed systems, automation, and maintaining large-scale infrastructure. The role offers significant autonomy, with opportunities to collaborate with diverse teams and tackle complex technical challenges. Google's culture promotes self-direction while providing strong support and mentorship for continued learning and growth.