Google's Site Reliability Engineering (SRE) team combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE, you'll ensure Google Cloud's services maintain reliability and appropriate uptime while managing performance and capacity. The role focuses on optimizing systems, building infrastructure, and automation. You'll tackle unique scale challenges specific to Google Cloud, applying expertise in coding, algorithms, complexity analysis, and large-scale system design.
The team values diversity, intellectual curiosity, and problem-solving in a blame-free environment. You'll join a diverse group collaborating on meaningful projects with strong support and mentorship. The Technical Infrastructure team builds and maintains the architecture behind Google's products, from data centers to next-generation platforms.
This role offers the opportunity to:
The position combines technical depth with leadership opportunities, requiring both hands-on engineering skills and the ability to guide technical decisions. You'll work in an environment that promotes self-direction while providing the support needed for professional growth.