Site Reliability Engineering (SRE) at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE, you'll ensure that Google Cloud's services maintain reliability and appropriate uptime, and you'll monitor system capacity and performance. The role involves optimizing existing systems, building infrastructure, and automating processes. You'll tackle scaling challenges unique to Google Cloud, applying expertise in coding, algorithms, complexity analysis, and large-scale system design.

The team values intellectual curiosity, problem-solving, and openness, and it brings together people with diverse backgrounds and perspectives. You'll work in a blame-free environment that encourages collaboration, thinking big, and taking risks, with strong support and mentorship for professional growth.

The Technical Infrastructure team builds and maintains the architecture that supports Google's product portfolio, from data centers to next-generation platforms. The role combines hands-on engineering with system architecture, so that users get the best possible experience from reliable, efficient infrastructure.