Site Reliability Engineering (SRE) at Google Cloud combines software and systems engineering to build and maintain large-scale distributed systems. As a Senior SRE, you'll be responsible for ensuring the reliability and uptime of Google Cloud's services, both internal and customer-facing systems. The role involves complex challenges of scale unique to Google Cloud, requiring expertise in coding, algorithms, complexity analysis, and large-scale system design.
The position offers opportunities to work on meaningful projects in a blame-free environment that values diversity, intellectual curiosity, and problem-solving. You'll be part of the Technical Infrastructure team, building and maintaining data centers and next-generation Google platforms. The team takes pride in being the engineers' engineers, focusing on keeping networks running optimally for the best user experience.
The role involves the complete service lifecycle, from design to deployment and refinement. You'll contribute to system design, develop software platforms, plan capacity, and conduct launch reviews. Post-deployment, you'll monitor system health, implement automation for scaling, and lead incident response with a blameless postmortem culture.
Working at Google offers competitive compensation including base salary, bonus, equity, and comprehensive benefits. The company strongly values diversity and inclusion, maintaining a culture of belonging and equal opportunity. You'll be part of a global team collaborating to solve complex technical challenges while contributing to Google's mission of organizing the world's information.