Site Reliability Engineering (SRE) at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. The role focuses on ensuring Google's services maintain reliability and appropriate uptime while continuously improving performance. As part of the Bedrock team, you'll be responsible for foundational production systems managing Google's machine fleet from Turnup to Decommissioning. The position offers unique challenges of scale specific to Google, requiring expertise in coding, algorithms, complexity analysis, and large-scale system design. The team promotes a culture of diversity, intellectual curiosity, and problem-solving in a blame-free environment. You'll work with professionals from various backgrounds, collaborating on meaningful projects while receiving support and mentorship for continuous growth and development. The role involves managing complex distributed systems, automating routine tasks, and ensuring the efficient operation of Google's vast technical infrastructure.