Site Reliability Engineering (SRE) at Google Cloud combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE, you'll ensure Google Cloud's services maintain reliability and appropriate uptime while focusing on system optimization and automation. The role offers unique challenges of scale specific to Google Cloud, requiring expertise in coding, algorithms, complexity analysis, and large-scale system design.
The position is part of the Technical Infrastructure team, responsible for the architecture behind all Google's user-facing services. You'll be involved in developing and maintaining data centers, building next-generation Google platforms, and ensuring networks operate at peak performance. The team takes pride in being the engineers' engineers, focusing on both maintenance and innovation.
The role offers a collaborative environment that values diversity, intellectual curiosity, and problem-solving. Google encourages self-direction on meaningful projects while providing support and mentorship for growth. You'll manage project priorities, deadlines, and deliverables while designing, developing, testing, deploying, and enhancing software solutions.
The compensation package is comprehensive, including a competitive base salary range of $136,000-$200,000, plus bonus, equity, and benefits. The role offers opportunities to work with cutting-edge technology while contributing to systems that impact millions of users globally. Join a team that combines technical excellence with a supportive, blame-free culture focused on continuous improvement and innovation.