Site Reliability Engineering (SRE) at Google Cloud combines software and systems engineering to build and maintain large-scale distributed systems. As an SRE III, you'll be responsible for ensuring the reliability and uptime of Google Cloud's services, both internal and customer-facing systems. The role involves optimizing existing systems, building infrastructure, and automating processes to eliminate manual work.
You'll be part of the Technical Infrastructure team, working on managing complex challenges unique to Google Cloud's scale. The position requires expertise in coding, algorithms, complexity analysis, and large-scale system design. Google's SRE culture emphasizes diversity, intellectual curiosity, problem-solving, and openness in a blame-free environment.
The role offers opportunities to work with people from diverse backgrounds and perspectives, encouraging collaboration and innovation. You'll manage project priorities, deadlines, and deliverables while designing, developing, testing, deploying, maintaining, and enhancing software solutions. The position includes competitive compensation with a base salary range of $136,000-$200,000, plus bonus, equity, and comprehensive benefits.
As an SRE III, you'll be responsible for keeping Google's networks running optimally, ensuring users have the best and fastest experience possible. You'll work on developing and maintaining data centers and building next-generation Google platforms. The role requires strong technical skills, ability to work in teams, and a passion for solving complex infrastructure challenges at scale.
The position offers growth opportunities through self-direction on meaningful projects, supported by mentorship and a collaborative learning environment. You'll be part of a team that's proud to be "engineers' engineers," taking on challenging technical problems and building solutions that make Google's product portfolio possible.