Google Cloud's Site Reliability Engineering (SRE) team is at the forefront of building and maintaining large-scale, massively distributed, fault-tolerant systems. As a Senior SRE, you'll be responsible for ensuring the reliability and uptime of Google Cloud's critical services while focusing on optimization, infrastructure development, and automation. The role combines software and systems engineering expertise to tackle unique scaling challenges.
You'll be part of the Technical Infrastructure team that powers Google's entire product portfolio, from developing and maintaining data centers to building next-generation platforms. The position offers the opportunity to work with complex distributed systems at a scale few companies can match, while collaborating with a diverse team of engineers who value intellectual curiosity and problem-solving.
The role provides a unique blend of development and operations, where you'll be involved in the entire service lifecycle - from design and implementation to deployment and maintenance. You'll work on optimizing existing systems, building robust infrastructure, and creating automation solutions to eliminate manual work. The team culture promotes self-direction, mentorship, and growth in a blame-free environment.
As a Senior SRE, you'll contribute to critical decisions about system design, capacity planning, and performance optimization. The position offers competitive compensation, including a comprehensive benefits package, and the chance to work with cutting-edge technology while solving some of the most challenging problems in distributed systems engineering.