Google Cloud is seeking a Senior Software Developer in Site Reliability Engineering (SRE) to join their technical infrastructure team. This role combines software and systems engineering to build and maintain Google Cloud's large-scale, massively distributed, fault-tolerant systems. As an SRE, you'll be responsible for ensuring the reliability and uptime of both internal and external systems while managing the unique challenges of scale specific to Google Cloud.
The position requires expertise in coding, algorithms, complexity analysis, and large-scale system design. You'll work on optimizing existing systems, building infrastructure, and implementing automation to eliminate manual work. The role offers opportunities to work with cutting-edge technology and solve complex problems at an unprecedented scale.
Google's SRE team promotes a culture of diversity, intellectual curiosity, and problem-solving in a blame-free environment. The organization brings together individuals from various backgrounds and experiences, encouraging collaboration and innovative thinking. You'll have the freedom to work on meaningful projects while receiving the support and mentorship needed for professional growth.
The role offers competitive compensation, including a base salary range of $161,000-$239,000, plus bonus, equity, and comprehensive benefits. You'll be part of the Technical Infrastructure team, working behind the scenes to maintain and develop Google's data centers and platforms, ensuring optimal performance and user experience.
Key responsibilities include managing the complete service lifecycle, from design to deployment and refinement, conducting system design consulting, capacity planning, and launch reviews. You'll also focus on monitoring system health, implementing automation for scalability, and participating in incident response with blameless postmortems.
This is an excellent opportunity for experienced engineers who are passionate about distributed systems, enjoy solving complex technical challenges, and want to make a significant impact on systems that serve billions of users worldwide.