Site Reliability Engineering (SRE) at Google is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As a Software Developer II in the SRE team, you'll be responsible for ensuring Google's services maintain reliability and uptime while focusing on performance and capacity optimization. The role involves creative problem-solving, automation, and system optimization.
SRE at Google follows key principles including limiting operational work, conducting blameless postmortems, and proactively identifying potential outages. The team embraces a culture of diversity, intellectual curiosity, and openness, bringing together people with varied backgrounds and perspectives. You'll work in a blame-free environment that encourages collaboration, big thinking, and risk-taking.
The position offers a competitive compensation package including base salary, bonus, equity, and comprehensive benefits. You'll have the opportunity to work on meaningful projects while receiving support and mentorship for professional growth. The role requires technical expertise to manage project priorities, deadlines, and deliverables, as well as skills in designing, developing, testing, deploying, maintaining, and enhancing software solutions.
As part of the SRE team, you'll contribute to Google's internally critical and externally-visible systems, working with a breadth of tools and approaches to solve complex problems. The role combines aspects of software development and systems engineering, focusing on building infrastructure and eliminating work through automation. This position is perfect for someone who wants to impact how Google's systems interact while working with cutting-edge technology at scale.