Site Reliability Engineering (SRE) at Google is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE, you'll ensure Google's services have appropriate reliability and uptime while maintaining performance and capacity. The role focuses on optimizing existing systems, building infrastructure, and automating operations problems.
SRE at Google emphasizes limiting operational work, conducting blameless postmortems, and proactively identifying potential outages. The culture promotes diversity, intellectual curiosity, problem-solving, and openness. You'll work with people from various backgrounds and perspectives, collaborating in a blame-free environment that encourages big thinking and risk-taking.
The position offers opportunities for self-direction on meaningful projects while providing support and mentorship for learning and growth. You'll be responsible for managing project priorities, deadlines, and deliverables, as well as designing, developing, testing, deploying, maintaining, and enhancing software solutions.
Google provides competitive compensation including bonus, equity, and comprehensive benefits. The company is committed to building a representative workforce and fostering a culture of belonging, offering equal employment opportunities to all candidates regardless of background.