Site Reliability Engineering (SRE) at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE Engineer II, you'll be responsible for ensuring Google Cloud's services maintain reliability and appropriate uptime for customer needs. The role involves managing complex challenges of scale unique to Google Cloud while utilizing expertise in coding, algorithms, and large-scale system design.
The position offers opportunities to work on meaningful projects in a blame-free environment that promotes self-direction and collaboration. You'll be part of a diverse team that values intellectual curiosity and problem-solving, working on optimizing existing systems, building infrastructure, and automating processes.
Key aspects of the role include system design consulting, capacity planning, launch reviews, and maintaining service health through monitoring and measurement. You'll be involved in the entire service lifecycle, from inception to refinement, while focusing on sustainability and reliability improvements.
The ideal candidate should have a strong foundation in computer science or related field, programming experience, and knowledge of system administration or networking. The role offers growth opportunities through mentorship and learning in an environment that brings together people with diverse backgrounds and perspectives.
Google provides an inclusive workplace culture and equal employment opportunities, regardless of background. The company is committed to building a representative workforce and creating a culture of belonging. This role requires English proficiency to facilitate efficient global collaboration and communication.