Google's Site Reliability Engineering (SRE) team is seeking a Senior Systems Engineer to join their technical infrastructure organization. This role combines software and systems engineering to build and maintain Google Cloud's large-scale, distributed systems. As an SRE, you'll be responsible for ensuring the reliability and uptime of both internal and external systems while focusing on optimization, infrastructure development, and automation.
The position requires strong technical leadership skills and deep expertise in distributed systems, with opportunities to work on unique scaling challenges specific to Google Cloud. You'll be part of a diverse team that values intellectual curiosity, problem-solving, and openness, working in a blame-free environment that encourages collaboration and innovation.
Key responsibilities include improving service lifecycles, providing technical guidance to team members, maintaining system health through monitoring and metrics, leading incident response, and driving automation initiatives. You'll also contribute to system design, capacity planning, and launch reviews for new services.
The role offers the chance to work with cutting-edge technology at massive scale, alongside some of the industry's best engineers. You'll be part of the team that builds and maintains the architecture behind Google's entire product portfolio, ensuring users have the best possible experience.
This position is ideal for experienced engineers who are passionate about large-scale systems, have strong leadership capabilities, and want to make a significant impact on technology that serves billions of users. The role offers opportunities for growth, learning, and collaboration in a supportive environment that promotes self-direction and mentorship.