Google Cloud's Site Reliability Engineering (SRE) team is seeking a Senior Software Engineer. This role combines software and systems engineering to build and maintain the large-scale distributed systems that power Google Cloud's services. As an SRE, you'll be responsible for ensuring the reliability and performance of both internal and customer-facing systems.
The position presents challenges of scale unique to Google Cloud, where you'll apply your expertise in coding, algorithms, complexity analysis, and large-scale system design. You'll be involved in the complete service lifecycle, from design and development to deployment and maintenance, with a focus on optimizing existing systems and automating processes.
The role is based in London, UK, and requires five years of software development experience, with particular emphasis on distributed systems. You'll work in a diverse, collaborative environment that values intellectual curiosity and problem-solving. The team promotes self-direction while providing support and mentorship for professional growth.
Key responsibilities include system design consulting, capacity planning, monitoring system health, and implementing automation for scalability. You'll also participate in incident response and blameless postmortems, contributing to the continuous improvement of Google's infrastructure.
This is an opportunity for experienced engineers who are passionate about large-scale systems, automation, and high-reliability services. You'll solve complex technical challenges while contributing to the infrastructure that powers Google's global services.