Google's Site Reliability Engineering (SRE) team is seeking a Senior Software Engineer to join their Data Cloud division. This role combines software and systems engineering to build and maintain large-scale, distributed systems. The position focuses on ensuring Google Cloud's services maintain optimal reliability and performance while driving continuous improvement.
The ideal candidate will have strong experience in both software development and distributed systems, with the ability to lead projects and provide technical direction. You'll work on optimizing existing systems, building infrastructure, and implementing automation solutions. The role heavily emphasizes AI integration, developing APIs, and creating tools that enhance SRE team capabilities.
Working at Google's Technical Infrastructure team, you'll be at the heart of what makes Google's product portfolio possible. The team is responsible for developing and maintaining data centers, building next-generation platforms, and ensuring networks operate at peak performance. The culture promotes diversity, intellectual curiosity, and problem-solving in a blame-free environment.
Key responsibilities include leading the design and implementation of AI-powered tools, developing APIs for AI functionalities, and implementing production monitoring capabilities. You'll collaborate with SRE teams to improve engineering efficiency and customer satisfaction through innovative solutions like incident-support case matching and bug analysis.
The position offers the opportunity to work with cutting-edge technology while solving unique challenges of scale. Google's commitment to diversity and inclusion, combined with its supportive environment for learning and growth, makes this an ideal role for someone looking to make a significant impact in technical infrastructure and site reliability engineering.