Senior Software Engineer, Site Reliability Engineering, Data Cloud

Google is a global technology company that builds innovative products and services used by billions of people worldwide.
Site Reliability
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
Enterprise SaaS · AI

Description For Senior Software Engineer, Site Reliability Engineering, Data Cloud

Google's Site Reliability Engineering (SRE) team is seeking a Senior Software Engineer to join its Data Cloud division. This role combines software and systems engineering to build and maintain large-scale, distributed systems that power Google Cloud's services. The position offers an opportunity to work on complex challenges of scale while applying expertise in coding, algorithms, and system design.

The role involves leading the development of AI-powered tools and infrastructure that help SRE teams gain deeper insights into system behavior. You'll be responsible for designing and implementing critical APIs, developing AI features for improved engineering efficiency, and building monitoring capabilities across the Google Cloud Platform.

The ideal candidate brings strong experience in software development, distributed systems, and technical leadership. You'll work in an environment that values diversity, intellectual curiosity, and problem-solving, while collaborating with teams across Google's technical infrastructure.

This position offers the opportunity to impact Google's global infrastructure, working with cutting-edge technologies and contributing to systems that serve billions of users. You'll be part of a team that promotes self-direction and provides strong support for professional growth and learning.

The role combines technical expertise with leadership responsibilities, requiring both hands-on development skills and the ability to guide projects and teams. You'll work at the intersection of Site Reliability Engineering and AI, helping to shape the future of Google's cloud infrastructure while solving complex technical challenges at unprecedented scale.


Responsibilities For Senior Software Engineer, Site Reliability Engineering, Data Cloud

  • Engage in and improve the whole lifecycle of services, from inception and design, through to deployment, operation and refinement
  • Lead the design, build, and maintenance of the core infrastructure and tools that enable SRE teams to use AI
  • Develop APIs for essential AI functionalities across diverse data sources
  • Collaborate with SRE teams to design, implement, and evaluate AI features
  • Develop AI features such as an incident-support case matcher, similarity search, and a bug analyzer
  • Implement production cohorting and regression attribution capabilities
  • Build and expand horizontal cloud monitoring coverage across Google Cloud Platform (GCP)

Requirements For Senior Software Engineer, Site Reliability Engineering, Data Cloud

Skills: Python, Java, Go
  • Bachelor's degree in Computer Science, or a related field, or equivalent practical experience
  • 5 years of experience with software development in one or more programming languages
  • 5 years of experience with data structures or algorithms
  • 3 years of experience in designing, analyzing, and troubleshooting large-scale distributed systems
  • 2 years of experience leading projects and providing technical leadership
  • Experience in computing, distributed systems, storage, or networking
  • Ability to use a systematic problem-solving approach
  • Excellent verbal and written communication skills
  • English proficiency
