Senior Software Engineer, Site Reliability Engineering, Data Cloud

Google is a global technology company that builds innovative products and services used by billions of people worldwide.
Site Reliability
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
Enterprise SaaS · AI

Description For Senior Software Engineer, Site Reliability Engineering, Data Cloud

Google's Site Reliability Engineering (SRE) team is seeking a Senior Software Engineer to join their Data Cloud division. This role combines software and systems engineering to build and maintain large-scale, distributed systems that power Google Cloud's services. The position offers a unique opportunity to work on complex challenges of scale while leveraging expertise in coding, algorithms, and system design.

The role involves leading the development of AI-powered tools and infrastructure that help SRE teams gain deeper insights into system behavior. You'll be responsible for designing and implementing critical APIs, developing innovative AI features for incident support, and building robust monitoring solutions for the Google Cloud Platform.

The ideal candidate brings strong experience in distributed systems, software development, and technical leadership. You'll work in a diverse, collaborative environment that encourages intellectual curiosity and problem-solving. The role offers the chance to impact Google's global infrastructure while working with cutting-edge technologies and brilliant colleagues.

As part of Google's Technical Infrastructure team, you'll be at the heart of making Google's product portfolio possible. The team takes pride in being the engineers' engineers, focusing on building and maintaining data centers and next-generation platforms. The position offers excellent growth opportunities and the chance to work on meaningful projects that affect billions of users worldwide.

The role combines technical expertise with leadership responsibilities, requiring both deep technical knowledge and the ability to guide teams and projects. You'll be part of a culture that values diversity, openness, and collaboration, with access to Google's vast resources and the opportunity to solve some of the most interesting challenges in large-scale computing.

Last updated 21 hours ago

Responsibilities For Senior Software Engineer, Site Reliability Engineering, Data Cloud

  • Engage in and improve the whole lifecycle of services, from inception and design, through to deployment, operation and refinement
  • Lead the forefront of design, build and maintain the core infrastructure and tools that empower SRE teams to leverage the power of AI
  • Develop APIs for essential AI functionalities across diverse data sources
  • Collaborate with SRE teams to design, implement, and evaluate AI features
  • Develop AI features like incident-support case matcher, similarity search, bug analyzer
  • Implement production cohorting and regression attribution capabilities
  • Build and expand horizontal cloud monitoring coverage across Google Cloud Platform (GCP)

Requirements For Senior Software Engineer, Site Reliability Engineering, Data Cloud

Kubernetes
Linux
  • Bachelor's degree in Computer Science, or a related field, or equivalent practical experience
  • 5 years of experience with software development in one or more programming languages
  • 5 years of experience with data structures or algorithms
  • 3 years of experience in designing, analyzing, and troubleshooting large-scale distributed systems
  • 2 years of experience leading projects and providing technical leadership
  • Experience in computing, distributed systems, storage, or networking
  • Ability to use a systematic problem-solving approach
  • Excellent verbal and written communication skills
  • English proficiency

Interested in this job?

Jobs Related To Google Senior Software Engineer, Site Reliability Engineering, Data Cloud

Senior Software Developer, Site Reliability Engineering, Google Cloud

Senior SRE role at Google Cloud focusing on building and maintaining large-scale distributed systems with competitive compensation and comprehensive benefits.

Senior Software Engineer, ATS Matrix Site Reliability Engineer

Senior SRE position at Google focusing on building and maintaining large-scale distributed systems for Google Cloud services.

Senior Software Engineer, Site Reliability Engineering, Google Cloud

Senior SRE position at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Senior Software Engineer, Site Reliability Engineering, Google Cloud

Senior SRE position at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Senior Software Engineer, Site Reliability Engineering

Senior SRE position at Google focusing on building and maintaining large-scale distributed systems, combining software development and systems engineering expertise.