Senior Software Engineer, Site Reliability Engineering, Data Cloud

Google is a global technology company that builds and maintains technical infrastructure powering search, cloud, and various online services.
$150,000 - $250,000
Site Reliability
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS · Cloud

Description For Senior Software Engineer, Site Reliability Engineering, Data Cloud

Google's Site Reliability Engineering (SRE) team is seeking a Senior Software Engineer to join its Data Cloud division. This role combines software and systems engineering to build and maintain large-scale distributed systems. The position focuses on keeping Google Cloud's services reliable and performant while driving continuous improvement.

The ideal candidate will have strong experience in both software development and distributed systems, with the ability to lead projects and provide technical direction. You'll work on optimizing existing systems, building infrastructure, and implementing automation solutions. The role heavily emphasizes AI integration, developing APIs, and creating tools that enhance SRE team capabilities.

Working on Google's Technical Infrastructure team, you'll be at the heart of what makes Google's product portfolio possible. The team is responsible for developing and maintaining data centers, building next-generation platforms, and ensuring networks operate at peak performance. The culture promotes diversity, intellectual curiosity, and problem-solving in a blame-free environment.

Key responsibilities include leading the design and implementation of AI-powered tools, developing APIs for AI functionalities, and implementing production monitoring capabilities. You'll collaborate with SRE teams to improve engineering efficiency and customer satisfaction through innovative solutions like incident-support case matching and bug analysis.

The position offers the opportunity to work with cutting-edge technology while solving unique challenges of scale. Google's commitment to diversity and inclusion, combined with its supportive environment for learning and growth, makes this an ideal role for someone looking to make a significant impact in technical infrastructure and site reliability engineering.

Last updated 4 days ago

Responsibilities For Senior Software Engineer, Site Reliability Engineering, Data Cloud

  • Engage in and improve the whole lifecycle of services, from inception and design, through to deployment, operation and refinement
  • Lead the design, build, and maintenance of the core infrastructure and tools that enable SRE teams to use AI
  • Develop application programming interfaces (APIs) for essential AI functionalities across diverse data sources
  • Collaborate with SRE teams to design, implement, and evaluate AI features, ensuring their quality
  • Develop AI features such as an incident-support case matcher, similarity search, and a bug analyzer
  • Implement production cohorting and regression attribution capabilities
  • Build and expand horizontal cloud monitoring coverage across Google Cloud Platform (GCP)

Requirements For Senior Software Engineer, Site Reliability Engineering, Data Cloud

Python
Java
Go
Kubernetes
  • Bachelor's degree in Computer Science, or a related field, or equivalent practical experience
  • 5 years of experience with software development in one or more programming languages
  • 5 years of experience with data structures or algorithms
  • 3 years of experience in designing, analyzing, and troubleshooting large-scale distributed systems
  • 2 years of experience leading projects and providing technical leadership
  • Experience in software engineering and machine learning (preferred)
  • Experience working in computing, distributed systems, storage, or networking (preferred)
  • Ability to use a systematic problem-solving approach, with excellent communication skills
