Senior Software Engineer, Site Reliability Engineering, Data Cloud

Google is a global technology company that builds and maintains technical infrastructure powering user experiences through data centers and platforms.
$150,000 - $250,000
Site Reliability
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS · Cloud

Description For Senior Software Engineer, Site Reliability Engineering, Data Cloud

Google's Site Reliability Engineering (SRE) team is seeking a Senior Software Engineer to join their Data Cloud division. This role combines software and systems engineering to build and maintain large-scale, distributed systems that power Google Cloud's services. The position offers a unique opportunity to work on complex scalability challenges while leveraging AI and machine learning capabilities.

The role involves leading the development of core infrastructure and tools that enable SRE teams to harness AI power for system behavior insights. You'll be responsible for designing and implementing AI features that enhance engineering efficiency and customer satisfaction, such as incident-support case matching, similarity search, and bug analysis.

As part of Google's Technical Infrastructure team, you'll work behind the scenes to maintain and develop data centers and next-generation Google platforms. The team takes pride in being "engineers' engineers" and focuses on keeping networks running optimally for the best user experience.

The position offers exposure to cutting-edge technology and the chance to work with diverse, intellectually curious professionals in a blame-free environment that encourages collaboration and risk-taking. Google promotes self-direction on meaningful projects while providing support and mentorship for continuous learning and growth.

This role is perfect for candidates with strong software development experience and expertise in distributed systems who want to impact global-scale infrastructure. You'll join a culture that values diversity, problem-solving, and openness, working on projects that directly influence Google's product portfolio reliability and performance.

The ideal candidate will have experience in software engineering and machine learning, with a proven track record in designing and troubleshooting large-scale distributed systems. Strong communication skills and a systematic approach to problem-solving are essential, as you'll be collaborating with teams across Google to ensure service reliability and continuous improvement.

Last updated 5 hours ago

Responsibilities For Senior Software Engineer, Site Reliability Engineering, Data Cloud

  • Engage in and improve the whole lifecycle of services, from inception and design, through to deployment, operation and refinement
  • Lead the design, build and maintain the core infrastructure and tools that empower SRE teams to leverage AI
  • Develop APIs for essential AI functionalities across diverse data sources
  • Develop AI features like incident-support case matcher, similarity search, bug analyzer
  • Implement production cohorting and regression attribution capabilities
  • Build and expand horizontal cloud monitoring coverage across Google Cloud Platform (GCP)

Requirements For Senior Software Engineer, Site Reliability Engineering, Data Cloud

Python
Java
Go
Kubernetes
  • Bachelor's degree in Computer Science, or a related field, or equivalent practical experience
  • 5 years of experience with software development in one or more programming languages
  • 5 years of experience with data structures or algorithms
  • 3 years of experience in designing, analyzing, and troubleshooting large-scale distributed systems
  • 2 years of experience leading projects and providing technical leadership
  • Experience in software engineering and machine learning
  • Experience working in computing, distributed systems, storage, or networking
  • Ability to use systematic problem-solving approach with excellent communication skills

Benefits For Senior Software Engineer, Site Reliability Engineering, Data Cloud

Medical Insurance
Vision Insurance
Dental Insurance
Parental Leave
  • Comprehensive health coverage
  • Parental leave benefits
  • Equal employment opportunity
  • Inclusive work environment

Interested in this job?

Jobs Related To Google Senior Software Engineer, Site Reliability Engineering, Data Cloud

Senior Software Developer, Site Reliability Engineering, Google Cloud

Senior SRE position at Google Cloud focusing on building and maintaining large-scale distributed systems with competitive compensation and opportunities for technical leadership.

Senior Software Engineer, Site Reliability Engineering, Google Cloud

Senior SRE position at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability and scalability.

Senior Software Engineer, Site Reliability Engineering, Google Cloud

Senior SRE position at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Senior Software Engineer, Site Reliability Engineering

Senior SRE position at Google focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Senior Software Engineer, Site Reliability Engineering, Google Cloud

Senior SRE position at Google Cloud, focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.