Senior Software Engineer, Site Reliability Engineering, Data Cloud

Google is a global technology company that builds and maintains large-scale distributed systems and cloud infrastructure.
Site Reliability
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS · Cloud

Description For Senior Software Engineer, Site Reliability Engineering, Data Cloud

Google's Site Reliability Engineering (SRE) team is seeking a Senior Software Engineer to join its Data Cloud division. This role combines software and systems engineering to build and maintain the large-scale distributed systems that power Google Cloud's services. The position sits at the intersection of SRE and AI, where you'll develop solutions to complex infrastructure challenges.

As an SRE, you'll be responsible for ensuring the reliability and performance of both internal and customer-facing systems. The role involves significant work in optimizing existing systems, building infrastructure, and creating automation solutions. You'll be working with cutting-edge AI technologies to improve system behavior analysis and incident response.

The ideal candidate brings strong experience in distributed systems, software development, and technical leadership. You'll be part of a diverse team that values intellectual curiosity and problem-solving in a blame-free environment. The position offers opportunities to work on meaningful projects while receiving support and mentorship for continuous learning and growth.

Key aspects of the role include developing AI-powered tools for system analysis, implementing APIs for AI functionalities, and building monitoring solutions for Google Cloud Platform. You'll collaborate with various teams to design and implement features that enhance engineering efficiency and customer satisfaction.

Google offers a collaborative environment where diversity of thought is celebrated, and innovation is encouraged. You'll have the chance to work with some of the industry's brightest minds while contributing to systems that impact millions of users globally. The role provides an excellent opportunity to grow your technical and leadership skills while working on challenging problems at unprecedented scale.


Responsibilities For Senior Software Engineer, Site Reliability Engineering, Data Cloud

  • Engage in and improve the whole lifecycle of services, from inception and design, through to deployment, operation and refinement
  • Lead the design, build, and maintenance of the core infrastructure and tools that enable SRE teams to use AI
  • Develop application programming interfaces (APIs) for essential AI functionalities across diverse data sources
  • Collaborate with SRE teams to design, implement, and evaluate AI features, ensuring their quality
  • Develop AI features such as an incident-support case matcher, similarity search, and a bug analyzer
  • Implement production cohorting and regression attribution capabilities
  • Build and expand horizontal cloud monitoring coverage across Google Cloud Platform (GCP)

Requirements For Senior Software Engineer, Site Reliability Engineering, Data Cloud

Python
Java
Go
Kubernetes
  • Bachelor's degree in Computer Science or a related field, or equivalent practical experience
  • 5 years of experience with software development in one or more programming languages
  • 5 years of experience with data structures or algorithms
  • 3 years of experience in designing, analyzing, and troubleshooting large-scale distributed systems
  • 2 years of experience leading projects and providing technical leadership
  • Experience in software engineering and machine learning (preferred)
  • Experience working in computing, distributed systems, storage, or networking (preferred)
  • Ability to use a systematic problem-solving approach, with excellent communication skills (preferred)

