Senior Software Engineer, Site Reliability Engineering, Data Cloud

Google is a global technology company that builds innovative products and services used by billions of people worldwide.
Site Reliability
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS · Cloud

Description For Senior Software Engineer, Site Reliability Engineering, Data Cloud

Google's Site Reliability Engineering (SRE) team is seeking a Senior Software Engineer to join their Data Cloud division. This role combines software and systems engineering to build and maintain large-scale, distributed systems that power Google Cloud's services. The position offers a unique opportunity to work at the intersection of SRE and AI, where you'll be responsible for developing and maintaining critical infrastructure that ensures reliability and performance at massive scale.

The role involves leading the development of AI-powered tools and systems that help SRE teams better understand and manage system behavior. You'll be working on cutting-edge projects that involve building APIs for AI functionalities, implementing intelligent monitoring systems, and developing sophisticated analysis tools for production environments.

As a Senior SRE, you'll be part of Google's Technical Infrastructure team, where you'll collaborate with world-class engineers to solve complex challenges of scale. The team culture emphasizes diversity, intellectual curiosity, and problem-solving in a blame-free environment. You'll have the opportunity to work on meaningful projects while receiving support and mentorship to grow your career.

The ideal candidate brings strong software development experience, expertise in distributed systems, and leadership capabilities. You'll be working in Warsaw, Poland, contributing to Google's global infrastructure while being part of a team that values innovation and technical excellence. This role offers the chance to impact billions of users while working with cutting-edge technology and contributing to the evolution of Google's cloud infrastructure.

If you're passionate about reliability, scalability, and using AI to solve complex engineering challenges, this role offers an exceptional opportunity to work on some of the most sophisticated systems in the industry. You'll be at the forefront of combining SRE principles with AI capabilities, helping to shape the future of Google's infrastructure.

Last updated 7 days ago

Responsibilities For Senior Software Engineer, Site Reliability Engineering, Data Cloud

  • Engage in and improve the whole lifecycle of services, from inception and design, through to deployment, operation and refinement
  • Lead the forefront of design, build and maintain the core infrastructure and tools that empower SRE teams to leverage the power of AI
  • Develop APIs for essential AI functionalities across diverse data sources
  • Collaborate with SRE teams to design, implement, and evaluate AI features
  • Develop AI features like incident-support case matcher, similarity search, bug analyzer
  • Implement production cohorting and regression attribution capabilities
  • Build and expand horizontal cloud monitoring coverage across Google Cloud Platform (GCP)

Requirements For Senior Software Engineer, Site Reliability Engineering, Data Cloud

Python
Java
Go
Kubernetes
  • Bachelor's degree in Computer Science, or a related field, or equivalent practical experience
  • 5 years of experience with software development in one or more programming languages
  • 5 years of experience with data structures or algorithms
  • 3 years of experience in designing, analyzing, and troubleshooting large-scale distributed systems
  • 2 years of experience leading projects and providing technical leadership
  • Experience in software engineering and machine learning (preferred)
  • Experience working in computing, distributed systems, storage, or networking (preferred)
  • Ability to use a systematic problem-solving approach
  • Excellent verbal and written communication skills

Interested in this job?

Jobs Related To Google Senior Software Engineer, Site Reliability Engineering, Data Cloud

Senior Software Developer, Site Reliability Engineering, Google Cloud

Senior SRE role at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability and scalability.

Senior Software Engineer, Site Reliability Engineering, Data Cloud

Senior SRE position at Google focusing on AI-powered infrastructure and tools for cloud services, requiring expertise in distributed systems and technical leadership.

Senior Software Engineer, Site Reliability Engineering, Google Cloud

Senior SRE position at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Senior Software Engineer, Site Reliability Engineering

Senior SRE position at Google focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Senior Software Engineer, Site Reliability Engineering, Google Cloud

Senior SRE position at Google Cloud focusing on building and maintaining large-scale distributed systems, requiring 5 years of software development experience.