Senior Software Engineer, Site Reliability Engineering, Data Cloud

Google is a global technology company that builds and maintains large-scale distributed systems and cloud infrastructure.
Site Reliability
Senior Software Engineer
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS · Cloud

Description For Senior Software Engineer, Site Reliability Engineering, Data Cloud

Google's Site Reliability Engineering (SRE) team is seeking a Senior Software Engineer to join its Data Cloud division. This role combines software and systems engineering to build and maintain the large-scale distributed systems that power Google Cloud's services. The position sits at the intersection of SRE and AI, where you'll develop solutions to complex infrastructure challenges.

As an SRE, you'll be responsible for ensuring the reliability and performance of Google Cloud's critical systems, both internal and customer-facing. The role involves designing and implementing AI-powered tools and features that enhance system monitoring, incident response, and overall service reliability. You'll work with cutting-edge technologies and contribute to the development of APIs that enable AI functionality across diverse data sources.

The position requires a strong background in software development, distributed systems, and algorithms, with at least 5 years of experience. You'll need to demonstrate leadership capabilities, as you'll be guiding projects and providing technical direction to teams. The ideal candidate should have experience with machine learning and a proven track record in troubleshooting large-scale systems.

Google offers a collaborative environment that values diversity, intellectual curiosity, and innovation. You'll be part of a team that encourages risk-taking and self-direction while providing the support and mentorship needed for professional growth. The role presents an excellent opportunity to work on meaningful projects that directly impact Google's global infrastructure and service delivery.

Working in Warsaw, you'll be part of Google's Technical Infrastructure team, which is responsible for building and maintaining the foundation of Google's extensive product portfolio. The position offers the chance to work with some of the most complex and interesting technical challenges in the industry, while contributing to the continuous improvement of Google Cloud Platform's reliability and performance.

Last updated 3 days ago

Responsibilities For Senior Software Engineer, Site Reliability Engineering, Data Cloud

  • Engage in and improve the whole lifecycle of services, from inception and design, through to deployment, operation and refinement
  • Lead the design, build, and maintenance of the core infrastructure and tools that enable SRE teams to use AI
  • Develop application programming interfaces (APIs) for essential AI functionalities across diverse data sources
  • Collaborate with SRE teams to design, implement, and evaluate AI features, ensuring their quality
  • Develop AI features such as an incident-support case matcher, similarity search, and a bug analyzer
  • Implement production cohorting and regression attribution capabilities
  • Build and expand horizontal cloud monitoring coverage across Google Cloud Platform (GCP)

Requirements For Senior Software Engineer, Site Reliability Engineering, Data Cloud

Python · Java · Go
  • Bachelor's degree in Computer Science, or a related field, or equivalent practical experience
  • 5 years of experience with software development in one or more programming languages
  • 5 years of experience with data structures or algorithms
  • 3 years of experience in designing, analyzing, and troubleshooting large-scale distributed systems
  • 2 years of experience leading projects and providing technical leadership
  • Experience in software engineering and machine learning (preferred)
  • Experience working in computing, distributed systems, storage, or networking (preferred)
  • Ability to use a systematic problem-solving approach, with excellent communication skills

