Staff Software Engineer, Site Reliability Engineering

Google is a global technology leader that specializes in internet-related services and products.
Site Reliability
Staff Software Engineer
In-Person
5,000+ Employees
8+ years of experience
Enterprise SaaS · Cloud

Description For Staff Software Engineer, Site Reliability Engineering

Site Reliability Engineering (SRE) at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As a Staff Software Engineer in SRE, you'll ensure Google Cloud's services maintain reliability and appropriate uptime while managing complex challenges unique to Google's scale. The role focuses on optimizing existing systems, building infrastructure, and automation. You'll work with algorithms, complexity analysis, and large-scale system design. Google's SRE team values diversity, intellectual curiosity, and problem-solving in a blame-free environment. The position offers opportunities to collaborate with people from diverse backgrounds, think big, and take risks while receiving support and mentorship for professional growth. You'll be responsible for improving critical applications' reliability, leading technical interactions with business partners, and leveraging SRE strategies across Google Cloud Platform. The role requires extensive experience in distributed systems, programming, and enterprise-scale problem-solving, making it ideal for those passionate about maintaining and improving large-scale system reliability.

Last updated 4 days ago

Responsibilities For Staff Software Engineer, Site Reliability Engineering

  • Architect, design, and execute on projects to improve the reliability posture of critical applications
  • Drive technical interactions with business partners to develop innovative ways of improving the reliability of enterprise applications
  • Help the team focus on and choose impactful projects, and deliver them to completion
  • Apply Google Site Reliability Engineering (SRE) strategies across Google Cloud Platform (GCP) and the Google stack
  • Be a technical lead for a team in Bengaluru

Requirements For Staff Software Engineer, Site Reliability Engineering

Go
Java
Python
Linux
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 8 years of experience in algorithms, data structures, analysis, software design or Unix/Linux systems, IP networking, performance and application issues
  • 5 years of experience programming in one or more of the following languages: C, C++, Java, Python, Go, Perl, or Ruby
  • 3 years of experience leading projects and designing, analyzing, and troubleshooting distributed systems
  • Experience in an engineering or operations role in enterprise applications or another large-scale enterprise environment
  • Experience navigating enterprise software and deploying and managing workloads in the cloud
  • Experience in problem-solving and analyzing complex enterprise systems
