Senior Software Developer, Site Reliability Engineering

Google is a global technology company that builds and maintains large-scale, massively distributed, fault-tolerant systems.
Site Reliability
Senior Software Engineer
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS · Cloud

Description For Senior Software Developer, Site Reliability Engineering

Site Reliability Engineering at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As a Senior Software Developer in Site Reliability Engineering for Google Cloud, you'll ensure that Google's services have reliability and uptime appropriate to users' needs, while maintaining a fast rate of improvement. You'll focus on optimizing existing systems, building infrastructure, and eliminating manual work through automation.

Key responsibilities include:

  • Engaging in the entire lifecycle of services, from design to deployment and refinement
  • Supporting services pre-launch through system design consulting, developing platforms, capacity planning, and launch reviews
  • Maintaining live services by monitoring availability, latency, and system health
  • Scaling systems sustainably through automation
  • Practicing sustainable incident response and blameless postmortems

You'll have the opportunity to manage complex challenges unique to Google's scale, applying your expertise in coding, algorithms, complexity analysis, and large-scale system design. The role requires a blend of software development skills and systems engineering knowledge.

Google's Technical Infrastructure team, which includes Site Reliability Engineering, is crucial in keeping the company's vast array of products and services running smoothly. They pride themselves on being the "engineers' engineers," focusing on building and maintaining the architecture that powers Google's online presence.

This role offers the chance to work in a culture that values diversity, intellectual curiosity, problem-solving, and openness. You'll collaborate with people from various backgrounds in a blame-free environment that encourages big thinking and risk-taking, while providing support and mentorship for continuous learning and growth.

Qualifications:

  • Bachelor's degree in Computer Science or related field (Master's preferred)
  • 5+ years of software development experience
  • 5+ years of experience with data structures and algorithms
  • 3+ years of experience designing, analyzing, and troubleshooting large-scale distributed systems
  • 2+ years of experience leading projects and providing technical leadership

Join Google's Site Reliability Engineering team to tackle exciting challenges at a global scale and contribute to the infrastructure that powers some of the world's most widely used technologies.

Last updated 2 months ago

Responsibilities For Senior Software Developer, Site Reliability Engineering

  • Engage in and improve the whole lifecycle of services—from inception and design, through to deployment, operation and refinement
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity
  • Practice sustainable incident response and blameless postmortems

Requirements For Senior Software Developer, Site Reliability Engineering

Linux
Kubernetes
  • Bachelor's degree in Computer Science, related field, or equivalent practical experience
  • 5 years of experience with software development in one or more programming languages
  • 5 years of experience with data structures or algorithms
  • 3 years of experience in designing, analyzing, and troubleshooting large-scale distributed systems
  • 2 years of experience leading projects and providing technical leadership

