Senior Software Developer, Site Reliability Engineering

Google is a global technology company that builds and maintains large-scale, massively distributed, fault-tolerant systems.
Site Reliability
Senior Software Engineer
Contact Company
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS · Cloud

Description For Senior Software Developer, Site Reliability Engineering

Site Reliability Development at Google combines software and systems development to build and run large-scale, massively distributed, fault-tolerant systems. As a Senior Software Developer in Site Reliability Engineering for Google Cloud, you'll ensure that Google's services have reliability and uptime appropriate to users' needs, while maintaining a fast rate of improvement. You'll focus on optimizing existing systems, building infrastructure, and eliminating work through automation.

Key responsibilities include:

  • Engaging in the entire lifecycle of services, from design to deployment and refinement
  • Supporting services pre-launch through system design consulting, developing platforms, capacity planning, and launch reviews
  • Maintaining live services by monitoring availability, latency, and system health
  • Scaling systems sustainably through automation
  • Practicing sustainable incident response and blameless postmortems

You'll have the opportunity to manage complex challenges unique to Google's scale, applying your expertise in coding, algorithms, complexity analysis, and large-scale system design. The role requires a blend of software development skills and systems engineering knowledge.

Google's Technical Infrastructure team, which includes Site Reliability Engineering, is crucial in keeping the company's vast array of products and services running smoothly. They pride themselves on being the "engineers' engineers," focusing on building and maintaining the architecture that powers Google's online presence.

This role offers the chance to work in a culture that values diversity, intellectual curiosity, problem-solving, and openness. You'll collaborate with people from various backgrounds in a blame-free environment that encourages big thinking and risk-taking, while providing support and mentorship for continuous learning and growth.

Qualifications:

  • Bachelor's degree in Computer Science or related field (Master's preferred)
  • 5+ years of software development experience
  • 5+ years experience with data structures and algorithms
  • 3+ years experience in designing, analyzing, and troubleshooting large-scale distributed systems
  • 2+ years of experience leading projects and providing technical leadership

Join Google's Site Reliability Development team to tackle exciting challenges at a global scale and contribute to the technology that impacts billions of users worldwide.

Last updated a day ago

Responsibilities For Senior Software Developer, Site Reliability Engineering

  • Engage in and improve the whole lifecycle of services—from inception and design, through to deployment, operation and refinement
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity
  • Practice sustainable incident response and blameless postmortems

Requirements For Senior Software Developer, Site Reliability Engineering

Linux
Kubernetes
  • Bachelor's degree in Computer Science, related field, or equivalent practical experience
  • 5 years of experience with software development in one or more programming languages
  • 5 years of experience with data structures or algorithms
  • 3 years of experience in designing, analyzing, and troubleshooting large-scale distributed systems
  • 2 years of experience leading projects and providing technical leadership

Interested in this job?

Jobs Related To Google Senior Software Developer, Site Reliability Engineering

Site Reliability Engineer - REST API

Apple is hiring a Site Reliability Engineer for their Vision Pro team to support event operations, focusing on API integration and automation.

Senior Site Reliability Engineer

Senior Site Reliability Engineer at Microsoft, ensuring product reliability and solving complex customer issues in Windows services.

Site Reliability Engineer - Video on Demand/Streaming Event Support

Join Apple's Vision Pro team as a Site Reliability Engineer, supporting video on demand and streaming event operations with a focus on automation, monitoring, and innovation.

Senior Site Reliability Engineer - AI Research Clusters

Senior Site Reliability Engineer for AI Research Clusters at NVIDIA, designing and implementing GPU compute clusters for AI research.

Senior Software Engineer, ATS Matrix Site Reliability Engineer

Senior Software Engineer role in Site Reliability Engineering at Google, building and maintaining large-scale distributed systems.