Systems Engineer III, Site Reliability Engineering

Google is a global technology leader that develops innovative products and services used by billions of people worldwide.
Site Reliability
Mid-Level Software Engineer
Contact Company
5,000+ Employees
2+ years of experience
Enterprise SaaS

Description For Systems Engineer III, Site Reliability Engineering

Google's Site Reliability Engineering (SRE) team combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE, you'll ensure Google Cloud's services maintain reliability and appropriate uptime while focusing on system optimization and automation. You'll tackle unique scaling challenges, work with cutting-edge technology, and join a diverse culture that values intellectual curiosity and problem-solving. The role involves managing complex distributed systems, developing automation solutions, and maintaining critical infrastructure that powers Google's vast product portfolio. You'll collaborate with a diverse team, participate in blameless postmortems, and have opportunities for continuous learning and growth in an environment that supports mentorship and self-direction.

Last updated a month ago

Responsibilities For Systems Engineer III, Site Reliability Engineering

  • Improve the whole lifecycle of services from inception and design, through deployment, operation, and refinement
  • Manage support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews
  • Provide guidance to other team members on managing availability and performance of mission critical services
  • Maintain services once they are live by measuring and monitoring availability, latency, and overall system health
  • Scale systems sustainably through mechanisms like automation and evolve systems by driving changes that improve reliability and velocity

Requirements For Systems Engineer III, Site Reliability Engineering

Linux
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 2 years of experience with programming in one or more programming languages
  • 2 years of experience working with Unix/Linux systems internals and administration or networking
  • Experience working in computing, distributed systems, storage, or networking
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
  • Ability to debug, optimize code, and to automate routine tasks
  • Systematic problem-solving approach with effective verbal and written communication skills

Interested in this job?

Jobs Related To Google Systems Engineer III, Site Reliability Engineering

Software Developer III, Site Reliability Development, Google Cloud

Site Reliability Developer role at Google Cloud focusing on building and maintaining large-scale distributed systems with competitive compensation and benefits.

Software Engineer III, Shopping Build Site Reliability Engineer

Site Reliability Engineer role at Google focusing on Shopping Build infrastructure, requiring distributed systems expertise and 2+ years of software development experience.

Software Engineer III, Google Cloud, Site Reliability Engineering

Site Reliability Engineer role at Google Cloud focusing on building and maintaining large-scale distributed systems with opportunities for technical growth and impact.

Software Engineer III, Site Reliability Engineering, Google Cloud

Site Reliability Engineer position at Google Cloud focusing on maintaining and optimizing large-scale distributed systems with opportunities for automation and infrastructure development.

Software Engineer III, Site Reliability Engineering, Google Cloud

Site Reliability Engineer role at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability and performance optimization.