System Engineer, Site Reliability Engineering

Google is a global technology company that builds and maintains large-scale, distributed systems and infrastructure.
Site Reliability
Mid-Level Software Engineer
In-Person
2+ years of experience
Enterprise SaaS

Description For System Engineer, Site Reliability Engineering

Google's Site Reliability Engineering (SRE) team is seeking a System Engineer to join their dynamic organization that combines software and systems engineering to build and maintain large-scale, distributed systems. This role offers a unique opportunity to work on complex challenges of scale specific to Google while utilizing expertise in coding, algorithms, and system design.

The position involves ensuring Google's services maintain optimal reliability and performance, with a focus on both internal and external systems. As an SRE, you'll be responsible for managing system capacity, optimizing existing infrastructure, and implementing automation to eliminate manual work. The role requires strong technical skills in debugging, system optimization, and enterprise-level troubleshooting.

Working in a collaborative environment, you'll interact with teams across India and the US, contributing to critical infrastructure projects and participating in on-call rotations. The role offers exposure to Google Cloud Platform (GCP) and the Google stack, where you'll implement reliability strategies and improve operational efficiency.

Google's SRE culture emphasizes diversity, intellectual curiosity, and problem-solving in a blame-free environment. The organization brings together professionals from various backgrounds and perspectives, encouraging collaboration and innovation. You'll have the opportunity to work on meaningful projects while receiving support and mentorship for professional growth.

The Technical Infrastructure team, which you'll be part of, is fundamental to Google's product portfolio, handling everything from data center development to next-generation platform building. The role combines hands-on technical work with strategic thinking, requiring both engineering expertise and communication skills to work effectively with business partners and cross-functional teams.

This position offers the chance to work at the forefront of large-scale system reliability, contributing to the infrastructure that powers Google's global services while developing expertise in cutting-edge technologies and practices.


Responsibilities For System Engineer, Site Reliability Engineering

  • Design, code and execute on projects to improve the reliability posture of critical enterprise applications
  • Significantly reduce operational work for our footprint on Google Cloud Platform (GCP) and the Google stack, and leverage Google Site Reliability Engineering (SRE) reliability strategies
  • Drive technical interactions with business partners to develop innovative ideas for improving the reliability of enterprise applications
  • Foster communication across India, US and business partner teams and collaborate with the US SRE team to support Corp Engineering services
  • Work with other engineering teams to ensure that Google's infrastructure is reliable, scalable, and secure
  • Participate in the team's on-call rotation

Requirements For System Engineer, Site Reliability Engineering

Linux
  • Bachelor's degree in Computer Science, or in a related technical field, or equivalent practical experience
  • 2 years of experience with data structures/algorithms and software development in one or more programming languages
  • Experience with Unix/Linux operating system internals and administration, or with networking and debugging/troubleshooting
  • Experience analyzing and troubleshooting large-scale, complex enterprise systems
  • Experience navigating enterprise software and deploying and managing workloads on the Google stack or Cloud
  • Ability to debug, optimize code, and to automate routine tasks


Jobs Related To Google System Engineer, Site Reliability Engineering

Software Engineer III, Site Reliability Engineering

Site Reliability Engineer position at Google focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Software Engineer III, Site Reliability Engineering

Site Reliability Engineer role at Google focusing on building and maintaining large-scale distributed systems with emphasis on reliability, automation, and system optimization.

Software Engineer III, Site Reliability Engineering, Google Cloud

Site Reliability Engineer position at Google Cloud focusing on building and maintaining large-scale distributed systems with competitive compensation and benefits.

Site Reliability Engineer, Publish/Subscribe

Site Reliability Engineer position at Google focusing on large-scale distributed systems and infrastructure reliability for Google Cloud services.

Software Engineer, Traffic Trust SRE, DoS Infrastructure

Site Reliability Engineer position at Google focusing on Traffic Trust and DoS Infrastructure, combining security, distributed systems, and reliability engineering.