Site Reliability Engineer, Publish/Subscribe

Google is a global technology leader that develops innovative products and services used by billions of people.
Site Reliability
Mid-Level Software Engineer
In-Person
5,000+ Employees
2+ years of experience
Enterprise SaaS · Cloud

Description For Site Reliability Engineer, Publish/Subscribe

Site Reliability Engineering (SRE) at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE, you'll ensure that Google Cloud's services remain reliable with appropriate uptime while focusing on system capacity and performance optimization. The role involves handling complex challenges unique to Google Cloud's scale, using expertise in coding, algorithms, and large-scale system design.

The Technical Infrastructure team is responsible for the architecture behind all Google products, from data centers to next-generation platforms. You'll review code, contribute to documentation, triage system issues, and participate in design reviews. You'll work in a diverse, collaborative environment that values intellectual curiosity and problem-solving, with opportunities for growth through mentorship and hands-on experience with Google's infrastructure. The SRE culture promotes self-direction while tackling projects that directly affect Google's global user base.

Last updated 3 days ago

Responsibilities For Site Reliability Engineer, Publish/Subscribe

  • Review code developed by other engineers and provide feedback to ensure best practices
  • Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback
  • Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on hardware, network, or service operations and quality
  • Participate in, or lead, design reviews with peers and stakeholders to decide on available technologies

Requirements For Site Reliability Engineer, Publish/Subscribe

Java · Python · Go
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 2 years of experience with data structures/algorithms and software development in one or more programming languages (e.g., Java, Python, Go, C, C++)
  • Experience working in computing, distributed systems, storage, or networking
  • Experience in designing, analyzing, and troubleshooting distributed systems
  • Ability to debug and optimize code and to automate routine tasks
