Software Engineer III, Site Reliability Engineering, Google Cloud

Google is a global technology company that builds and runs large-scale, distributed systems and services.
Site Reliability
Mid-Level Software Engineer
In-Person
5,000+ Employees
2+ years of experience
Enterprise SaaS · Cloud

Description For Software Engineer III, Site Reliability Engineering, Google Cloud

Site Reliability Engineering (SRE) at Google Cloud combines software and systems engineering to build and maintain large-scale, distributed systems. As an SRE, you'll be responsible for ensuring the reliability and uptime of Google Cloud's services, both internal and customer-facing systems. The role involves complex challenges of scale unique to Google Cloud, requiring expertise in coding, algorithms, complexity analysis, and large-scale system design.

The position offers opportunities to work on meaningful projects in a blame-free environment that values diversity, intellectual curiosity, and problem-solving. You'll be part of a team that promotes self-direction while providing support and mentorship for professional growth. The role involves managing project priorities, deadlines, and deliverables, as well as designing, developing, testing, deploying, maintaining, and enhancing software solutions.

SRE's focus includes optimizing existing systems, building infrastructure, and automating processes to eliminate manual work. You'll be responsible for monitoring system capacity and performance, ensuring services meet customer needs, and maintaining a fast rate of improvement. The role combines technical expertise with collaborative teamwork, where you'll work with people from diverse backgrounds and perspectives.

As an SRE at Google Cloud, you'll contribute to a culture that values openness and collaboration, while tackling some of the most challenging problems in distributed systems. The position offers a unique blend of software engineering and systems operations, making it ideal for those who enjoy both building and maintaining complex technical infrastructure at scale.

Last updated 3 days ago

Responsibilities For Software Engineer III, Site Reliability Engineering, Google Cloud

  • Write product or system development code
  • Review code developed by other engineers and provide feedback to ensure best practices
  • Contribute to existing documentation or educational content
  • Triage product or system issues and debug/track/resolve by analyzing the sources of issues
  • Participate in, or lead design reviews with peers and stakeholders

Requirements For Software Engineer III, Site Reliability Engineering, Google Cloud

Python
Go
Java
Kubernetes
Linux
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 2 years of experience with data structures/algorithms and software development
  • Experience working in computing, distributed systems, storage, or networking
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
  • Ability to debug, optimize code, and to automate routine tasks
  • Systematic problem-solving approach
  • Effective verbal and written communication skills
  • English proficiency

Benefits For Software Engineer III, Site Reliability Engineering, Google Cloud

Medical Insurance
Parental Leave
Equity
  • Equal employment opportunity
  • Accommodation for special needs
  • Global work environment

Interested in this job?

Jobs Related To Google Software Engineer III, Site Reliability Engineering, Google Cloud

Site Reliability Engineer, Publish/Subscribe

Site Reliability Engineer position at Google focusing on large-scale distributed systems and infrastructure reliability for Google Cloud services.

Software Engineer, Traffic Trust SRE, DoS Infrastructure

Site Reliability Engineer position at Google focusing on Traffic Trust and DoS Infrastructure, combining security, distributed systems, and reliability engineering.

Software Engineer III, Site Reliability Engineering, Google Cloud

Site Reliability Engineer position at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Software Engineer III, Site Reliability Engineering

Site Reliability Engineer role at Google focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Software Engineer III, Site Reliability Engineering

Site Reliability Engineer role at Google focusing on building and maintaining large-scale distributed systems with emphasis on reliability, automation, and system optimization.