Databases Site Reliability Engineer

Google is a global technology leader that develops innovative products and services used by billions of people worldwide.
Site Reliability
Mid-Level Software Engineer
In-Person
5,000+ Employees
2+ years of experience
Enterprise SaaS

Description For Databases Site Reliability Engineer

Site Reliability Engineering (SRE) at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE, you'll ensure Google's services maintain reliability and appropriate uptime while monitoring system capacity and performance. The role focuses on optimizing existing systems, building infrastructure, and automation.

The Technical Infrastructure team is responsible for the architecture that powers Google's product portfolio. From developing and maintaining data centers to building next-generation Google platforms, this team makes Google's products possible. The team takes pride in being engineers' engineers and maintains networks for optimal user experience.

Working with Google Cloud Platform's Spanner database, you'll collaborate with various teams to ensure system manageability and efficiency. The role involves project planning and execution for improved reliability, participating in on-call rotations, and managing GCP Spanner allocations.

Google offers a diverse and inclusive environment where intellectual curiosity, problem-solving, and openness are valued. The company brings together people with varied backgrounds and perspectives, encouraging collaboration and innovation in a blame-free environment. You'll have opportunities to work on meaningful projects while receiving support and mentorship for professional growth.

Join a team that's at the forefront of large-scale system design and maintenance, where your expertise in coding, algorithms, and complexity analysis will be put to use in solving unique challenges at Google's scale.

Last updated a month ago

Responsibilities For Databases Site Reliability Engineer

  • Collaborate with other teams and the Cloud Support organization to ensure Spanner is easy to manage and meets customers' needs with minimal operational load
  • Plan and execute projects that improve reliability or efficiency
  • Participate in an on-call rotation as required
  • Manage the responsibilities of the GCP Spanner allocations

Requirements For Databases Site Reliability Engineer

Linux
  • Bachelor's degree or equivalent practical experience
  • 2 years of experience with programming in one or more programming languages
  • Experience with Unix/Linux operating systems internals and administration or networking
  • Experience with Site Reliability Engineering, System Design, and Distributed Computing (preferred)
  • Experience delivering projects in systems (preferred)
  • Excellent influencing skills (preferred)

Benefits For Databases Site Reliability Engineer

Medical Insurance
Parental Leave
  • Equal opportunity employer
  • Accommodation for special needs

Interested in this job?

Jobs Related To Google Databases Site Reliability Engineer

Software Engineer, Traffic Trust SRE, DoS Infrastructure

Site Reliability Engineer position at Google focusing on Traffic Trust and DoS Infrastructure, combining software engineering with systems operations to maintain large-scale distributed systems.

Software Engineer III, Site Reliability Engineer

Site Reliability Engineer role at Google focusing on building and maintaining large-scale distributed systems for Google Cloud services.

Databases Site Reliability Engineer

Site Reliability Engineer position at Google focusing on database systems, requiring expertise in distributed systems and infrastructure management.

Software Engineer III, Site Reliability Engineering

Site Reliability Engineer role at Google focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Software Engineer III, Site Reliability Engineering, Google Cloud

Site Reliability Engineer role at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.