Site Reliability Engineering, Transformative Compute Site Reliability Engineering

Google is a global technology leader that specializes in internet-related services and products, including search, cloud computing, software, and hardware.
Site Reliability
Mid-Level Software Engineer
Contact Company
5+ years of experience
Enterprise SaaS · Cloud

Description For Site Reliability Engineering, Transformative Compute Site Reliability Engineering

Site Reliability Engineering (SRE) at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE, you'll ensure Google Cloud's services maintain reliability and appropriate uptime while managing performance and capacity. The role focuses on optimizing existing systems, building infrastructure, and automation. You'll tackle unique scaling challenges specific to Google Cloud, applying expertise in coding, algorithms, and large-scale system design. The team values diversity, intellectual curiosity, and problem-solving in a blame-free environment. You'll collaborate with professionals from various backgrounds, taking on meaningful projects with support and mentorship for growth. The position involves managing project priorities, deadlines, and deliverables, while designing, developing, testing, deploying, maintaining, and enhancing software solutions. Google's SRE culture promotes self-direction and innovation, making it an ideal environment for those passionate about large-scale distributed systems and infrastructure.

Last updated 22 days ago

Responsibilities For Site Reliability Engineering, Transformative Compute Site Reliability Engineering

  • Work on the availability, scalability, efficiency, and latency of some of Google Cloud's most critical services
  • Work with partner Development and SRE teams to design and deliver different programs and projects in a scalable, reliable, and secure manner (Cloud Capacity fungibility, Trusted Private Cloud, Convergence, Butter, etc.)
  • Design and develop innovative solutions that enable key Google initiatives that scale with the requirements of the business
  • Provide oncall and on-duty excellence; driving problems towards sustainable, long-term solutions
  • Contribute to continuous service improvement

Requirements For Site Reliability Engineering, Transformative Compute Site Reliability Engineering

Linux
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 5 years of experience building and developing infrastructure, distributed systems or networks, or experience with compute technologies, storage, or hardware architecture
  • 2 years of experience with data structures/algorithms and software development in one or more programming languages
  • Experience with debugging and Linux
  • Experience working in computing, distributed systems, storage, or networking
  • Systematic problem-solving approach, coupled with effective verbal and written communication skills
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems

Interested in this job?

Jobs Related To Google Site Reliability Engineering, Transformative Compute Site Reliability Engineering

Software Engineer, Traffic Trust SRE, DoS Infrastructure

Site Reliability Engineer position at Google focusing on Traffic Trust and DoS Infrastructure, combining software engineering with systems operations to maintain large-scale distributed systems.

Software Engineer III, Site Reliability Engineer

Site Reliability Engineer role at Google focusing on building and maintaining large-scale distributed systems for Google Cloud services.

Databases Site Reliability Engineer

Site Reliability Engineer position at Google focusing on database systems, requiring expertise in distributed systems and infrastructure management.

Software Engineer III, Site Reliability Engineering

Site Reliability Engineer role at Google focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Software Engineer III, Site Reliability Engineering, Google Cloud

Site Reliability Engineer role at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.