Software Engineer III, Site Reliability Engineering, YouTube

YouTube

YouTube is a global video-sharing platform and subsidiary of Google.

Zürich, Switzerland

Site Reliability

Mid-Level Software Engineer

In-Person

2+ years of experience

Enterprise SaaS

Description For Software Engineer III, Site Reliability Engineering, YouTube

Site Reliability Engineering (SRE) at YouTube combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE, you'll be responsible for ensuring Google Cloud's services maintain reliability and appropriate uptime while continuously improving performance. The role involves optimizing existing systems, building infrastructure, and automating processes.

You'll tackle unique scaling challenges specific to Google Cloud, applying your expertise in coding, algorithms, complexity analysis, and large-scale system design. The team values diversity, intellectual curiosity, and problem-solving in a blame-free environment. You'll work on meaningful projects with support and mentorship opportunities for growth.

The Technical Infrastructure team is crucial in maintaining the architecture behind all user-facing services. From data center development to building next-generation Google platforms, this team makes Google's product portfolio possible. The role involves managing project priorities, deadlines, and deliverables while designing, developing, testing, deploying, and enhancing software solutions.

This position offers the opportunity to work with cutting-edge technology, collaborate with talented engineers, and directly impact millions of users worldwide. You'll be part of a team that takes pride in being the engineers' engineers, focusing on maintaining optimal network performance and user experience.

Last updated 8 months ago

Responsibilities For Software Engineer III, Site Reliability Engineering, YouTube

Write product or system development code
Review code developed by other engineers and provide feedback to ensure best practices
Contribute to existing documentation or educational content
Triage product or system issues and debug/track/resolve by analyzing the sources of issues
Participate in, or lead, design reviews with peers and stakeholders

Requirements For Software Engineer III, Site Reliability Engineering, YouTube

Python

Java

Kubernetes

Bachelor's degree in Computer Science, a related field, or equivalent practical experience
2 years of experience with data structures/algorithms and software development
Experience working in computing, distributed systems, storage, or networking
Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
Ability to debug and optimize code and to automate routine tasks
Systematic problem-solving approach, coupled with effective communication skills