CDN Site Reliability Engineer (SRE) L4/L5

Netflix is one of the world's leading entertainment services with 278 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages.
$100,000 - $720,000
Site Reliability
Senior Software Engineer
Remote
5,000+ Employees
3+ years of experience
Entertainment · Streaming · Technology

Description For CDN Site Reliability Engineer (SRE) L4/L5

Netflix is seeking a CDN Site Reliability Engineer (SRE) L4/L5 to join their Engineering team. This role involves designing, scaling, operating, automating, and analyzing Netflix's globally distributed Content Delivery Network (CDN). The ideal candidate will have 3+ years of Service Reliability/Operational experience, strong knowledge of networking concepts, and expertise in Unix/Linux systems. Responsibilities include driving improvements in resiliency, observability, and automation, analyzing performance data, and providing technical assistance to ISP partners. The role offers the opportunity to work on Netflix's Open Connect CDN, which is responsible for delivering 100% of Netflix's video traffic worldwide. This position combines technical challenges with the exciting mission of entertaining millions of people globally. Netflix offers a unique culture, values diversity, and provides competitive compensation based on skills and experience.

Last updated 4 hours ago

Responsibilities For CDN Site Reliability Engineer (SRE) L4/L5

  • Drive continual improvement in resiliency, observability, monitoring, instrumentation, and automation of the CDN platform
  • Aggregate, analyze, and correlate large amounts of server and application performance data
  • Provide technical design and engineering assistance to ISP partners to integrate Open Connect Appliances
  • Handle Tier 3 escalation and participate in an on-call rotation for CDN platform production issues

Requirements For CDN Site Reliability Engineer (SRE) L4/L5

Linux
Python
Kubernetes
  • 3+ years Service Reliability/Operational experience running large scale, high performance systems & internet services
  • Strong working knowledge of networking concepts and application protocols (TCP/IP, BGP, DNS, TLS, HTTP/S)
  • Skilled in designing, creating and maintaining automation written in a programming language such as Python
  • Expert-level knowledge managing and debugging Unix/Linux systems at scale
  • Experience with distributed analytic processing technologies (Hive, Presto/Trino, Spark SQL, etc)
  • Strong understanding of applied statistics and ability to code systems that identify outlier behavior
  • Some experience with container and container orchestration technologies (Docker, Kubernetes)
  • Ability to work in a highly collaborative environment and communicate cross-functionally

Interested in this job?

Jobs Related To Netflix CDN Site Reliability Engineer (SRE) L4/L5

Site Reliability Engineer L4/L5 - Live Streaming Pipeline

Netflix is hiring a Senior Site Reliability Engineer for their Live Streaming Pipeline, offering remote work and competitive compensation.

Site Reliability Engineer - REST API

Apple is hiring a Site Reliability Engineer for their Vision Pro team to support event operations, focusing on API integration and automation.

Senior Site Reliability Engineer

Senior Site Reliability Engineer at Microsoft, ensuring product reliability and solving complex customer issues in Windows services.