Staff Site Reliability Engineer

Building a better web by making it easier to build, deploy, and scale web applications through unified web development tools and services.
$96,000 - $130,000
Site Reliability
Staff Software Engineer
Remote
501 - 1,000 Employees
2+ years of experience
Enterprise SaaS

Description For Staff Site Reliability Engineer

Netlify, a Series D company that has raised over $200M, is revolutionizing web development by unifying the ecosystem of development tools and services. As a Staff Site Reliability Engineer, you'll be instrumental in scaling their infrastructure to meet the demands of over 4 million web developers and businesses.

The role combines technical leadership with hands-on engineering, requiring expertise in cloud architecture, CI/CD pipelines, and database management. You'll be responsible for championing architectural vision, fostering cross-organizational reliability initiatives, and mentoring senior engineers. The position demands strong experience with tools like Kafka, configuration management systems, and programming languages such as Python and Go.

This is an exceptional opportunity for an experienced SRE leader who wants to make a significant impact on the future of web development. You'll work in a remote-first, globally distributed environment that values asynchronous communication and documentation. The company culture emphasizes diversity, inclusion, and work-life balance, making it an ideal place for both career growth and personal development.

The role offers competitive compensation (£96,000 - £130,000 for UK-based locations), equity participation, and the chance to work with cutting-edge technology. You'll be joining a company backed by prestigious investors like Andreessen Horowitz and Kleiner Perkins, with a clear mission to build a better web.

The ideal candidate will bring at least two years of leadership experience in complex technical projects, deep expertise in cloud architecture, and exceptional communication skills. You'll be working with teams across the organization to implement large-scale infrastructure improvements and standardize SRE practices, making this an excellent opportunity for someone who wants to drive technical excellence at scale.

Last updated an hour ago

Responsibilities For Staff Site Reliability Engineer

  • Champion architectural vision and technical strategy for reliability systems
  • Foster cross-organizational reliability initiatives
  • Set technical standards and best practices
  • Act as technical authority during major incidents
  • Mentor senior engineers and tech leads
  • Design and implement reliability frameworks and tooling
  • Lead architecture reviews for critical infrastructure projects
  • Develop reliability metrics and SLO frameworks

Requirements For Staff Site Reliability Engineer

Python
Go
Kafka
PostgreSQL
MongoDB
  • Significant history in Site Reliability Engineering with 2+ years leading complex projects
  • Deep expertise in cloud architecture (AWS, GCP, or Azure)
  • Experience with CI/CD pipelines (Jenkins, GitLab CI, CircleCI)
  • Expertise in configuration management (Ansible, Chef, Puppet)
  • Proficiency with Kafka and messaging brokers
  • Strong database management experience
  • Programming skills in Python, Go, or Bash
  • Strong technical leadership skills
  • Exceptional communication skills
  • Experience with compliance frameworks (PCI, ISO 27001, HIPAA, SOC)

Benefits For Staff Site Reliability Engineer

Equity
  • Equity participation
  • Competitive salary
  • Remote-first work environment
  • Work-life balance

Interested in this job?

Jobs Related To Netlify Staff Site Reliability Engineer

Senior Site Reliability / GitOps Engineer

Senior Site Reliability Engineer position at Canonical, focusing on GitOps and infrastructure automation for Ubuntu's ecosystem

Site Reliability Engineer (Information Technology)

SpaceX Site Reliability Engineer position focusing on Kubernetes and Linux infrastructure management to support space exploration technology development.

Staff Software Engineer, Reliability Engineering

Staff Software Engineer position at Airbnb focusing on Site Reliability Engineering, developing and maintaining tools for service reliability at scale.

Sr Staff Software Engineer, Reliability Engineering

Senior Staff SRE position at Airbnb focusing on reliability architecture, incident management, and technical leadership, offering competitive compensation and remote work flexibility.

Staff Site Reliability Engineer

Staff SRE position at Forma focusing on cloud infrastructure, monitoring, and automation, offering remote work and comprehensive benefits.