Staff Site Reliability Engineer

Building a better web by making it easier to build, deploy, and scale web applications through unified web development tools and services.
$96,000 - $130,000
Site Reliability
Staff Software Engineer
Remote
501 - 1,000 Employees
2+ years of experience
Enterprise SaaS

Description For Staff Site Reliability Engineer

Netlify, a Series D company that has raised over $200M, is revolutionizing web development by unifying the ecosystem of development tools and services. As a Staff Site Reliability Engineer, you'll be instrumental in scaling their infrastructure to meet the demands of over 4 million web developers and businesses.

The role combines technical leadership with hands-on engineering, requiring expertise in cloud architecture, CI/CD pipelines, and database management. You'll be responsible for championing architectural vision, fostering cross-organizational reliability initiatives, and mentoring senior engineers. The position demands strong experience with tools like Kafka, configuration management systems, and programming languages such as Python and Go.

This is an exceptional opportunity for an experienced SRE leader who wants to make a significant impact on the future of web development. You'll work in a remote-first, globally distributed environment that values asynchronous communication and documentation. The company culture emphasizes diversity, inclusion, and work-life balance, making it an ideal place for both career growth and personal development.

The role offers competitive compensation (£96,000 - £130,000 for UK-based locations), equity participation, and the chance to work with cutting-edge technology. You'll be joining a company backed by prestigious investors like Andreessen Horowitz and Kleiner Perkins, with a clear mission to build a better web.

The ideal candidate will bring at least two years of leadership experience in complex technical projects, deep expertise in cloud architecture, and exceptional communication skills. You'll be working with teams across the organization to implement large-scale infrastructure improvements and standardize SRE practices, making this an excellent opportunity for someone who wants to drive technical excellence at scale.

Last updated 2 months ago

Responsibilities For Staff Site Reliability Engineer

  • Champion architectural vision and technical strategy for reliability systems
  • Foster cross-organizational reliability initiatives
  • Set technical standards and best practices
  • Act as technical authority during major incidents
  • Mentor senior engineers and tech leads
  • Design and implement reliability frameworks and tooling
  • Lead architecture reviews for critical infrastructure projects
  • Develop reliability metrics and SLO frameworks

Requirements For Staff Site Reliability Engineer

Python
Go
Kafka
PostgreSQL
MongoDB
  • Significant history in Site Reliability Engineering with 2+ years leading complex projects
  • Deep expertise in cloud architecture (AWS, GCP, or Azure)
  • Experience with CI/CD pipelines (Jenkins, GitLab CI, CircleCI)
  • Expertise in configuration management (Ansible, Chef, Puppet)
  • Proficiency with Kafka and messaging brokers
  • Strong database management experience
  • Programming skills in Python, Go, or Bash
  • Strong technical leadership skills
  • Exceptional communication skills
  • Experience with compliance frameworks (PCI, ISO 27001, HIPAA, SOC)

Benefits For Staff Site Reliability Engineer

Equity
  • Equity participation
  • Competitive salary
  • Remote-first work environment
  • Work-life balance

Interested in this job?

Jobs Related To Netlify Staff Site Reliability Engineer

Senior Technical Program Manager I, Site Reliability Engineering, Google Cloud Platforms

Senior Technical Program Manager role at Google Cloud, focusing on Site Reliability Engineering, offering competitive compensation and the opportunity to lead complex technical projects.

Technical Program Manager III, SRE, Cloud Infrastructure

Technical Program Manager III position at Google, focusing on SRE and Cloud Infrastructure, requiring 5 years of experience and offering $156,000-$229,000 base salary plus benefits.

Site Reliability Manager, Core Enterprise Systems

Lead a team of Site Reliability Engineers at Google, managing enterprise services and driving engineering excellence in system reliability, automation, and service delivery.

Software Engineering Manager, Site Reliability Engineering, Platform, Devices

Lead Site Reliability Engineering team at Google, managing distributed systems and infrastructure while mentoring engineers and ensuring service reliability.

System Engineering Manager, Site Reliability Engineering, Google Play

Lead Site Reliability Engineering team at Google Play, managing distributed systems and ensuring service reliability while driving technical excellence and team growth.