Staff Site Reliability Engineer

Building a better web by making it easier to build, deploy, and scale web applications through unified web development tools and services.
$96,000 - $130,000
Site Reliability
Staff Software Engineer
Remote
501 - 1,000 Employees
2+ years of experience
Enterprise SaaS

Description For Staff Site Reliability Engineer

Netlify, a Series D company that has raised over $200M, is revolutionizing web development by unifying the ecosystem of development tools and services. As a Staff Site Reliability Engineer, you'll be instrumental in scaling its infrastructure to meet the demands of over 4 million web developers and businesses.

The role combines technical leadership with hands-on engineering, requiring expertise in cloud architecture, CI/CD pipelines, and database management. You'll be responsible for championing architectural vision, fostering cross-organizational reliability initiatives, and mentoring senior engineers. The position demands strong experience with Kafka, configuration management systems, and programming languages such as Python and Go.

This is an exceptional opportunity for an experienced SRE leader who wants to make a significant impact on the future of web development. You'll work in a remote-first, globally distributed environment that values asynchronous communication and documentation. The company culture emphasizes diversity, inclusion, and work-life balance, making it an ideal place for both career growth and personal development.

The role offers competitive compensation (£96,000 - £130,000 for UK-based locations), equity participation, and the chance to work with cutting-edge technology. You'll be joining a company backed by prestigious investors like Andreessen Horowitz and Kleiner Perkins, with a clear mission to build a better web.

The ideal candidate will bring at least two years of leadership experience in complex technical projects, deep expertise in cloud architecture, and exceptional communication skills. You'll be working with teams across the organization to implement large-scale infrastructure improvements and standardize SRE practices, making this an excellent opportunity for someone who wants to drive technical excellence at scale.

Last updated 2 months ago

Responsibilities For Staff Site Reliability Engineer

  • Champion architectural vision and technical strategy for reliability systems
  • Foster cross-organizational reliability initiatives
  • Set technical standards and best practices
  • Act as technical authority during major incidents
  • Mentor senior engineers and tech leads
  • Design and implement reliability frameworks and tooling
  • Lead architecture reviews for critical infrastructure projects
  • Develop reliability metrics and SLO frameworks

Requirements For Staff Site Reliability Engineer

Python, Go, Kafka, PostgreSQL, MongoDB
  • Significant history in Site Reliability Engineering with 2+ years leading complex projects
  • Deep expertise in cloud architecture (AWS, GCP, or Azure)
  • Experience with CI/CD pipelines (Jenkins, GitLab CI, CircleCI)
  • Expertise in configuration management (Ansible, Chef, Puppet)
  • Proficiency with Kafka and messaging brokers
  • Strong database management experience
  • Programming skills in Python, Go, or Bash
  • Strong technical leadership skills
  • Exceptional communication skills
  • Experience with compliance frameworks (PCI, ISO 27001, HIPAA, SOC)

Benefits For Staff Site Reliability Engineer

  • Equity participation
  • Competitive salary
  • Remote-first work environment
  • Work-life balance
