Staff Site Reliability Engineer

Building a better web by making it easier to build, deploy, and scale web applications through unified web development tools and services.
$96,000 - $130,000
Site Reliability
Staff Software Engineer
Remote
501 - 1,000 Employees
2+ years of experience
Enterprise SaaS

Description For Staff Site Reliability Engineer

Netlify, a Series D company that has raised over $200M, is unifying the ecosystem of web development tools and services. As a Staff Site Reliability Engineer, you'll be instrumental in scaling its infrastructure to meet the demands of over 4 million web developers and businesses.

The role combines technical leadership with hands-on engineering, requiring expertise in cloud architecture, CI/CD pipelines, and database management. You'll be responsible for championing architectural vision, fostering cross-organizational reliability initiatives, and mentoring senior engineers. The position demands strong experience with tools like Kafka, configuration management systems, and programming languages such as Python and Go.

This is an exceptional opportunity for an experienced SRE leader who wants to make a significant impact on the future of web development. You'll work in a remote-first, globally distributed environment that values asynchronous communication and documentation. The company culture emphasizes diversity, inclusion, and work-life balance, making it an ideal place for both career growth and personal development.

The role offers competitive compensation (£96,000 - £130,000 for UK-based locations), equity participation, and the chance to work with cutting-edge technology. You'll be joining a company backed by prestigious investors like Andreessen Horowitz and Kleiner Perkins, with a clear mission to build a better web.

The ideal candidate will bring at least two years of leadership experience in complex technical projects, deep expertise in cloud architecture, and exceptional communication skills. You'll be working with teams across the organization to implement large-scale infrastructure improvements and standardize SRE practices, making this an excellent opportunity for someone who wants to drive technical excellence at scale.

Last updated 2 months ago

Responsibilities For Staff Site Reliability Engineer

  • Champion architectural vision and technical strategy for reliability systems
  • Foster cross-organizational reliability initiatives
  • Set technical standards and best practices
  • Act as technical authority during major incidents
  • Mentor senior engineers and tech leads
  • Design and implement reliability frameworks and tooling
  • Lead architecture reviews for critical infrastructure projects
  • Develop reliability metrics and SLO frameworks

Requirements For Staff Site Reliability Engineer

Python, Go, Kafka, PostgreSQL, MongoDB
  • Significant Site Reliability Engineering experience, including 2+ years leading complex projects
  • Deep expertise in cloud architecture (AWS, GCP, or Azure)
  • Experience with CI/CD pipelines (Jenkins, GitLab CI, CircleCI)
  • Expertise in configuration management (Ansible, Chef, Puppet)
  • Proficiency with Kafka and messaging brokers
  • Strong database management experience
  • Programming skills in Python, Go, or Bash
  • Strong technical leadership skills
  • Exceptional communication skills
  • Experience with compliance frameworks (PCI, ISO 27001, HIPAA, SOC)

Benefits For Staff Site Reliability Engineer

  • Equity participation
  • Competitive salary
  • Remote-first work environment
  • Work-life balance
