Staff Infrastructure Site Reliability Engineer

Netlify

A Series D company building a better web by unifying web development tools, content sources, services, and APIs into one simplified workflow.

Spain • Canada • United Kingdom

€84,000 - €113,000 (Spain) • CAD $163,000 - CAD $221,000 (Canada)

DevOps

Staff Software Engineer

Remote

501 - 1,000 Employees

8+ years of experience

Enterprise SaaS

Description For Staff Infrastructure Site Reliability Engineer

Netlify is seeking a Staff Infrastructure Site Reliability Engineer to join their remote-first team, focusing on ensuring the reliability, scalability, and efficiency of their rapidly growing platform. This role combines technical leadership with hands-on engineering work, requiring expertise in cloud architecture, infrastructure automation, and system reliability.

The position offers an opportunity to shape the direction of Netlify's systems while working with Kubernetes, cloud platforms (AWS, Azure, or GCP), and modern DevOps tools. You'll be responsible for leading high-impact reliability initiatives, managing critical infrastructure, and mentoring other engineers. The role requires strong programming skills (Go or Python), experience with infrastructure-as-code and cloud services, and the ability to drive organization-level reliability strategies.

As a Series D company that has raised over $200M from top-tier investors, Netlify offers a competitive compensation package including equity participation. The salary range is €84,000 - €113,000 for candidates based in Spain and CAD $163,000 - CAD $221,000 for candidates based in Canada, with adjustments based on location and experience.

The ideal candidate will have deep expertise in cloud architecture, experience with messaging systems like Kafka, strong database knowledge, and proven leadership in large-scale technical initiatives. You'll work in a globally distributed environment that values asynchronous communication, documentation, and a culture of transparency. This role is perfect for someone who thinks in systems, enjoys coding (especially in Go), and is passionate about building reliable infrastructure at scale.

Working at Netlify means joining a mission to build a better web by making it easier to develop and deploy web applications. The company culture emphasizes diversity, inclusion, and work-life balance, making it an attractive opportunity for experienced infrastructure engineers who want to make a significant impact while continuing to grow professionally.

Last updated 3 months ago

Responsibilities For Staff Infrastructure Site Reliability Engineer

Lead high-impact reliability and infrastructure initiatives across the platform
Drive the adoption of Infrastructure-as-Code and champion reliability-focused tooling
Manage cloud infrastructure components including instances, networking, DNS, Terraform automation, and Kubernetes
Define and uphold architectural standards and technical strategy for reliability at scale
Provide mentorship to senior engineers and tech leads
Partner with Engineering, Product, and Executive teams
Lead architecture reviews and provide oversight for critical infrastructure projects
Develop reliability metrics and SLO frameworks
Participate in on-call rotation and act as Incident Commander

Requirements For Staff Infrastructure Site Reliability Engineer

Kubernetes
Linux
Deep expertise in cloud architecture with AWS, Azure, or GCP
Strong proficiency with Kafka or similar messaging systems
Solid experience in database design and maintenance
Skilled in programming and scripting languages such as Go or Python
Proven track record of leading large-scale technical initiatives
Proficiency in configuration management tools
Experience in managing CI/CD pipelines
Excellent communication skills
Must be based in Spain, Canada, or the UK

Benefits For Staff Infrastructure Site Reliability Engineer

Remote-first work environment
Equity participation
Competitive salary based on location