Senior Site Reliability Engineer

Webflow is the leading visual development platform for building powerful websites without writing code.
$139,000 - $218,000
Site Reliability
Senior Software Engineer
Remote
5+ years of experience
Enterprise SaaS

Description For Senior Site Reliability Engineer

At Webflow, our mission is to bring development superpowers to everyone. Webflow is the leading visual development platform for building powerful websites without writing code. By combining modern web development technologies into one platform, Webflow enables people to build websites visually, saving engineering time, while clean code seamlessly generates in the background. From independent designers and creative agencies to Fortune 500 companies, millions worldwide use Webflow to be more nimble, creative, and collaborative. It's the web, made better.

We're looking for a Senior Site Reliability Engineer to improve reliability and stability of Webflow's customer-facing, production infrastructure, serving millions of page views per hour. Our product is used by over 2 million users world-wide across 190 countries, and you'll help ensure our platform is secure and scalable for these users as tens of thousands of projects are launched on Webflow each month.

As a Senior Site Reliability Engineer, you'll:

  • Empower engineers on other teams to take control of their services by maintaining monitoring tooling and collaborating on internal best practices for observability.
  • Enhance reliability of applications running in Kubernetes by optimizing resource allocation, streamlining upgrade processes, and ensuring scalability and fault tolerance.
  • Occasionally dive into the main Webflow application in Node, Python, or Go to better discern (and sometimes fix) behavior in production.
  • Work with peers on Webflow's Customer Support, Partnerships, and Sales teams to enable customers using Webflow's services in production.
  • Participate in and continuously improve on-call and incident response processes.

You'll thrive as a Senior Site Reliability Engineer if you have:

  • Either a background as an ops engineer with an enthusiasm for code, or a background as a software engineer with an enthusiasm for systems administration.
  • 5+ years of experience building, maintaining, and debugging distributed systems in a customer-facing environment that allows for little to no downtime.
  • Experience navigating and scaling multi-tier cloud environments on either AWS or GCP.
  • Experience with container-centric architectures, built with Docker and tools like Kubernetes (EKS, GKE, AKS, OpenShift, etc.), ECS, Docker Swarm, or Mesos.
  • Experience with infrastructure-as-code tools like Terraform, Pulumi, Ansible, Puppet, or Chef.
  • Experience in contributing to full-stack applications built using tools like React, Node, and MongoDB.
  • Enthusiasm for mentoring and sponsoring less-experienced engineers.

Bonus points for:

  • Experience with Kubernetes, Nginx, Terraform, or Pulumi specifically.
  • Experience improving on-call and incident response processes for Engineering.
  • Experience working in high-compliance environments or a special interest in security engineering.

Join us at Webflow to build the future of the web and empower millions of users worldwide!

Last updated 2 months ago

Responsibilities For Senior Site Reliability Engineer

  • Empower engineers on other teams to take control of their services by maintaining monitoring tooling and collaborating on internal best practices for observability
  • Enhance reliability of applications running in Kubernetes by optimizing resource allocation, streamlining upgrade processes, and ensuring scalability and fault tolerance
  • Occasionally dive into the main Webflow application in Node, Python, or Go to better discern (and sometimes fix) behavior in production
  • Work with peers on Webflow's Customer Support, Partnerships, and Sales teams to enable customers using Webflow's services in production
  • Participate in and continuously improve on-call and incident response processes

Requirements For Senior Site Reliability Engineer

Go
Java
JavaScript
Kubernetes
MongoDB
Node.js
Python
React
TypeScript
  • Either a background as an ops engineer with an enthusiasm for code, or a background as a software engineer with an enthusiasm for systems administration
  • 5+ years of experience building, maintaining, and debugging distributed systems in a customer-facing environment that allows for little to no downtime
  • Experience navigating and scaling multi-tier cloud environments on either AWS or GCP
  • Experience with container-centric architectures, built with Docker and tools like Kubernetes (EKS, GKE, AKS, OpenShift, etc.), ECS, Docker Swarm, or Mesos
  • Experience with infrastructure-as-code tools like Terraform, Pulumi, Ansible, Puppet, or Chef
  • Experience in contributing to full-stack applications built using tools like React, Node, and MongoDB
  • Enthusiasm for mentoring and sponsoring less-experienced engineers

Benefits For Senior Site Reliability Engineer

401k
Dental Insurance
Education Budget
Equity
Medical Insurance
Mental Health Assistance
Parental Leave
Vision Insurance
  • Equity ownership (RSUs) in a growing, privately-owned company
  • 100% employer-paid healthcare, vision, and dental insurance coverage for employees and dependents
  • 12 weeks of paid parental leave for both birthing and non-birthing caregivers
  • Flexible PTO with a mandatory annual minimum of 10 days paid time off
  • Access to mental wellness and professional coaching, therapy, and Employee Assistance Program
  • Monthly stipends to support health and wellness, smart work, and professional growth
  • Professional career coaching, internal learning & development programs
  • 401k plan and pension schemes
  • Discounted Pet Insurance offering (US only)
  • Commuter benefits for in-office employees

Interested in this job?

Jobs Related To Webflow Senior Site Reliability Engineer

Platform Engineer (Service Reliability Engineer)

Senior Platform Engineer role focusing on service reliability, cloud infrastructure, and DevOps practices in a financial services environment.

Senior Site Reliability Engineer

Senior Site Reliability Engineer position at NordVPN, focusing on infrastructure automation and reliability for a leading VPN service provider.

Site Reliability Engineer- SRE

Senior Site Reliability Engineer position at Apple, focusing on platform engineering and cloud infrastructure for hardware engineering tools and data analytics.

Senior Site Reliability Engineer - Observability and Telemetry Platform

Senior SRE position at NVIDIA focusing on observability and telemetry platforms, offering competitive salary and opportunity to work with cutting-edge cloud technologies.

Senior Production SRE Engineer - Storage

Senior Production SRE Engineer position at NVIDIA focusing on storage systems, requiring 5+ years experience and expertise in large-scale system reliability and automation.