Staff Site Reliability Engineer

Modern data and workflow platform for real estate professionals, building next-generation software/data products for the residential real estate industry.
$195,000 - $235,000
Site Reliability
Staff Software Engineer
Hybrid
5+ years of experience
Real Estate · Enterprise SaaS

Description For Staff Site Reliability Engineer

Perchwell, a modern data and workflow platform for real estate professionals backed by Lux Capital and Founders Fund, is seeking a Staff Site Reliability Engineer to join their team in New York. This is a founding member position of the SRE team where you'll own and improve the technical foundations while driving strategic initiatives across the engineering organization.

The role requires deep expertise in infrastructure and architectural design, with a focus on observability, performance engineering, and incident response. You'll work closely with the VP of Engineering and senior leaders to tackle current challenges while building new capabilities. The position involves partnering with product and QA organizations to build systems that enable faster, safer innovation.

As a Staff SRE, you'll lead initiatives around performance, event-driven architecture, and manage AWS infrastructure and Kubernetes deployments. You'll champion observability practices, lead incident management, and foster a culture of continuous improvement. The role offers competitive compensation ($195K-$235K plus equity) and requires working from the New York City HQ at least 3 days/week.

Key responsibilities include designing scalable solutions, managing AWS infrastructure, owning CI/CD processes, and implementing observability systems. The ideal candidate should have 5+ years of SRE experience, strong programming skills in Python/Go/Rust, and extensive knowledge of AWS services and Kubernetes.

This is an excellent opportunity for an experienced SRE who wants to make a significant impact in a growing company that's revolutionizing the real estate technology space. The role combines technical leadership with hands-on engineering work, making it perfect for someone who enjoys both building systems and mentoring teams.

Last updated 7 hours ago

Responsibilities For Staff Site Reliability Engineer

  • Lead major core initiatives around performance and event-driven architecture
  • Design and build scalable processes and solutions to engineering challenges
  • Design and manage scalable, secure AWS infrastructure
  • Partner with Quality Team to own CI/CD processes
  • Own Kubernetes infrastructure and strategy
  • Build and manage self-service infrastructure automation via Terraform
  • Champion observability systems and best practices
  • Lead incident management and disaster recovery processes
  • Mentor teams on SRE principles and practices
  • Partner with FinOps to manage infrastructure spending

Requirements For Staff Site Reliability Engineer

Python
Go
Rust
Kubernetes
Linux
  • BS or MS in Computer Science, related technical field, or equivalent experience
  • Distributed systems experience
  • Deep experience with AWS cloud services
  • In-depth knowledge of Kubernetes
  • Experience building automation tools
  • Programming experience in Python, Golang, or Rust
  • Systems thinking and strategic problem solving
  • Experience with service, database and infrastructure boundaries
  • Observability implementation experience
  • 5+ years of experience in a dedicated SRE role

Interested in this job?

Jobs Related To Perchwell Staff Site Reliability Engineer

Staff Software Engineer, Reliability Engineering

Staff Software Engineer position at Airbnb focusing on Site Reliability Engineering, building and maintaining systems for service reliability at scale with incident management responsibilities.

Site Reliability Engineer – AIOps

Senior Site Reliability Engineer role focusing on AIOps at Oracle, building AI-driven solutions for cloud infrastructure reliability and automation.

Lead Site Reliability Engineer

Lead Site Reliability Engineer position at Bumble Inc., focusing on ensuring system reliability and scalability while working with cutting-edge technologies in a hybrid work environment in London.

Staff Site Reliability Engineer

Staff Site Reliability Engineer position at ClickUp, focusing on maintaining and improving the reliability of their all-in-one work management platform.

Staff Site Reliability Engineer

Staff Site Reliability Engineer position at Assured, focusing on building and maintaining scalable infrastructure for insurance claims processing platform.