Site Reliability Engineer

One is a fintech company backed by Ribbit and Walmart, focused on helping customers achieve financial progress by providing solutions for saving, spending, borrowing, and growing money in one place.
$120,000 - $190,000
Site Reliability
Senior Software Engineer
Remote
5+ years of experience
Finance

Description For Site Reliability Engineer

One is revolutionizing financial services by creating an all-in-one platform to help customers save, spend, borrow, and grow their money. As a Site Reliability Engineer at One, you'll play a crucial role in ensuring the reliability and availability of critical services that impact millions of Americans' financial lives. The company is uniquely positioned with backing from Ribbit and Walmart, combining the agility of a startup with strong financial support.

The role offers an exciting opportunity to be an early member of the growing SRE team, where you'll help establish core processes and best practices. You'll work with distributed systems, implement observability solutions, and drive the incident management process while maintaining a blameless culture. The position requires expertise in cloud native systems, programming skills in languages like Python, TypeScript, or Go, and experience with observability platforms.

The company offers an attractive compensation package ranging from $120,000 to $190,000, along with comprehensive benefits including equity, flexible work arrangements, and various leave options. One's commitment to inclusion and diversity, combined with their mission to improve financial access for underserved populations, makes this an opportunity to make a real impact while working with cutting-edge technology.

Working at One means joining a mission-driven company that's addressing real problems in the financial sector, with roughly 80% of fintech users currently relying on multiple accounts to manage their finances. The role offers both technical challenges and the satisfaction of helping build systems that contribute to people's financial progress.

Last updated a month ago

Responsibilities For Site Reliability Engineer

  • Working proactively with engineering teams to help them set SLOs and implement best practices for logging and telemetry collection
  • Design, implement and maintain the tools and systems that support service reliability, monitoring, and alerting
  • Participating in a 12x7 on-call rotation supporting the health of our services
  • Driving the incident management process and support a blameless post-mortem culture
  • Participating in application design consulting and capacity planning
  • Defining and formalizing SRE practices and help guide the overall reliability engineering direction
  • Providing mentorship both formally and informally to engineers at One
  • Continuously optimizing systems and workflows by improving architecture, infrastructure, automation, CI/CD, and observability
  • Combining software and systems knowledge to engineer high-volume distributed systems

Requirements For Site Reliability Engineer

Python
TypeScript
Go
  • 5+ years of relevant industry experience with distributed cloud native systems design, observability, operation, maintenance, and troubleshooting
  • 5+ years operational experience with an observability platform like Datadog, Splunk, Prometheus/Grafana, or AppDynamics
  • Fluency in one or more programming languages (e.g. Python, Typescript, Go)
  • Strong conviction in software development best practices
  • Self-motivated, inquisitive, and always looking to learn new technologies
  • Great teammate who communicates clearly and transparently
  • Triple H Factor: Humble, Hungry and Honest
  • Act-like-an-owner mentality

Benefits For Site Reliability Engineer

401k
Equity
  • Competitive cash
  • Benefits effective on day one
  • Early access to a high potential, high growth fintech
  • Generous stock option packages
  • Remote friendly (anywhere in the US)
  • Flexible time off programs
  • Vacation
  • Sick leave
  • Paid parental leave
  • Paid caregiver leave
  • 401(k) plan with match

Interested in this job?

Jobs Related To One Site Reliability Engineer

Senior Site Reliability Engineer

Senior Site Reliability Engineer role at Zscaler, focusing on cloud infrastructure, automation, and maintaining high-availability systems across AWS, Azure, and GCP.

Senior Site Reliability Engineer

Senior SRE position at Blacklane focusing on system reliability, observability, and mentoring, offering hybrid work and equity in a global mobility company.

Senior Site Reliability Engineer

Senior Site Reliability Engineer role at Zscaler, focusing on cloud infrastructure, automation, and maintaining high-availability systems across AWS, Azure, and GCP.

Senior Site Reliability Engineer

Senior Site Reliability Engineer role at Prove, focusing on building and maintaining scalable, reliable systems for digital identity solutions.

Site Reliability Engineer - EMEA

Remote Site Reliability Engineer position at BforeAI, focusing on system reliability and scalability across EMEA region.