Site Reliability Engineer

One is a fintech company backed by Ribbit and Walmart, focused on helping customers achieve financial progress through simple solutions for saving, spending, borrowing, and growing money.
Site Reliability
Senior Software Engineer
Hybrid
5+ years of experience
Finance

Description For Site Reliability Engineer

One is on a mission to revolutionize financial services by creating an all-in-one platform to help customers save, spend, borrow, and grow their money. As a Site Reliability Engineer at One, you'll play a crucial role in ensuring the reliability and availability of critical services. The company is uniquely positioned with backing from Ribbit and Walmart, combining startup agility with strong financial support.

The role demands a seasoned professional with 5+ years of experience in distributed systems and observability. You'll be an early member of the growing SRE team, helping establish core processes and best practices. Your responsibilities will span from setting SLOs and implementing monitoring solutions to participating in on-call rotations and driving incident management processes.

The ideal candidate brings strong technical expertise in cloud-native systems, proficiency in languages like Python, TypeScript, or Go, and experience with observability platforms. Beyond technical skills, we value team players who demonstrate the "Triple H Factor" - Humble, Hungry, and Honest - and maintain an owner's mentality.

One offers a unique opportunity to impact millions of Americans' financial lives, addressing the needs of underbanked populations and those struggling with fragmented financial services. The company maintains a flat titling structure to promote equity and values diversity in building solutions that solve real-world problems. Join One to be part of a mission-driven team working to make financial progress accessible to all.

Last updated 6 minutes ago

Responsibilities For Site Reliability Engineer

  • Working proactively with engineering teams to help them set SLOs and implement best practices for logging and telemetry collection
  • Design, implement, and maintain tools and systems for service reliability, monitoring, and alerting
  • Participating in 24x7 on-call rotation supporting service health
  • Driving incident management process and supporting blameless post-mortem culture
  • Participating in application design consulting and capacity planning
  • Defining and formalizing SRE practices and guiding reliability engineering direction
  • Providing mentorship to engineers
  • Continuously optimizing systems and workflows
  • Engineering high-volume distributed systems

Requirements For Site Reliability Engineer

Python
TypeScript
Go
  • 5+ years of relevant industry experience with distributed cloud native systems
  • 5+ years operational experience with observability platforms
  • Fluency in one or more programming languages (Python, Typescript, Go)
  • Strong conviction in software development best practices
  • Self-motivated and inquisitive
  • Great teammate with clear communication
  • Triple H Factor: Humble, Hungry and Honest
  • Act-like-an-owner mentality

Interested in this job?

Jobs Related To One Site Reliability Engineer

Site Reliability Engineer

Senior Site Reliability Engineer position at One, focusing on ensuring reliability of critical financial services with competitive pay and comprehensive benefits.

Senior Site Reliability Engineer (GCP)

Senior Site Reliability Engineer position at Rackspace Technology focusing on GCP infrastructure, requiring 8+ years of experience in DevOps and cloud technologies.

Senior Site Reliability Engineer

Senior Site Reliability Engineer role at Pepperstone, focusing on building and maintaining highly available, scalable systems in a global fintech environment.

Senior Site Reliability Engineer

Senior Site Reliability Engineer position at Invert, focusing on cloud infrastructure, reliability, and developer experience in a remote-first biotech company.

Senior Software Reliability Engineer

Senior SRE position at Rho, focusing on system reliability, automation, and scalability for a fintech platform, offering hybrid work in Belgrade.