Senior Site Reliability Engineer

World's leading Open Payments Platform processing over $50b of GMV annually, providing payments orchestration and advanced vault solutions.
Durham, NC, USAQuébec City, QC, Canada
Site Reliability
Senior Software Engineer
Remote
5+ years of experience
Finance · Enterprise SaaS

Description For Senior Site Reliability Engineer

Spreedly, the world's leading Open Payments Platform, is seeking a Senior Site Reliability Engineer to join their team. This role is crucial in maintaining and optimizing their globally distributed payments platform that processes over $50B in annual transactions. The position offers a unique opportunity to work with modern technologies including Ruby on Rails, Elixir, and various AWS services while ensuring the reliability and performance of critical payment systems.

The ideal candidate will be responsible for implementing robust observability solutions, leading incident response efforts, and developing automation tools to enhance system reliability. They will work closely with development teams to improve application performance and maintain 24/7 system reliability through shared on-call rotations. The role requires expertise in database optimization, cloud infrastructure, and modern SRE practices.

Spreedly offers an excellent compensation package including competitive salary, equity, comprehensive health benefits, and flexible work arrangements. The company maintains a strong focus on work-life balance with an open PTO policy and parental leave. They foster a culture of diversity, equity, and inclusion, making it an ideal workplace for those seeking both technical challenges and professional growth.

The position offers the opportunity to work remotely while being part of a team that's transforming the payment processing industry. You'll be instrumental in ensuring the reliability and scalability of a platform that processes billions in transactions monthly, making a significant impact on global commerce.

Last updated 4 days ago

Responsibilities For Senior Site Reliability Engineer

  • Ensure reliability, availability, and performance of globally distributed payments platform processing $4B monthly
  • Collaborate with development teams to improve Ruby on Rails and Elixir applications
  • Implement and maintain observability solutions using Datadog and OpenTelemetry
  • Lead incident response efforts and participate in on-call rotation
  • Develop and maintain automation tools
  • Monitor and optimize database performance
  • Provide technical leadership and mentorship
  • Foster a culture of reliability within the engineering organization

Requirements For Senior Site Reliability Engineer

Ruby
PostgreSQL
Kafka
Linux
  • Experience with Datadog, OpenTelemetry, Sentry, and Sumo Logic
  • Proficiency in modern programming languages (Ruby, Rails, Elixir preferred)
  • Experience with AWS services (EC2, S3, RDS)
  • In-depth knowledge of relational databases (CockroachDB, PostgreSQL, Riak)
  • Experience applying design patterns for reliability and scalability
  • Excellent problem-solving skills in production environments
  • Strong cross-functional collaboration abilities
  • Strong written and verbal communication skills

Benefits For Senior Site Reliability Engineer

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
Education Budget
  • Competitive salary + Equity
  • 100% employer-paid Medical and Dental benefits
  • Company-paid Life and Disability insurance
  • Optional vision and supplemental insurance
  • Open Paid Time Off policy
  • 12 weeks paid parental leave
  • 401(k) matching (5% up to $5,000 yearly)
  • $1,000 annual professional development stipend
  • Monthly home working/digital lifestyle stipend
  • New MacBook and accessory reimbursement
  • LinkedIn Learning subscription
  • Professional coaching service
  • Visits to HQ in Durham, NC for remote employees

Interested in this job?

Jobs Related To Spreedly Senior Site Reliability Engineer

Sr. Site Reliability Engineer

Senior Site Reliability Engineer position at Broadcom focusing on cloud infrastructure and SaaS platform operations.

Sr. Site Reliability Engineer - Top Secret Clearance

Senior Site Reliability Engineer position at SpaceX, requiring Top Secret clearance, focusing on infrastructure automation and DevOps practices for space flight systems.

Senior Software Developer, Site Reliability Engineering, Google Cloud

Senior SRE position at Google Cloud focusing on building and maintaining large-scale distributed systems, requiring 5+ years of software development experience and strong system design skills.

Senior Site Reliability Engineer

Senior Site Reliability Engineer position at Kontakt.io, focusing on maintaining 99.99% uptime for healthcare operations platform using AWS, Kubernetes, and advanced monitoring tools.

Senior Software Engineer, Site Reliability Engineering, Google Cloud

Senior SRE position at Google Cloud focusing on building and maintaining large-scale distributed systems, requiring 5+ years of software development experience.