Senior Site Reliability Engineer

A values-driven, well-funded FinTech and HR company providing financial empowerment platform for small and midsize businesses.
Site Reliability
Senior Software Engineer
Remote
5+ years of experience
Finance · Enterprise SaaS

Description For Senior Site Reliability Engineer

ZayZoon is a rapidly growing FinTech and HR company recognized in the 2023 Deloitte Technology Fast 500 and Canadian Technology Fast 50 program. Their mission is to save ten-million hard-working employees ten-billion dollars through their financial empowerment platform. As a Senior Site Reliability Engineer, you'll be crucial in elevating ZayZoon's cloud infrastructure using AWS, infrastructure-as-code, and comprehensive observability solutions. You'll work within an embedded reliability team alongside app and data engineers, focusing on monitoring, benchmarking, and scaling ZayZoon's products.

The role demands expertise in AWS services, particularly with serverless resources like ECS, Fargate, and Lambda. You'll be responsible for maintaining CloudFormation templates, analyzing metrics using AWS tooling and third-party platforms, and managing deployment pipelines. Database management, cost optimization, and security compliance are key aspects of the position.

This is an excellent opportunity for an experienced SRE who values predictability, reliability, and scalability. You'll work with cutting-edge technologies while bridging bare metal infrastructure with Ruby on Rails applications. The position offers the flexibility of remote work within Canada, requiring candidates to have secure high-speed internet and workspace.

ZayZoon's platform is becoming an essential financial wellness super-app that both employees and employers value. As part of a values-driven, well-funded organization, you'll contribute to making a significant impact on employee financial wellness while working with first-class technologies and staff. The company's recent recognition for rapid growth demonstrates its momentum and potential for continued success.

Last updated 14 days ago

Responsibilities For Senior Site Reliability Engineer

  • Develop and maintain infrastructure-as-code CloudFormation templates
  • Perform instrumentation and daily metrics analysis of infrastructure and Ruby on Rails applications
  • Manage deployment pipelines including blue/green and intelligent auto-scaling
  • Maintain database resources and dependencies
  • Project costs and implement AWS cost savings programs
  • Ensure SOC-2 and cybersecurity compliance
  • Collaborate with app developers on metrics and database performance
  • Collaborate with data engineers on data warehouse development
  • Participate in agile development process

Requirements For Senior Site Reliability Engineer

Redis
Ruby
  • 5+ years infrastructure experience
  • 2+ years AWS experience including certification
  • Proficiency with IaC, specifically CloudFormation
  • Experience with containerization (Docker, ECS, ECR)
  • Experience with observability platforms (DataDog, NewRelic, OTel)
  • Strong SQL and data analysis skills
  • Ability to build quick for experiments and clean for core functionality

Interested in this job?

Jobs Related To ZayZoon Senior Site Reliability Engineer

Senior Site Reliability Engineer

Remote Senior Site Reliability Engineer position at ZayZoon, focusing on AWS infrastructure and cloud operations across Canadian locations.

Site Reliability Engineer - GovCloud - Rotating Shift

Site Reliability Engineer position at Salesforce focusing on GovCloud infrastructure maintenance, incident response, and system reliability for government customers.

Site Reliability Engineer L4/L5 - Live Cloud Platform SRE

Senior Site Reliability Engineer position at Netflix focusing on cloud platform reliability for live streaming events, offering competitive compensation and comprehensive benefits.

Senior Software Developer, Site Reliability Engineering, Google Cloud

Senior SRE role at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability and scalability.

Senior Software Engineer, Site Reliability Engineering, Google Cloud

Senior SRE position at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability, automation, and infrastructure development.