Senior Site Reliability Engineer

Sword Health is on a mission to free two billion people from pain as the world's first and only end-to-end platform to predict, prevent and treat pain.
Canada
Site Reliability
Senior Software Engineer
Hybrid
501 - 1,000 Employees
5+ years of experience

Description For Senior Site Reliability Engineer

Sword Health is on a mission to free two billion people from pain as the world's first and only end-to-end platform to predict, prevent and treat pain. Delivering a 62% reduction in pain and a 60% reduction in surgery intent, Sword is using technology to save millions for their 2,500+ enterprise clients across three continents. They hold the majority of industry patents, win 70% of competitive evaluations, and have raised more than $300 million from top venture firms.

As a Senior Site Reliability Engineer (SRE) at Sword Health, you will play a critical role in maintaining the health and uptime of their services. You will collaborate with development teams to build and operate scalable and resilient systems, troubleshoot issues across the stack, and implement automation to reduce manual work.

Key responsibilities include:

  • Monitoring and incident management
  • Automation and tooling development
  • Performance optimization
  • Security and compliance
  • Documentation and knowledge sharing
  • Database management

Requirements:

  • Proficiency in programming languages such as Python, Go, Javascript
  • 5+ years of experience with cloud platforms (AWS, Google Cloud, or Azure)
  • Strong understanding of Linux/Unix systems and networking
  • Familiarity with containerization and orchestration tools
  • Experience with monitoring and logging tools
  • Knowledge of CI/CD pipelines and tools
  • Database experience with relational and NoSQL databases

Sword Health offers a stimulating, fast-paced environment with room for creativity, career development, and a competitive salary. They provide a flexible work environment with unlimited vacation and access to a health and well-being program. Join a talented team of 800+ colleagues spanning two continents and make a significant difference on a massive scale in building a pain-free world.

Last updated 5 months ago

Responsibilities For Senior Site Reliability Engineer

  • Monitoring and Incident Management: Develop and maintain monitoring and alerting solutions. Respond to incidents, troubleshoot issues, and perform root cause analysis
  • Automation and Tooling: Automate repetitive tasks and improve deployment processes. Develop and maintain tools to support infrastructure and applications
  • Performance Optimization: Analyze system performance and implement optimizations to improve efficiency and reduce latency
  • Security and Compliance: Ensure systems are secure and compliant with relevant standards and regulations
  • Documentation and Knowledge Sharing: Maintain comprehensive documentation of systems and processes. Share knowledge and best practices with team members
  • Database Management: Ensure the reliability, performance, and scalability of databases. Perform database optimization, maintenance, and troubleshooting

Requirements For Senior Site Reliability Engineer

Python
Go
JavaScript
Linux
Kubernetes
Redis
  • Proficiency in programming languages such as Python, Go, Javascript
  • 5+ years of experience with cloud platforms such as AWS, Google Cloud, or Azure
  • Strong understanding of Linux/Unix systems and networking
  • Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes)
  • Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack)
  • Knowledge of CI/CD pipelines and tools (e.g., Jenkins, GitLab CI)
  • Proficiency with relational and NoSQL databases (e.g., MySQL, PostgreSQL, Redis, Elasticsearch)
  • Team Player: Willingness to collaborate and share knowledge with colleagues
  • Ownership: Taking responsibility for your work and demonstrating accountability for outcomes

Benefits For Senior Site Reliability Engineer

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
Mental Health Assistance
  • Comprehensive health, dental and vision insurance
  • Equity shares
  • Discretionary PTO plan
  • Parental leave
  • 401(k)
  • Flexible working hours
  • Remote-first company
  • Paid company holidays
  • Free digital therapist for you and your family
  • Meal allowance
  • Remote work allowance
  • Unlimited vacation
  • Snacks and beverages
  • English class
  • Unlimited access to Coursera Learning Platform

Interested in this job?

Jobs Related To Sword Health Senior Site Reliability Engineer

Senior Site Reliability Engineer (SRE)

Senior Site Reliability Engineer at Sword Health, maintaining service health and uptime for pain prevention platform.

Site Reliability Engineer

Senior Site Reliability Engineer position at OneDegree, focusing on cloud infrastructure, monitoring, and automation for insurance and cybersecurity platforms in APAC.

Senior Site Reliability Engineer

Senior Site Reliability Engineer role at Prove, focusing on building and maintaining scalable, reliable systems for digital identity solutions.

Senior Site Reliability Engineer

Senior Site Reliability Engineer role at Prove, focusing on building and maintaining scalable, reliable systems for digital identity solutions.

Senior Software Developer, Site Reliability Engineering, Google Cloud

Senior SRE role at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability and scalability.