Senior Site Reliability Engineer (SRE)

Sword Health is on a mission to free two billion people from pain as the world's first and only end-to-end platform to predict, prevent and treat pain.
Site Reliability
Senior Software Engineer
Remote
501 - 1,000 Employees
5+ years of experience
Healthcare · Enterprise SaaS

Description For Senior Site Reliability Engineer (SRE)

Sword Health is on a mission to free two billion people from pain as the world's first and only end-to-end platform to predict, prevent and treat pain. As a Senior Site Reliability Engineer (SRE) at SwordHealth, you will play a critical role in maintaining the health and uptime of our services. You will collaborate with development teams to build and operate scalable and resilient systems, troubleshoot issues across the stack, and implement automation to reduce manual work.

Your responsibilities will include:

  • Monitoring and Incident Management: Develop and maintain monitoring and alerting solutions. Respond to incidents, troubleshoot issues, and perform root cause analysis.
  • Automation and Tooling: Automate repetitive tasks and improve deployment processes. Develop and maintain tools to support infrastructure and applications.
  • Performance Optimization: Analyze system performance and implement optimizations to improve efficiency and reduce latency.
  • Security and Compliance: Ensure systems are secure and compliant with relevant standards and regulations.
  • Documentation and Knowledge Sharing: Maintain comprehensive documentation of systems and processes. Share knowledge and best practices with team members.
  • Database Management: Ensure the reliability, performance, and scalability of databases. Perform database optimization, maintenance, and troubleshooting.

Sword Health has experienced unprecedented growth since our market debut in 2020 and has created a remarkable mission and value-driven environment. With a recent valuation of $2 billion, we are in a phase of hyper growth and expansion. Joining Sword Health means committing to a set of core values, chief amongst them to "do it for the patients" every day, and to always "deliver more than expected" on behalf of our members and clients.

This is an opportunity for you to make a significant difference on a massive scale as you work alongside 800+ (and growing!) talented colleagues, spanning two continents. Your charge? To help us build a pain-free world, powered by technology, enhanced by people — accessible to all.

Last updated 2 months ago

Responsibilities For Senior Site Reliability Engineer (SRE)

  • Monitoring and Incident Management: Develop and maintain monitoring and alerting solutions. Respond to incidents, troubleshoot issues, and perform root cause analysis
  • Automation and Tooling: Automate repetitive tasks and improve deployment processes. Develop and maintain tools to support infrastructure and applications
  • Performance Optimization: Analyze system performance and implement optimizations to improve efficiency and reduce latency
  • Security and Compliance: Ensure systems are secure and compliant with relevant standards and regulations
  • Documentation and Knowledge Sharing: Maintain comprehensive documentation of systems and processes. Share knowledge and best practices with team members
  • Database Management: Ensure the reliability, performance, and scalability of databases. Perform database optimization, maintenance, and troubleshooting

Requirements For Senior Site Reliability Engineer (SRE)

Python
Go
JavaScript
Kubernetes
Linux
MySQL
PostgreSQL
Redis
  • Proficiency in programming languages such as Python, Go, Javascript
  • 5+ years of experience with cloud platforms such as AWS, Google Cloud, or Azure
  • Strong understanding of Linux/Unix systems and networking
  • Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes)
  • Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack)
  • Knowledge of CI/CD pipelines and tools (e.g., Jenkins, GitLab CI)
  • Proficiency with relational and NoSQL databases (e.g., MySQL, PostgreSQL, Redis, Elasticsearch)
  • Team Player: Willingness to collaborate and share knowledge with colleagues to drive collective success
  • Ownership: Taking responsibility for your work and demonstrating accountability for outcomes

Benefits For Senior Site Reliability Engineer (SRE)

Medical Insurance
Dental Insurance
Vision Insurance
Equity
Parental Leave
401k
  • Comprehensive health, dental and vision insurance
  • Equity shares
  • Discretionary PTO plan
  • Parental leave
  • 401(k)
  • Flexible working hours
  • Remote-first company
  • Paid company holidays
  • Free digital therapist for you and your family

Interested in this job?

Jobs Related To Sword Health Senior Site Reliability Engineer (SRE)

Site Reliability Engineer

Senior Site Reliability Engineer position at OneDegree, focusing on cloud infrastructure, monitoring, and automation for insurance and cybersecurity platforms in APAC.

Senior Site Reliability Engineer

Senior Site Reliability Engineer role at Prove, focusing on building and maintaining scalable, reliable systems for digital identity solutions.

Senior Site Reliability Engineer

Senior Site Reliability Engineer role at Prove, focusing on building and maintaining scalable, reliable systems for digital identity solutions.

Senior Software Developer, Site Reliability Engineering, Google Cloud

Senior SRE role at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability and scalability.