Site Reliability Engineer

Tecsys is a fast-growing innovator offering supply chain solutions to industry leading healthcare systems, hospitals, and pharmacy businesses to distributors, retailers, and 3PLs.
Toronto, ON, CanadaMontreal, QC, CanadaOttawa, ON, Canada
Site Reliability
Senior Software Engineer
Remote
5+ years of experience
Healthcare · Logistics

Description For Site Reliability Engineer

Tecsys, a fast-growing innovator in supply chain solutions, is seeking a Site Reliability Engineer to join their "Network and Security Operations Center" department. This role is part of a digital-first company that values employee wellbeing and productivity. The ideal candidate will have at least 5 years of systems engineering experience and a strong background in designing and deploying large-scale systems.

As a Site Reliability Engineer, you'll be responsible for improving the reliability and uptime of Tecsys' platform and applications. Your duties will include collaborating with engineering teams, maintaining services, developing automation tools, implementing monitoring and alerting systems, and managing high-severity incidents. You'll work with cutting-edge technologies like Azure, AWS, and various monitoring tools to ensure the smooth operation of Tecsys' systems.

The role requires a bachelor's degree in computer science or a related field, along with expertise in system design, full-stack automation, and cloud platforms (AWS or Azure). Knowledge of Java or .NET development, as well as experience with tools like Datadog, Rapid7 Insight, and GitLab, will be beneficial. The ideal candidate will be a self-starter who can work independently and collaborate effectively across teams and time zones.

Tecsys offers a flexible, remote-first work environment with occasional travel for team meetings and conferences. They value diversity and inclusion, welcoming applicants from all backgrounds. This position provides an excellent opportunity for those interested in working with industry-leading healthcare systems, distributors, and retailers while tackling interesting challenges in a continuously learning environment.

Join Tecsys to transform supply chains through technology and be part of a team that's making a significant impact in the industry. Apply now to contribute to the reliability and scalability of Tecsys' innovative solutions!

Last updated 5 days ago

Responsibilities For Site Reliability Engineer

  • Collaborate with other Engineering teams to support services
  • Maintain services by measuring and monitoring availability, latency and overall system health
  • Develop tools & automation on top of Azure & AWS
  • Scale systems sustainably through automation
  • Be on-call
  • Practice sustainable incident response and blameless postmortems
  • Implement automated solutions for continuous integration and delivery (CI / CD)
  • Implement monitoring, Logging, alerting, and SLA Reporting
  • Implement service monitoring dashboards displaying key metrics
  • Create and maintain technical documentation
  • Apply SRE best practices
  • Take command of high-severity incidents and facilitate their resolution
  • Provide support for planning and deployment teams
  • Collaborate with Platform Engineering team
  • Work cross-functionally with internal teams and vendors

Requirements For Site Reliability Engineer

Java
Linux
Kubernetes
  • Bachelor's degree in computer science or related technical discipline
  • At least 5 years' experience in systems engineering
  • Experience designing and deploying large scale systems
  • Strong knowledge of system design
  • High level of understanding and examples of executing projects with full stack automation
  • Self-organize, collaborate, and manage efforts across teams
  • Be a self-starter, curious, and not afraid to ask questions
  • Knowledge of Datadog preferred
  • Knowledge of Rapid7 Insight preferred
  • Knowledge and experience of AWS or Azure required
  • Basic knowledge of Java- or .Net-based development required
  • Knowledge of GitLab preferred
  • Experience with SaaS company is a strong asset
  • Experience with Fedramp compliance is a strong asset
  • Strong English communication skills
  • Canadian Citizen, Permanent Resident, or valid Canadian work permit

Benefits For Site Reliability Engineer

  • Remote work options
  • Collaborative workspaces
  • Freedom and flexibility to work productively

Interested in this job?

Jobs Related To Tecsys Inc. Site Reliability Engineer

Senior Software Engineer, ATS Matrix Site Reliability Engineer

Senior Software Engineer role in Site Reliability Engineering at Google, building and maintaining large-scale distributed systems.

Senior Software Developer, Site Reliability Development, Protected Data

Senior Software Developer role at Google focusing on Site Reliability Development for Protected Data systems.

Senior Software Developer, Site Reliability Engineering

Senior Software Developer role in Site Reliability Engineering for Google Cloud, focusing on building and maintaining large-scale distributed systems.

Senior Software Developer, Site Reliability Engineering

Senior Software Developer role in Site Reliability Engineering at Google Cloud, focusing on large-scale distributed systems.

Senior Site Reliability Engineer, Cloud Spanner

Senior Site Reliability Engineer role at Google, focusing on Cloud Spanner and large-scale distributed systems.