Site Reliability Engineer

Tecsys is a fast-growing innovator offering supply chain solutions to industry leading healthcare systems, hospitals, and pharmacy businesses to distributors, retailers, and 3PLs.
Toronto, ON, CanadaMontreal, QC, CanadaOttawa, ON, Canada
Site Reliability
Senior Software Engineer
Remote
5+ years of experience
Healthcare · Logistics

Description For Site Reliability Engineer

Tecsys, a fast-growing innovator in supply chain solutions, is seeking a Site Reliability Engineer to join their "Network and Security Operations Center" department. This role is part of a digital-first company that values employee wellbeing and productivity. The ideal candidate will have at least 5 years of systems engineering experience and a strong background in designing and deploying large-scale systems.

As a Site Reliability Engineer, you'll be responsible for improving the reliability and uptime of Tecsys' platform and applications. Your duties will include collaborating with engineering teams, maintaining services, developing automation tools, implementing monitoring and alerting systems, and managing high-severity incidents. You'll work with cutting-edge technologies like Azure, AWS, and various monitoring tools to ensure the smooth operation of Tecsys' systems.

The role requires a bachelor's degree in computer science or a related field, along with expertise in system design, full-stack automation, and cloud platforms (AWS or Azure). Knowledge of Java or .NET development, as well as experience with tools like Datadog, Rapid7 Insight, and GitLab, will be beneficial. The ideal candidate will be a self-starter who can work independently and collaborate effectively across teams and time zones.

Tecsys offers a flexible, remote-first work environment with occasional travel for team meetings and conferences. They value diversity and inclusion, welcoming applicants from all backgrounds. This position provides an excellent opportunity for those interested in working with industry-leading healthcare systems, distributors, and retailers while tackling interesting challenges in a continuously learning environment.

Join Tecsys to transform supply chains through technology and be part of a team that's making a significant impact in the industry. Apply now to contribute to the reliability and scalability of Tecsys' innovative solutions!

Last updated 14 days ago

Responsibilities For Site Reliability Engineer

  • Collaborate with other Engineering teams to support services
  • Maintain services by measuring and monitoring availability, latency and overall system health
  • Develop tools & automation on top of Azure & AWS
  • Scale systems sustainably through automation
  • Be on-call
  • Practice sustainable incident response and blameless postmortems
  • Implement automated solutions for continuous integration and delivery (CI / CD)
  • Implement monitoring, Logging, alerting, and SLA Reporting
  • Implement service monitoring dashboards displaying key metrics
  • Create and maintain technical documentation
  • Apply SRE best practices
  • Take command of high-severity incidents and facilitate their resolution
  • Provide support for planning and deployment teams
  • Collaborate with Platform Engineering team
  • Work cross-functionally with internal teams and vendors

Requirements For Site Reliability Engineer

Java
Linux
Kubernetes
  • Bachelor's degree in computer science or related technical discipline
  • At least 5 years' experience in systems engineering
  • Experience designing and deploying large scale systems
  • Strong knowledge of system design
  • High level of understanding and examples of executing projects with full stack automation
  • Self-organize, collaborate, and manage efforts across teams
  • Be a self-starter, curious, and not afraid to ask questions
  • Knowledge of Datadog preferred
  • Knowledge of Rapid7 Insight preferred
  • Knowledge and experience of AWS or Azure required
  • Basic knowledge of Java- or .Net-based development required
  • Knowledge of GitLab preferred
  • Experience with SaaS company is a strong asset
  • Experience with Fedramp compliance is a strong asset
  • Strong English communication skills
  • Canadian Citizen, Permanent Resident, or valid Canadian work permit

Benefits For Site Reliability Engineer

  • Remote work options
  • Collaborative workspaces
  • Freedom and flexibility to work productively

Interested in this job?

Jobs Related To Tecsys Inc. Site Reliability Engineer

Site Reliability Engineer

Join Tecsys as a Site Reliability Engineer to improve platform reliability and uptime through innovative solutions and best practices.

Platform Engineer (Service Reliability Engineer)

Senior Platform Engineer role focusing on service reliability, cloud infrastructure, and DevOps practices in a financial services environment.

Senior Site Reliability Engineer

Senior Site Reliability Engineer position at NordVPN, focusing on infrastructure automation and reliability for a leading VPN service provider.

Site Reliability Engineer- SRE

Senior Site Reliability Engineer position at Apple, focusing on platform engineering and cloud infrastructure for hardware engineering tools and data analytics.

Senior Site Reliability Engineer - Observability and Telemetry Platform

Senior SRE position at NVIDIA focusing on observability and telemetry platforms, offering competitive salary and opportunity to work with cutting-edge cloud technologies.