Microsoft's Azure Data engineering team is seeking a Senior Site Reliability Engineer to join its databases team, working specifically on Azure Cosmos DB. The role is central to maintaining Microsoft's operational database systems, which are meant to be developer-friendly, mission-critical, and AI-enabled. Azure Cosmos DB is a globally distributed, massively scalable, multi-model cloud database service designed for planet-scale applications.
The ideal candidate will be responsible for building and optimizing solutions that analyze massive amounts of telemetry and service health indicators in near real-time, performing automated root cause analysis, and implementing necessary mitigations to maintain strict Service Level Objectives (SLOs). The role requires collaboration with engineering teams, customer interaction, and a data-driven approach to problem-solving.
The position is based in Vancouver with a hybrid work arrangement (up to 50% work from home), on a team that operates like a startup while having the resources and impact of Microsoft. It offers competitive compensation (CAD $108,100 to $199,700) and comprehensive benefits, including healthcare, educational resources, and parental leave.
The role suits experienced engineers who are passionate about service reliability, automation, and large-scale distributed systems. It combines deep technical work with customer interaction and collaborative problem-solving.