Microsoft's Azure Data engineering team is seeking a Site Reliability Engineer II to join their databases team, focusing on operational Database systems. This role is part of the Azure Cosmos DB team, Microsoft's globally distributed, massively scalable, multi-model cloud database service. The position offers up to 100% remote work with 0-25% travel requirements.
The ideal candidate will focus on building and optimizing solutions for analyzing massive amounts of telemetry and service health indicators in real-time, performing automated root cause analysis, and implementing necessary mitigations to maintain Service Level Objectives (SLOs). The role requires expertise in large-scale cloud services, with emphasis on improving service reliability, availability, and performance.
The position offers competitive compensation ranging from $98,300 to $193,200 per year (higher in SF and NYC areas), along with comprehensive benefits including healthcare, educational resources, and parental leave. This is an excellent opportunity to work with cutting-edge technology in a team that operates like a startup while being part of a major tech company.
The role combines technical expertise with customer interaction, requiring both strong engineering skills and the ability to communicate effectively with enterprise customers. You'll be working on critical systems that serve various industries including Healthcare, Retail, Telecommunications, and IoT, where service availability and latency are paramount.
Microsoft values diversity and encourages applications from candidates with different experiences and perspectives. The company's mission is to empower every person and organization on the planet to achieve more, making this an excellent opportunity for those passionate about making a global impact through technology.