Microsoft's M365 COSMIC team is seeking a Senior Site Reliability Engineer to join their innovative platform team. The role focuses on maintaining and improving a global-scale managed-runtime environment based on Azure Kubernetes Service for Microsoft Substrate service and developers. As an SRE, you'll be responsible for ensuring platform health, managing upgrades, and implementing automation for incident remediation. The position offers a unique opportunity to work with cutting-edge cloud technology while maintaining critical infrastructure components.
The ideal candidate will bring strong technical expertise in software engineering or systems administration, with particular emphasis on cloud services and Kubernetes. You'll be part of a team that designs, builds, and operates solutions enabling substrate service teams to focus on their core business requirements rather than infrastructure concerns.
Working in a hybrid environment with up to 50% work from home flexibility, you'll collaborate with cross-functional teams to improve platform stability and efficiency. Microsoft offers comprehensive benefits including industry-leading healthcare, educational resources, and parental leave, along with a strong culture of inclusion and innovation.
This role presents an excellent opportunity for experienced engineers who want to impact Microsoft's cloud infrastructure at a global scale while working with the latest technologies in cloud computing and container orchestration. The position combines technical challenges with the opportunity to contribute to Microsoft's mission of empowering every person and organization on the planet to achieve more.