Senior Site Reliability Engineer

Microsoft empowers every person and organization on the planet to achieve more through technology and cloud solutions.
Site Reliability
Senior Software Engineer
Hybrid
5,000+ Employees
6+ years of experience
Enterprise SaaS · Cloud

Description For Senior Site Reliability Engineer

Microsoft's M365 COSMIC team is seeking a Senior Site Reliability Engineer to join their innovative platform team. The role focuses on maintaining and improving a global-scale managed-runtime environment based on Azure Kubernetes Service for Microsoft Substrate service and developers. As an SRE, you'll be responsible for ensuring platform health, managing upgrades, and implementing automation for incident response and remediation. The position offers a unique opportunity to work with cutting-edge cloud technology while maintaining critical infrastructure components. The team operates like a 'Kubernetes PaaS', enabling substrate service teams to focus on their business requirements rather than infrastructure concerns. You'll be part of Microsoft's mission to empower global achievement, working in a culture that values growth mindset, innovation, and collaboration. The role combines technical expertise with strategic thinking, requiring both hands-on engineering skills and system design capabilities. Benefits include comprehensive healthcare, educational resources, and work-life balance support. The hybrid work environment offers flexibility with up to 50% work from home opportunity.

Last updated 6 hours ago

Responsibilities For Senior Site Reliability Engineer

  • Keep the platform components updated incorporating the dependencies from other applications/tech stacks and debug any issues
  • Improve platform by identifying patterns in service alerts/incidents and building auto-remediation solutions
  • Build dashboard/alerts for faster identification of issues and maintaining system health
  • Collaborate with cross-functional teams to define, design, and ship new features

Requirements For Senior Site Reliability Engineer

Kubernetes
Linux
  • 6+ years technical experience in software engineering, network engineering, or systems administration
  • Bachelor's/Master's Degree in Computer Science, Information Technology, or related field
  • Experience with Agile and iterative development processes
  • Must pass Microsoft Cloud Background Check
  • Cloud and services experience, preferably with Azure
  • Working knowledge of Kubernetes

Benefits For Senior Site Reliability Engineer

Medical Insurance
Education Budget
Parental Leave
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect

Interested in this job?

Jobs Related To Microsoft Senior Site Reliability Engineer

Senior Site Reliability Engineer

Senior Site Reliability Engineer role at Microsoft working on Windows 365 Cloud PC and Azure Virtual Desktop services, focusing on automation, reliability, and system optimization.

Senior Site Reliability Engineer

Senior Site Reliability Engineer position at Microsoft, focusing on Windows services reliability and customer support, offering remote work and competitive compensation.

Senior Site Reliability Engineer

Senior SRE position at Microsoft Security's Red Team, focusing on building and managing secure infrastructure for offensive security operations.

Senior Site Reliability Engineer

Senior Site Reliability Engineer role at Microsoft focusing on cloud infrastructure health and datacenter operations.

Senior Site Reliability Engineer

Senior Site Reliability Engineer role at Microsoft Security, focusing on Identity and Access Management systems, offering competitive pay and remote work options.