Site Reliability Engineer II

Microsoft's mission is to empower every person and every organization on the planet to achieve more.
$98,300 - $208,800
Site Reliability
Senior Software Engineer
Hybrid
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS · Cloud

Description For Site Reliability Engineer II

The Azure Customer Experience Platform (CXP) team at Microsoft is seeking a Site Reliability Engineer II to join its mission of transforming Microsoft Cloud customers into fans. As part of the Azure engineering organization, the team focuses on improving Cloud quality, security, and reliability through deep engineering engagements with customers and teams across Microsoft.

The role involves partnering with customers and first parties in migrating to Azure, designing highly reliable solutions, contributing to next-generation cloud infrastructure architecture, and engaging in production triage efforts. It also requires collaboration with various teams, participation in on-call rotations, and driving continuous improvement in customer self-supportability and incident management.

The ideal candidate will have 5+ years of experience in Site Reliability Engineering, Service Engineering, or Production Engineering within online services environments, supporting both Linux and Windows platforms, along with demonstrated experience in large-scale distributed systems, cloud computing, and modern distributed design patterns.

Microsoft offers a comprehensive benefits package and is committed to fostering an inclusive work environment.

Last updated 2 months ago

Responsibilities For Site Reliability Engineer II

  • Partner with customers in migrating to Azure and designing highly reliable solutions
  • Contribute to next-generation architecture for Cloud infrastructure services
  • Engage in production triage efforts and identify product gaps
  • Collaborate with teams to ensure non-functional production support requirements are adopted early
  • Help customers achieve the right RPO/RTO and Composite SLA
  • Participate in on-call coverage rotation and provide leadership for major incidents
  • Drive implementation of customer-centric mitigation levers and playbooks
  • Provide excellent incident communication to stakeholders
  • Work within a 'Follow the Sun' global shift rotation
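
For readers unfamiliar with the Composite SLA item in the list above, the arithmetic can be sketched as follows. This is a minimal illustration, not part of the posting: it assumes component failures are independent, and the function names and availability figures are hypothetical.

```python
from math import prod


def serial_sla(availabilities):
    """Composite availability when every component must be up (a chain)."""
    return prod(availabilities)


def redundant_sla(availabilities):
    """Composite availability when any one component is enough (failover)."""
    return 1 - prod(1 - a for a in availabilities)


# Hypothetical figures, for illustration only.
web = 0.9995  # web tier availability
db = 0.9999   # database availability

# A request needs both the web tier and the database.
single_region = serial_sla([web, db])

# Two independent copies of that stack, either of which can serve traffic.
two_regions = redundant_sla([single_region, single_region])

print(f"single region: {single_region:.6f}")
print(f"two regions:   {two_regions:.8f}")
```

Chaining dependencies lowers the composite figure below that of the weakest component, while independent redundancy raises it, which is why these targets are usually set per workload alongside RPO and RTO.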

Requirements For Site Reliability Engineer II

Linux
  • 4+ years technical experience in software engineering, network engineering, or systems administration
  • Bachelor's Degree in Computer Science, Information Technology, or related field
  • Ability to meet Microsoft, customer and/or government security screening requirements
  • Pass Microsoft Cloud background check upon hire/transfer and every two years thereafter
  • Experience with Azure, Azure Services, and dependencies
  • Knowledge of cloud computing concepts (compute, storage, networking, container orchestration)
  • Familiarity with modern distributed design patterns and cloud systems architecture

Benefits For Site Reliability Engineer II

Medical Insurance
Education Budget
Parental Leave
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect
