Senior Site Reliability Engineer

Microsoft Digital (MSD) is the team that innovates and delivers Microsoft's employee experience, running internal infrastructure and building hybrid solutions.
Site Reliability
Senior Software Engineer
In-Person
8+ years of experience
Enterprise SaaS

Description For Senior Site Reliability Engineer

Microsoft Digital (MSD) is seeking a Senior Site Reliability Engineer to join its Data Platform & Growth organization. This role is crucial in powering, protecting, and transforming the employee experience at Microsoft globally. As an SRE, you'll work with cutting-edge technologies, including Gen AI, ML, and modern infrastructure tools, to build and maintain highly scalable distributed systems.

The position offers an opportunity to work with Microsoft's internal network and infrastructure, focusing on campus modernization and hybrid solutions. You'll be responsible for ensuring high stability and performance of services, working closely with cross-functional teams to drive system improvements and automation initiatives.

The ideal candidate is passionate about distributed systems, enjoys technological challenges, and thrives in a fast-paced environment. You'll have the chance to make a significant impact on millions of end users while working with an amazing team in a collaborative culture that embraces a growth mindset.

Key aspects of the role include providing technical leadership, architecting large-scale integrated systems, implementing AIOps, and driving a culture of automation and resilience. You'll be instrumental in managing network monitoring tools and ensuring service reliability at scale.

Microsoft offers comprehensive benefits including industry-leading healthcare, educational resources, parental leave, and investment opportunities. The position is based in Hyderabad, India, is in-person, and may involve travel (0-25%). Join Microsoft's mission to empower every person and organization on the planet to achieve more.

Last updated 9 days ago

Responsibilities For Senior Site Reliability Engineer

  • Uphold a high organizational standard of employee and team satisfaction
  • Provide technical leadership to a team of highly passionate and skilled engineers
  • Build, run and improve critical public-sector service environments
  • Own deployment, availability, reliability, performance and customer escalation targets
  • Architect and review designs for large-scale integrated systems
  • Drive a culture of building resilient architectures
  • Identify efficient operational practices and drive a culture of automating repetitive tasks
  • Manage network monitoring tools, including their architecture, deployment, and engineering

Requirements For Senior Site Reliability Engineer

  • 8+ years technical experience in software engineering, network monitoring tools, or systems administration
  • Bachelor's/Master's Degree in Computer Science, Information Technology, or related field
  • Current software development expertise in programming and scripting languages and automation tools (C#, C++, Java, Python, shell scripting, Ansible)
  • Proven experience driving improvements and delivering solutions
  • Experience with AIOps and automation at scale
  • Experience designing, building, servicing, and driving ongoing improvement of service infrastructure & systems
  • Technical understanding of network-as-code and network automation, as well as AIOps in the networking space

Benefits For Senior Site Reliability Engineer

  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect
