Microsoft's Observability Platform and Experience engineering team is seeking a Principal Software Engineer to join their rapidly growing multi-billion dollar business. The role focuses on building and designing services that enable customers to monitor, detect, troubleshoot, and mitigate issues with their services through Azure Monitor products including Log Analytics, Application Insights, and Container Insights.
The position offers an exciting opportunity to work on one of the world's highest-scale observability services, processing over 1 Exabyte of logs daily and tracking over 100 billion active metrics. The team serves both Microsoft's internal needs and external customers, making it a crucial component of Microsoft's cloud infrastructure.
As a Principal Engineer, you'll be at the forefront of tackling complex technical challenges, particularly in addressing the observability needs of emerging AI workloads and AIOps. The role involves designing scalable, high-performance services while mentoring others and driving technical excellence across the team.
The position requires deep expertise in distributed systems, with experience in modern cloud technologies like Kubernetes, microservices architecture, and observability tools. You'll be working with various programming languages including Go, Java, and Rust, and will need to bring strong system design and problem-solving skills to the table.
Microsoft offers an excellent compensation package, with a base salary range of $161,600 - $286,200 (higher in SF and NYC areas), along with comprehensive benefits including healthcare, educational resources, and parental leave. The role offers up to 100% work from home flexibility with some travel (0-25%).
This is an exceptional opportunity for a senior technologist who wants to make a significant impact on the future of cloud observability while working with cutting-edge technologies at massive scale. You'll be joining a mission-driven team focused on building the world's greatest observability services, directly influencing how companies worldwide monitor and maintain their critical systems.