Microsoft's Azure Machine Learning team is seeking a Principal Software Engineer to join their Inference team, focusing on building a model-serving platform for large models including OpenAI generative models. This role is part of Microsoft's vision to democratize ML and make it accessible to every enterprise, developer, and data scientist. The platform currently serves billions of requests daily for cutting-edge scenarios and models.
As a Principal Engineer, you'll be working on high-impact projects that support model inferencing at scale, particularly focusing on hosting models at the scale of Bing and Office. The role requires expertise in designing and implementing highly reliable, available platforms with emphasis on high throughput and low latency performance optimization.
The position offers a hybrid work environment with up to 50% work from home flexibility and requires 0-25% travel. You'll be working as an individual contributor in the Software Engineering discipline, bringing your 18+ years of experience to solve complex problems at the intersection of AI and Cloud computing.
This is an excellent opportunity for experienced engineers passionate about AI infrastructure who want to make a significant impact on Microsoft's AI platform capabilities. The role offers comprehensive benefits including industry-leading healthcare, educational resources, and various other perks that make Microsoft a great place to work.