Microsoft's Azure Managed Inference team is seeking a Senior Software Engineer to join their cutting-edge AI platform initiative. This role focuses on building and maintaining a highly reliable platform supporting model inferencing at massive scale - handling billions of requests daily. You'll be working at the intersection of AI and Cloud, supporting both Azure customers and internal Microsoft products like Bing and Office.
The position offers an opportunity to work with Generative LLMs and solve complex problems in distributed computing and high-performance systems. You'll be responsible for designing scalable architectures, ensuring security and performance optimization, and leading technical initiatives that directly impact Microsoft's AI infrastructure.
The ideal candidate brings strong expertise in distributed systems, cloud platforms, and software engineering fundamentals. You'll need to demonstrate proficiency in languages like Python/Java/C#, have experience with cloud platforms, and understand large-scale data processing. The role combines hands-on technical work with leadership responsibilities, including code reviews, incident management, and cross-team collaboration.
This hybrid position offers comprehensive benefits, including industry-leading healthcare, educational resources, and parental leave. You'll be part of Microsoft's mission to democratize AI technology while working with cutting-edge technologies and world-class engineers.