Microsoft's Azure Machine Learning (ML) team is seeking a Software Engineer II to join their Inference team, which is responsible for building the model-serving platform for large models, including OpenAI generative models. The role involves working on a platform that serves billions of requests per day for cutting-edge ML scenarios and models.
The position offers an exciting opportunity to work at the intersection of AI and Cloud, building systems that support model inferencing at scale. You'll be part of a team that hosts models at the scale of Bing and Office, solving complex technical challenges in machine learning infrastructure.
As a mid-level developer, you'll be responsible for the complete lifecycle of features, from conception to production deployment. The role requires strong software engineering skills, with a focus on building highly reliable and available systems. You'll work on optimizing performance for high-throughput and low-latency scenarios.
The ideal candidate should have at least 3 years of software development experience and a strong educational background in computer science or related fields. Experience with real-time services handling high throughput and low latency requirements is valuable.
Microsoft offers comprehensive benefits, including industry-leading healthcare, educational resources, parental leave, and opportunities for professional growth. The position is based in Hyderabad, India, with a hybrid work arrangement allowing up to 50% work from home. Join Microsoft's mission to democratize ML and make it accessible to every enterprise, developer, and data scientist.