The Artificial Intelligence (AI) Frameworks team at Microsoft develops AI software that enables running AI models everywhere, from world's fastest AI supercomputers to mobile devices. This Principal Software Engineer role is within the team that builds software stacks for AI across Microsoft, working directly with OpenAI and shipping ONNX Runtime in major Microsoft products. The position involves working on multiple levels of the AI software stack, including fundamental abstractions, programming models, compilers, runtimes, and APIs to enable large-scale training and inferencing of models.
The role requires a senior technical leader who can set technical direction and demonstrate hands-on software design and development skills. The ideal candidate should have a proven track record of solving complex technical problems in building end-to-end AI stacks. The position involves optimizing performance of state-of-the-art LLMs, working with various hardware platforms including NVIDIA/AMD GPUs, and collaborating with internal and external partners.
Key responsibilities include driving improvements in LLM inference performance, building software tooling for performance insights, and enabling fast deployment of models at scale. The role requires expertise in high-performance computing, deep learning frameworks, and GPU architecture. The team operates in a collaborative, inclusive culture and contributes to the open-source community.
This is an exceptional opportunity for someone passionate about AI infrastructure who wants to impact Microsoft's AI capabilities across its product portfolio. The position offers comprehensive benefits, including industry-leading healthcare, educational resources, and various other perks. The work environment is hybrid, allowing up to 50% work from home, with 0-25% travel requirements.