Microsoft's AI Frameworks team is seeking a Senior Software Engineer specializing in GPU optimization to join their cutting-edge AI development efforts. This role sits at the intersection of AI innovation and hardware optimization, working directly with OpenAI and other state-of-the-art Large Language Models.
The position involves developing AI software that enables running AI models across various platforms, from supercomputers to mobile devices. You'll be responsible for optimizing inference performance of OpenAI models and working on multiple levels of the AI software stack, including fundamental abstractions, programming models, compilers, and runtimes.
The ideal candidate will have strong experience in high-performance computing, GPU optimization, and deep learning frameworks. You'll work with technologies like PyTorch, TensorFlow, and CUDA, while collaborating with researchers and developers to optimize and scale model training and inference.
This role offers the opportunity to impact major Microsoft products including Office, Windows, Bing, and SQL Server, serving trillions of inferences per day. You'll be part of a team that's pushing the boundaries of AI acceleration and optimization, working with both software and hardware teams to build the future of AI infrastructure.
The position comes with competitive compensation, comprehensive benefits, and the chance to work on some of the most challenging problems in AI computing. Microsoft offers a collaborative, inclusive work environment with a growth mindset culture, making it an ideal place for engineers passionate about AI and high-performance computing.
Working at Microsoft means joining a company committed to empowering others and achieving ambitious goals. You'll have access to world-class resources, leading-edge technology, and the opportunity to shape the future of AI computing while working with some of the best minds in the industry.