Principal Software Engineer - GPU Performance

Microsoft builds cloud-enabled solutions and innovative technology products, including Azure AI Platform and enterprise software solutions.
$161,600 - $286,200
Machine Learning
Principal Software Engineer
Remote
5,000+ Employees
8+ years of experience
AI · Enterprise SaaS

Description For Principal Software Engineer - GPU Performance

Microsoft's AI Platform organization is seeking a Principal Software Engineer to work on GPU performance analysis and optimization. The role sits within the team that builds Azure OpenAI, Azure Machine Learning, Cognitive Services, and the global Azure AI infrastructure. The position involves working with new AI hardware and software, collaborating with OpenAI, and optimizing large-scale AI model training and inference.

The role requires deep expertise in GPU optimization, C/C++ programming, and AI/ML systems. You'll be working on Microsoft's next-generation supercomputers and the new Maia-100 AI accelerator, directly contributing to the development of fundamental abstractions, programming models, and APIs for AI hardware.

As a Principal Engineer, you'll be at the forefront of AI infrastructure development, working with world-class teams to solve complex technical challenges in AI computation. The position offers competitive compensation, comprehensive benefits, and the opportunity to work on projects that impact millions of users worldwide.

Microsoft provides an inclusive work environment and values diversity of thought and perspective. The role can be performed up to 100% from home, though some travel (0-25%) may be required. This is an opportunity for someone passionate about high-performance computing, AI systems, and building scalable solutions for the future of artificial intelligence.

Last updated 4 days ago

Responsibilities For Principal Software Engineer - GPU Performance

  • Collaborate across disciplines, from hardware designers to ML developers
  • Engage with partners to implement performance analysis and optimization for LLMs
  • Develop software in C/C++ and Python, including GPU programming with CUDA, ROCm, and Triton
  • Identify requirements, scope solutions, estimate work, and schedule deliverables

Requirements For Principal Software Engineer - GPU Performance

Python
  • Bachelor's Degree in Computer Science or related technical discipline
  • 8+ years of technical engineering experience with coding
  • 4+ years of experience with C/C++
  • 4+ years of practical experience working with GPUs and optimizing GPU kernels
  • Experience analyzing low-level program behavior and proficiency with profiling tools
  • Technical background in software engineering principles and architecture design
  • Must pass Microsoft Cloud Background Check

Benefits For Principal Software Engineer - GPU Performance

Medical Insurance
Education Budget
Parental Leave
Mental Health Assistance
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect


Jobs Related To Microsoft Principal Software Engineer - GPU Performance

Principal Research Software Engineer

Principal Research Software Engineer position at Microsoft Research AI Frontiers, focusing on advanced AI systems development with competitive compensation and benefits.

Principal Applied Machine Learning Engineer

Principal Applied Machine Learning Engineer position at Microsoft focusing on cybersecurity AI research and development.

Principal Software Engineer - AI Platform

Principal Software Engineer role at Microsoft working on AI platforms and infrastructure.

Principal Machine Learning Research Engineer

Principal Machine Learning Research Engineer position at Microsoft, focusing on ML/hardware co-design and optimization for cloud infrastructure.

Principal Software Engineer

Principal Software Engineer role at Microsoft working on Azure AI and machine learning infrastructure, requiring 8+ years of experience.