Annapurna Labs, now fully integrated with AWS after its 2015 acquisition, is seeking a Senior Machine Learning Engineer for their Distributed Training team. This role focuses on AWS Neuron, the complete software stack for AWS Trainium and Inferentia cloud-scale ML accelerators. The position involves working with cutting-edge ML technologies, including Large Language Models like GPT and Llama, as well as Stable Diffusion and Vision Transformers. The team operates at the intersection of hardware and software, developing solutions that push the boundaries of what's possible in cloud computing.
The role demands expertise in distributed training libraries such as FSDP, Deepspeed, and Nemo, with a focus on extending these capabilities for Neuron-based systems. You'll collaborate with cross-functional teams, including chip architects and compiler engineers, to optimize performance on AWS custom silicon. The position offers significant growth opportunities within AWS's innovative culture, which values diversity, continuous learning, and work-life harmony.
AWS, as the world's leading cloud platform, provides an environment where you'll work on challenging problems that impact global businesses. The company offers competitive compensation, including base pay ranging from $129,300 to $223,600 depending on location, plus equity and comprehensive benefits. This is an opportunity to join a team that celebrates knowledge-sharing, mentorship, and inclusive culture while working on technology that shapes the future of cloud computing.