Annapurna Labs, now fully integrated into AWS after its 2015 acquisition, is seeking a Senior Machine Learning Engineer for their Distribute Training team within AWS Neuron. This role focuses on developing and optimizing machine learning solutions for AWS's custom silicon accelerators, Trainium and Inferentia.
The position involves working with cutting-edge ML technologies, including Large Language Models (LLM) like GPT and Llama, as well as Stable Diffusion and Vision Transformers. You'll be at the intersection of hardware and software, collaborating with chip architects and compiler engineers to push the boundaries of distributed training solutions.
As part of AWS, you'll join a team that values knowledge-sharing, mentorship, and career growth. The role offers competitive compensation ($151,300 - $261,500 based on location) and comprehensive benefits. AWS's inclusive culture celebrates diversity through employee-led affinity groups and ongoing learning experiences.
The ideal candidate will bring strong software development skills, deep ML expertise, and experience with frameworks like PyTorch/JAX/TensorFlow. You'll be working on AWS Neuron, the complete software stack for AWS's cloud-scale ML accelerators, making direct impacts on how customers leverage AWS's infrastructure for their ML needs.
This is an opportunity to shape the future of machine learning infrastructure at AWS, working with a team that's dedicated to innovation and technical excellence. The role combines hands-on technical leadership with the chance to mentor others and contribute to AWS's mission of being Earth's Best Employer.