AWS Neuron is seeking a Software Engineer to join their Machine Learning Applications team, focusing on distributed training solutions. This role is part of Annapurna Labs, acquired by AWS in 2015, which serves as the infrastructure provider for AWS. The position involves working with cutting-edge ML technologies, including large language models like Llamas, Deepseeks, and GPTs.
The role combines software development expertise with machine learning knowledge, requiring work with distributed training libraries like FSDP and Deepspeed. You'll collaborate with chip architects and compiler engineers to optimize performance on AWS Trainium and Inferentia platforms.
AWS offers a strong culture of inclusion with ten employee-led affinity groups across 190 global chapters. The team values work-life balance and provides flexibility in working hours. There's a strong emphasis on mentorship and knowledge sharing, with opportunities for career growth through challenging projects.
The position offers competitive compensation ranging from $129,300 to $223,600 based on location, plus equity and comprehensive benefits. You'll be part of a team delivering products that impact millions, including AWS Nitro, ENA, EFA, Graviton, and ML Accelerators.
This is an excellent opportunity for someone passionate about machine learning infrastructure who wants to work at the intersection of hardware and software, developing solutions that power the next generation of AI applications at scale.