AWS Neuron is seeking a Software Engineer to join their Machine Learning Applications team, focusing on distributed training solutions. This role is part of Annapurna Labs, acquired by AWS in 2015, which serves as the infrastructure provider for AWS. The position involves working with cutting-edge ML technologies, including large language models, stable diffusion, and Vision Transformers.
The role combines software development expertise with machine learning knowledge, requiring collaboration with chip architects and compiler engineers. You'll be responsible for developing and optimizing distributed training support across multiple frameworks like PyTorch, TensorFlow, and JAX, while working with AWS's custom silicon solutions (Trainium and Inferentia).
AWS offers a highly inclusive culture with ten employee-led affinity groups and innovative benefits. The team values work-life balance and provides flexibility in working hours. There's a strong emphasis on mentorship and knowledge sharing, with opportunities for career growth through challenging projects.
The compensation is competitive, ranging from $129,300 to $223,600 based on location and experience, plus additional benefits and equity opportunities. This is an excellent opportunity for someone passionate about machine learning infrastructure and distributed systems who wants to impact millions of users worldwide while working with cutting-edge technology at scale.