AWS Neuron is seeking a Software Engineer II to join their Machine Learning Applications team, focusing on distributed training solutions. This role is part of Annapurna Labs, acquired by AWS in 2015, which serves as the infrastructure provider for AWS. The position involves working with cutting-edge ML technologies, including AWS Inferentia and Trainium cloud-scale machine learning accelerators.
The role requires expertise in distributed training libraries like FSDP and Deepspeed, and involves close collaboration with chip architects, compiler engineers, and runtime engineers. You'll be responsible for developing and optimizing support for various ML model families, including large language models like GPT2/GPT3, stable diffusion, and Vision Transformers.
AWS offers a strong emphasis on work-life balance, mentorship, and career growth. The company maintains an inclusive culture with ten employee-led affinity groups and innovative benefit offerings. The team values knowledge sharing and supports new members through a broad mix of experience levels and tenures.
This position offers competitive compensation ranging from $129,300 to $223,600 based on geographic location, plus equity and comprehensive benefits. The role presents significant opportunities for working with large-scale systems and contributing to AWS's continued innovation in cloud infrastructure and machine learning acceleration.