AWS Utility Computing (UC) is seeking a Senior Machine Learning Engineer to join their Distributed Training team for AWS Neuron. This role is part of Annapurna Labs, AWS's infrastructure provider focusing on silicon and software innovation. The position involves working on AWS Inferentia and Trainium, cloud-scale Machine Learning accelerators, developing solutions for massive-scale Large Language Models like GPT and Llama.
The role combines deep software engineering expertise with machine learning knowledge, requiring hands-on experience with distributed training libraries like FSDP and Deepspeed. You'll collaborate with chip architects and compiler engineers to optimize performance on custom AWS silicon. The position offers exposure to cutting-edge AI technologies and cloud computing innovations.
AWS provides a supportive environment emphasizing knowledge-sharing and mentorship, with opportunities for career growth and skill development. The company values diverse experiences and maintains an inclusive culture through various employee-led initiatives and affinity groups. Work-life harmony is prioritized, with flexibility built into the working culture.
The compensation package is comprehensive, including competitive base pay ranging from $151,300 to $261,500 depending on location, plus equity, sign-on payments, and extensive benefits. This is an opportunity to work at the intersection of cloud computing and machine learning, developing solutions that push the boundaries of what's possible in AI acceleration.