Amazon's Shopping Convo Foundations team is seeking a Machine Learning Engineer to lead the development of cutting-edge AI technology using Large Language Models and Natural Language Processing. This role focuses on building core foundational capabilities to train and optimize large language models, creating tools and infrastructure to enhance experiences like Rufus. The position offers an opportunity to work with state-of-the-art deep learning and generative models, developing internet-scale data solutions for customer-facing shopping conversation experiences.
The role combines technical expertise in machine learning with practical software engineering, requiring skills in distributed systems, model optimization, and large-scale data processing. You'll work alongside talented engineers and scientists, tackling unprecedented technical challenges while building experiences used by millions globally. The team values collaboration, diversity, and continuous learning, making it an ideal environment for those passionate about advancing the field of generative AI.
Key technical aspects include implementing distributed training pipelines using tools like FSDP and DeepSpeed, customizing LLMs through pre-training and RLHF, and optimizing models for AWS silicon. The position offers competitive compensation ($129,300-$223,600 based on location), comprehensive benefits, and the opportunity to work with cutting-edge technology at one of the world's leading tech companies.
This role is perfect for candidates who combine strong machine learning expertise with software engineering skills and are excited about creating innovative customer experiences. You'll have the chance to influence technical strategy, lead architecture decisions, and work with multiple teams across Amazon, making a significant impact on the future of AI-powered shopping experiences.