The Generative AI Innovation Center at AWS helps customers turn generative AI into concrete business opportunities. As a Senior Machine Learning Engineer, you'll join a multidisciplinary team of strategists, scientists, engineers, and architects working on state-of-the-art Large Language Models (LLMs).
Your role will involve designing and implementing distributed training pipelines for LLMs, using tools such as Fully Sharded Data Parallel (FSDP) and DeepSpeed. You'll adapt and fine-tune LLMs for a range of uses, such as new languages, new domains, and vision applications. A key part of the work is optimizing models for AWS's custom silicon (Inferentia and Trainium) using the AWS Neuron SDK.
Working directly with enterprise customers and foundation model providers, you'll help solve complex business and technical challenges through customized generative AI solutions. This position offers an opportunity to impact the future of AI technology while working with top AWS clients.
The role requires extensive experience in software development, machine learning, and system architecture. You'll need to demonstrate leadership abilities and have a track record of delivering complex technical solutions. Compensation ranges from $151,300 to $261,500 annually, plus additional benefits including equity and comprehensive healthcare.
Join AWS to be part of a team that builds generative AI solutions at scale and solves real-world problems for major enterprises.