AWS Machine Learning accelerators represent the cutting edge of AWS innovation, particularly in the realm of Generative AI development. The position is within the Amazon Annapurna Labs team, which spearheads silicon development at AWS, focusing on the AWS Neuron team that optimizes neural net model performance on custom AWS hardware.
As a Machine Learning Compiler Engineer II, you'll be at the forefront of developing a compiler system that handles the world's largest ML workloads. The role involves working with AWS's custom chips - Inferentia, which delivers best-in-class ML inference performance at the lowest cost in cloud, and Trainium, designed for optimal ML training performance.
The position requires expertise in compiler optimization and software development, with a focus on the AWS Neuron Software Development Kit (SDK). This SDK includes an ML compiler and runtime that integrates with popular ML frameworks like PyTorch, TensorFlow, and MxNet. The technology you'll work with is already being used at scale by major customers including Snap, Autodesk, Amazon Alexa, and Amazon Rekognition.
The role offers an exceptional opportunity to work with some of the brightest minds in engineering, research, and product development. You'll be involved in architecting and implementing critical features, publishing cutting-edge research, and working directly with AWS ML services teams. The position also includes participation in pre-silicon design and bringing new products and features to market.
Amazon offers a comprehensive benefits package, including competitive salary, medical and financial benefits, and flexible working hours. The company strongly emphasizes work-life balance and provides numerous opportunities for professional growth through mentorship and knowledge sharing. The team culture is highly inclusive, supported by ten employee-led affinity groups with over 40,000 members globally.
This is an ideal position for someone who wants to make a significant impact in the field of machine learning hardware acceleration while working with cutting-edge technology at one of the world's leading tech companies. The role combines technical depth with broad impact, offering the chance to influence the future of ML infrastructure at AWS.