Software Engineer- AI/ML, AWS Neuron

Amazon is a global technology company that provides cloud computing, e-commerce, artificial intelligence, and digital streaming services.
$129,300 - $223,600
Machine Learning
Senior Software Engineer
Contact Company
5,000+ Employees
3+ years of experience
AI · Enterprise SaaS

Description For Software Engineer- AI/ML, AWS Neuron

AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine learning accelerators and the Trn1 and Inf1 servers that use them. This role is for a senior software engineer on the Machine Learning Applications (ML Apps) team for AWS Neuron. The role is responsible for the development, enablement, and performance tuning of various ML model families, including large language models such as GPT-2 and GPT-3, Stable Diffusion, and Vision Transformers.

Key responsibilities include:

  • Leading efforts to build distributed training and inference support into PyTorch, TensorFlow, and JAX using XLA and the Neuron compiler and runtime stacks.
  • Tuning models for highest performance and efficiency on AWS Trainium and Inferentia silicon and the Trn1 and Inf1 servers.
  • Working closely with chip architects, compiler engineers, and runtime engineers to create, build, and tune distributed training solutions with Trn1.

The role requires strong software development skills and ML knowledge. Experience with Python, FSDP, DeepSpeed, and other distributed training libraries is essential.

AWS values work-life balance and offers flexibility in working hours. The team embraces diversity and inclusion, with employee-led affinity groups and ongoing learning experiences. Career growth and mentorship opportunities are prioritized, with projects assigned to help team members develop into well-rounded professionals.

This position offers a competitive compensation package, including base pay ranging from $129,300 to $223,600 per year depending on the geographic market, as well as potential equity, sign-on payments, and a full range of benefits.

Last updated 3 months ago

Responsibilities For Software Engineer- AI/ML, AWS Neuron

  • Lead efforts to build distributed training and inference support into PyTorch, TensorFlow, and JAX
  • Tune ML models for highest performance on AWS Trainium and Inferentia silicon
  • Work with chip architects, compiler engineers, and runtime engineers on distributed training solutions
  • Develop and optimize various ML model families, including large language models and vision transformers

Requirements For Software Engineer- AI/ML, AWS Neuron

  • 3+ years of non-internship professional software development experience
  • 2+ years of non-internship design or architecture experience
  • Experience programming in at least one programming language
  • Strong software development and ML knowledge
  • Experience with Python and distributed training libraries (e.g., FSDP, DeepSpeed)

Benefits For Software Engineer- AI/ML, AWS Neuron

  • Flexible working hours
  • Career growth opportunities
  • Mentorship programs
  • Competitive compensation package
  • Equity options
  • Sign-on payments
  • Full range of medical and financial benefits

Jobs Related To Amazon Software Engineer- AI/ML, AWS Neuron

Software Development Engineer, Prime Video Sports

Senior Software Engineer role at Amazon Prime Video Sports, focusing on ML/CV technology to enhance sports streaming experiences.

Machine Learning Engineer III, FAR (Frontier AI & Robotics)

Senior ML Engineer role at Amazon Robotics, optimizing large-scale foundation models and working with world-class AI researchers to advance robotics technology.

ASIC Design Engineer, Cloud-Scale Machine Learning Acceleration team

Senior ASIC Design Engineer position for AWS's Machine Learning Acceleration team, focusing on custom SoC design and optimization.

Applied Scientist, AWS SAAR

Senior Applied Scientist role at AWS focusing on machine learning and security analytics, developing innovative solutions for cloud security services.

Sr. Software Engineer- AI/ML, AWS Neuron Distributed Training

Senior ML Engineer role at AWS focusing on distributed training systems and ML accelerators, offering competitive pay and growth opportunities.