Sr. Software Engineer- AI/ML, AWS Neuron Distributed Training

World's most comprehensive and broadly adopted cloud platform, pioneering cloud computing and continuous innovation.
$151,300 - $261,500
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS · Cloud

Description For Sr. Software Engineer- AI/ML, AWS Neuron Distributed Training

AWS Utility Computing (UC) is seeking a Senior Software Engineer to join their Machine Learning Applications team for AWS Neuron. This role focuses on developing and optimizing machine learning solutions using AWS's custom silicon accelerators - Inferentia and Trainium. The position involves working with cutting-edge ML technologies, including large language models like GPT-2/3 and vision transformers.

As a senior engineer, you'll collaborate with chip architects and software teams to build distributed training solutions, implement ML frameworks, and optimize performance for AWS's specialized hardware. The role combines deep software engineering expertise with machine learning knowledge, making it ideal for those passionate about both fields.

AWS offers an inclusive culture with employee-led affinity groups and continuous learning opportunities. The team values knowledge-sharing and mentorship, making it an excellent environment for professional growth. You'll be part of AWS's larger mission to innovate in cloud computing while working on solutions that help customers tackle previously impossible challenges.

The position offers competitive compensation ($151,300-$261,500 based on location) plus equity and comprehensive benefits. You'll be working in Seattle, contributing to AWS's industry-leading cloud platform while developing expertise in specialized areas like distributed ML systems and custom accelerator optimization.

This role represents a unique opportunity to work at the intersection of cloud computing, machine learning, and custom silicon, making a direct impact on how the world's most demanding ML workloads are trained and deployed.

Last updated an hour ago

Responsibilities For Sr. Software Engineer- AI/ML, AWS Neuron Distributed Training

  • Lead development of distributed training and inference support in PyTorch, TensorFlow, and JAX
  • Tune ML models for optimal performance on AWS Trainium and Inferentia silicon
  • Work with chip architects, compiler engineers, and runtime engineers
  • Create and build distributed training solutions with Trn1
  • Support development of AWS Compute, Database, Storage, Platform services
  • Enable and performance tune various ML model families including GPT2, GPT3, stable diffusion, and Vision Transformers

Requirements For Sr. Software Engineer- AI/ML, AWS Neuron Distributed Training

Python
Java
TypeScript
  • 5+ years of non-internship professional software development experience
  • 5+ years of programming experience with at least one programming language
  • 5+ years of leading design or architecture experience
  • 5+ years of full software development life cycle experience
  • Experience as a mentor, tech lead or leading an engineering team
  • Machine Learning knowledge in frameworks and end-to-end model training
  • Bachelor's degree in computer science or equivalent (preferred)
  • Experience with distributed training libraries like FSDP and Deepspeed

Benefits For Sr. Software Engineer- AI/ML, AWS Neuron Distributed Training

Medical Insurance
401k
Parental Leave
  • Full range of medical benefits
  • Financial benefits
  • Work-life harmony
  • Career growth opportunities
  • Mentorship programs
  • Employee-led affinity groups

Interested in this job?

Jobs Related To Amazon Sr. Software Engineer- AI/ML, AWS Neuron Distributed Training

Sr. Software Engineer- AI/ML, AWS Neuron Distributed Training

Senior ML Engineer role at AWS developing distributed training solutions for cloud-scale ML accelerators, working with cutting-edge AI technologies and custom silicon.

ML Compiler Engineer, Annapurna Labs

Senior ML Compiler Engineer role at AWS developing cutting-edge deep learning compiler stack for custom ML accelerators, offering competitive compensation and growth opportunities.

Senior ML Engineer, AWS Generative AI Innovation Center

Senior ML Engineer position at AWS Generative AI Innovation Center, focusing on developing AI solutions and helping customers implement generative AI technologies.

Sr. Software Development Engineer, AWS Compute Services

Senior SDE role at AWS focusing on ML/AI services, building distributed systems with 5+ years experience required, competitive salary $151K-$261K.

Sr. Software Engineer- AI/ML, AWS Neuron Apps

Senior Software Engineering role at AWS focusing on machine learning infrastructure, distributed training, and ML model optimization for cloud-scale accelerators.