Sr Software Development Engineer

World's most comprehensive and broadly adopted cloud platform, pioneering cloud computing and continuous innovation.
$151,300 - $261,500
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
4+ years of experience
AI · Enterprise SaaS · Cloud

Description For Sr Software Development Engineer

AWS AI is seeking exceptional software developers for their Deep Learning cross-framework team. This role focuses on developing extensions for TensorFlow and PyTorch machine learning frameworks and creating cross-framework solutions for large-scale Deep Learning model training. As part of the SageMaker Engines team, you'll work on implementing model parallelism, memory optimization techniques, and network communication optimizations for AWS infrastructure. The position offers the opportunity to work with leading engineers and researchers in a fast-paced environment, contributing to innovative solutions that support training of Deep Learning models across thousands of accelerators. AWS values diverse experiences and maintains an inclusive culture through employee-led affinity groups and ongoing learning experiences. The role offers competitive compensation, comprehensive benefits, and emphasizes work-life harmony while providing opportunities for mentorship and career growth. Join AWS to be part of a team that's continuously innovating in cloud computing and trusted by companies from startups to Global 500 enterprises.

Last updated 11 minutes ago

Responsibilities For Sr Software Development Engineer

  • Developing innovative solutions for supporting Large Language Model training in a cluster of nodes
  • Implementing model parallelism methods such as pipeline and tensor parallelism as extensions to the PyTorch framework
  • Implementing sharding of the model training state, activation checkpointing/offloading and other memory saving techniques
  • Optimizing distributed training by profiling, identifying bottlenecks and improving performance
  • Optimizing communication collectives for the AWS network infrastructure

Requirements For Sr Software Development Engineer

Python
TypeScript
Kubernetes
  • 4+ years of non-internship professional software development experience
  • 4+ years of programming with at least one software programming language
  • 4+ years of leading design or architecture of new and existing systems
  • Experience as a mentor, tech lead or leading an engineering team
  • Bachelor's degree in computer science or equivalent
  • Strong working knowledge of C++ and Python programming languages
  • Experience with CUDA programming
  • Experience in developing highly scalable, fault-tolerant, distributed systems

Benefits For Sr Software Development Engineer

Medical Insurance
Dental Insurance
Vision Insurance
401k
Equity
  • Medical, financial, and other benefits
  • Equity compensation
  • Mentorship and career growth opportunities
  • Work-life harmony
  • Inclusive team culture

Interested in this job?

Jobs Related To Amazon Sr Software Development Engineer

Sr. Machine Learning Engineer, Amazon General Intelligence (AGI)

Senior Machine Learning Engineer role focused on developing cutting-edge LLMs and Generative AI solutions at Amazon's AGI team.

Machine Learning Engineer, Amazon General Intelligence (AGI)

Senior Machine Learning Engineer role at Amazon's AGI team, focusing on developing cutting-edge LLMs and Generative AI applications.

Software Development Engineer, Customer Engagement Technology

Senior Software Engineer role at Amazon focusing on building AI-powered customer service solutions using LLMs and conversational AI systems.

Applied Scientist, AWS SAAR

Senior Applied Scientist role at AWS focusing on machine learning and security analytics, offering competitive compensation and growth opportunities in a collaborative environment.

Senior Software Development Engineer, Customer Engagement Technology

Senior Software Engineer role at Amazon focusing on AI-powered customer service technology, building conversational AI systems and LLM infrastructure.