Software Development Engineer II, AWS SageMaker Training

World's most comprehensive and broadly adopted cloud platform, pioneering cloud computing and continuous innovation.
Machine Learning
Mid-Level Software Engineer
In-Person
5,000+ Employees
3+ years of experience
AI · Enterprise SaaS · Cloud

Description For Software Development Engineer II, AWS SageMaker Training

AWS AI is building Amazon SageMaker Training, a customer-facing service for cloud-based deep learning aimed at data scientists and software engineers. As customers increasingly adopt LLMs and generative AI, the team is developing a next-generation AI platform optimized for distributed training. The role involves work on AI infrastructure that trains models with more than 100 billion parameters across thousands of GPUs.

The position sits within AWS Utility Computing (UC), which provides foundational services like S3 and EC2, along with continuous product innovations. You'll be part of a dynamic team building scalable solutions for worldwide customers, working closely with ML scientists and contributing to the team's strategic direction.

As an SDE, you'll design and develop distributed machine learning systems, collaborate with industry leaders, and help shape the future of AI computing. The role combines technical expertise with leadership opportunities, allowing you to influence architecture decisions and mentor junior engineers.

AWS offers an inclusive culture that values diverse experiences, with employee-led affinity groups and ongoing learning opportunities. The company emphasizes work-life harmony and provides extensive career development resources, including mentorship programs.

This is an ideal opportunity for someone passionate about large-scale deep learning, distributed systems, and building platforms that will shape the future of AI development. You'll work with cutting-edge technology while contributing to products that serve customers globally.

Last updated 2 days ago

Responsibilities For Software Development Engineer II, AWS SageMaker Training

  • Design, develop, test, and deploy distributed machine learning systems
  • Build and improve a next-generation AI platform
  • Collaborate with ML scientists and customers to influence overall strategy
  • Drive system architecture and best practices
  • Coach and develop junior engineers
  • Build scalable systems for large language model training
  • Collaborate with internal teams and leading technology companies

Requirements For Software Development Engineer II, AWS SageMaker Training

  • Python
  • 3+ years of non-internship professional software development experience
  • 2+ years of non-internship design or architecture experience
  • Experience programming in at least one programming language
  • Experience with design patterns and with the reliability and scaling of systems

Benefits For Software Development Engineer II, AWS SageMaker Training

  • Medical insurance
  • Dental insurance
  • Vision insurance
  • Work-life balance
  • Career development opportunities
  • Mentorship programs
  • Inclusive workplace culture
  • Learning and development resources
