Senior Software Development Engineer, ML Ops, AWS Infrastructure Science Engineering

AWS Infrastructure Services owns and operates all AWS global infrastructure, managing data centers and cloud operations worldwide.
$151,300 - $261,500
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS · Cloud

Description For Senior Software Development Engineer, ML Ops, AWS Infrastructure Science Engineering

AWS Infrastructure Services (AIS) is at the heart of Amazon's cloud operations, responsible for the design, planning, delivery, and operation of AWS's global infrastructure. The Science team within AIS focuses on leveraging big data and machine learning to optimize power and cooling across data centers.

As a Senior Software Engineer on the AIS Science team, you'll join the Lanner team - a tight-knit group of eight developers working on critical infrastructure. Your role will focus on building, operationalizing, and scaling machine learning workflows and platform services. You'll collaborate with scientists, program managers, and data engineers to develop improved training and inference infrastructure that accelerates innovation.

Key responsibilities include:

  • Leading the design and implementation of stable and efficient training/inference infrastructure
  • Developing platforms for deploying, productionalizing, and scaling ML models
  • Engineering solutions for robust, fault-tolerant systems across input and output organizations
  • Focusing on model retraining and ongoing monitoring

The team culture emphasizes work-life balance while tackling complex challenges in data processing, model hosting, and metric monitoring. You'll be part of a group that includes one Senior SDE, three junior, and three entry-level engineers. The team maintains healthy boundaries while staying connected through weekly happy hours, regular lunches, and team events.

The compensation package is comprehensive, ranging from $151,300 to $261,500 per year based on location, plus equity, sign-on payments, and full benefits. This role offers an opportunity to make a lasting impact on AWS infrastructure worldwide while working with cutting-edge ML technologies in a collaborative environment.

Last updated 4 days ago

Responsibilities For Senior Software Development Engineer, ML Ops, AWS Infrastructure Science Engineering

  • Lead design and implementation of training and inference infrastructure for ML models
  • Collaborate with scientists and engineers to develop improved training systems
  • Build and maintain scalable platforms for ML model deployment
  • Implement model monitoring and retraining systems
  • Engineer robust and fault-tolerant solutions for distributed workflows

Requirements For Senior Software Development Engineer, ML Ops, AWS Infrastructure Science Engineering

Skills: Python
  • 5+ years of non-internship professional software development experience
  • 5+ years of programming experience with at least one programming language
  • 5+ years of experience leading design or architecture
  • Experience as a mentor, tech lead or leading an engineering team
  • Experience with the full software development life cycle
  • Master's degree in machine learning or equivalent preferred
  • Experience with developing MLOps tooling and frameworks preferred
