Sr. SDE (L6), ML Ops

AWS Infrastructure Services team manages global cloud infrastructure design, planning, delivery, and operations.
$150,700 - $251,700
Machine Learning
Staff Software Engineer
In-Person
5+ years of experience
AI · Enterprise SaaS · Cloud

Description For Sr. SDE (L6), ML Ops

The AWS Infrastructure Services (AIS) team is seeking a Senior Software Engineer to join their Science team, focusing on MLOps and infrastructure optimization. This role sits at the intersection of cloud infrastructure and machine learning, working to optimize power and cooling across AWS's global data centers.

The position offers a unique opportunity to impact AWS's infrastructure worldwide by building and maintaining machine learning workflows and platform services. You'll be working with a team of scientists, program managers, and data engineers to develop solutions that directly influence server demand planning and infrastructure efficiency.

As a Sr. SDE (L6), you'll lead the development of platforms for deploying, productionalizing, and scaling machine learning models. Your responsibilities will include designing and implementing training and inference infrastructure, collaborating on improved systems that accelerate innovation, and engineering robust, fault-tolerant solutions for rack planning and forecasting.

The role requires strong software development experience (5+ years) and expertise in ML infrastructure. You'll be working in a collaborative environment that values work-life balance while tackling complex challenges in data processing, model hosting, and metric monitoring. The position offers competitive compensation ($150,700 - $251,700) plus equity and comprehensive benefits.

This is an excellent opportunity for someone passionate about machine learning operations who wants to make a significant impact on cloud infrastructure. You'll be working with cutting-edge technology while helping shape the future of AWS data centers. The role combines technical leadership with hands-on development, making it ideal for experienced engineers who enjoy both building systems and mentoring others.

The position is based in Vancouver, Canada, and offers the chance to work with a global team on mission-critical infrastructure. Amazon provides a comprehensive benefits package, including medical coverage, financial benefits, and equity compensation. If you're excited about scaling ML operations and optimizing cloud infrastructure while working with a talented team, this role offers the perfect blend of challenge and opportunity.

Last updated an hour ago

Responsibilities For Sr. SDE (L6), ML Ops

  • Lead the design and implementation of training and inference infrastructure for machine learning models
  • Collaborate with scientists and data engineers to develop improved training and inference infrastructure
  • Engineer solutions for AWS infrastructure's rack planning and forecasting distributed workflows
  • Build infrastructure to support all phases of ML models from R&D to production
  • Manage model retraining and iteration processes

Requirements For Sr. SDE (L6), ML Ops

Python
  • 5+ years of non-internship professional software development experience
  • 5+ years of programming with at least one software programming language experience
  • 5+ years of leading design or architecture of new and existing systems experience
  • Experience as a mentor, tech lead or leading an engineering team
  • Experience with developing MLOps tooling and frameworks

Benefits For Sr. SDE (L6), ML Ops

Medical Insurance
Equity
  • Medical benefits
  • Financial benefits
  • Equity compensation
  • Sign-on payments
  • Comprehensive benefits package

Interested in this job?

Jobs Related To Amazon Sr. SDE (L6), ML Ops

Applied Scientist, Neuron ARG

Applied Scientist role in AWS Neuron Compiler team, working on AI and program analysis for deep learning compiler optimization.

ML Architect, SSG Science

ML Architect position at Amazon Devices, developing next-generation SOCs for machine learning-enabled consumer products.

AIML - Engineering Manager, ML Systems Evaluation Engineering

Engineering Manager position at Apple leading ML systems evaluation, focusing on Siri and Apple Intelligence products, offering competitive compensation and benefits.

Engineering Program Manager, On-Device ML

Engineering Program Manager position at Apple focusing on on-device machine learning implementation and cross-functional team leadership.

Engineering Manager - Machine Learning for Content Personalization

Lead Netflix's Machine Learning team for content personalization, developing next-gen algorithms for entertainment recommendation.