ML Engineer L4, Consumer Inference

Netflix is one of the world's leading entertainment services with 278 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages.
$100,000 - $464,000
Machine Learning
Mid-Level Software Engineer
In-Person
5,000+ Employees
4+ years of experience
AI · Enterprise SaaS · Entertainment

Description For ML Engineer L4, Consumer Inference

Netflix is seeking a Machine Learning Engineer to join their Machine Learning Platform (MLP) team. This role will focus on bridging the gap between ML research and productization. Key responsibilities include:

  • Developing customer-facing libraries and services for efficient and scalable ML model inference
  • Building and maintaining online inference services for real-time predictions
  • Optimizing and deploying large language models (LLMs) for production environments
  • Maintaining and improving a model registry for ML model governance
  • Participating in ML Platform incident management and support

The ideal candidate will have:

  • Strong programming skills in Python and Java
  • Experience with ML libraries like TensorFlow and PyTorch
  • Familiarity with GPU inference optimization tools (e.g., Triton Inference Server, TensorRT)
  • Knowledge of containerization (Docker) and orchestration (Kubernetes)
  • Experience in large-scale build, release, CI/CD, and observability techniques
  • Strong customer focus and excellent communication skills

Netflix offers a unique culture with true transparency and autonomy. The role provides opportunities for impact, responsibility, and continuous learning in a collaborative environment. The company provides comprehensive benefits, including health plans, mental health support, 401(k) with employer match, stock options, and paid time off.

Netflix is committed to diversity and inclusion, providing equal opportunities to all candidates regardless of background.

Last updated 3 months ago

Responsibilities For ML Engineer L4, Consumer Inference

  • Develop customer-facing libraries and services for ML model inference
  • Build and maintain online inference services for real-time predictions
  • Optimize and deploy large language models (LLMs) for production
  • Maintain and improve a model registry for ML model governance
  • Participate in ML Platform incident management and support

Requirements For ML Engineer L4, Consumer Inference

Python
Java
Kubernetes
  • Strong programming skills in Python and Java
  • Familiarity with ML libraries like TensorFlow and PyTorch
  • Experience with GPU inference optimization tools (e.g., Triton Inference Server, TensorRT)
  • Knowledge of containerization (Docker) and orchestration (Kubernetes)
  • Experience in large-scale build, release, CI/CD, and observability techniques
  • Strong customer focus and communication skills

Benefits For ML Engineer L4, Consumer Inference

401k
Medical Insurance
Mental Health Assistance
Parental Leave
Equity
  • Health Plans
  • Mental Health support
  • 401(k) Retirement Plan with employer match
  • Stock Option Program
  • Disability Programs
  • Health Savings and Flexible Spending Accounts
  • Family-forming benefits
  • Life and Serious Injury Benefits
  • Paid leave of absence programs
  • 35 days annually for paid time off (for hourly employees)
  • Flexible time off (for salaried employees)

Interested in this job?

Jobs Related To Netflix ML Engineer L4, Consumer Inference

Software Engineer (L4), Consumer ML Model Compute & Serving Systems

Netflix is hiring a Software Engineer (L4) for their Consumer ML Model Compute & Serving Systems team to develop scalable ML infrastructure and advance AI initiatives.

Software Engineer (L4), Consumer ML Model Compute & Serving Systems

Netflix is hiring a Software Engineer (L4) for Consumer ML Model Compute & Serving Systems to build scalable ML infrastructure and advance AI initiatives.

Machine Learning Engineer, Robotic Storage Technologies - Simulation & Machine Learning

Machine Learning Engineer role at Amazon Robotics, focusing on AI-driven warehouse optimization and robotic storage solutions.

GenAI Platform Engineer, Applied Machine Learning

GenAI Platform Engineer position at Apple focusing on building and maintaining AI platforms for generative AI use-cases and infrastructure development.

Software Dev Engineer II, Amazon Q

Software Development Engineer II position at Amazon Q, focusing on AI-powered developer tools with competitive compensation and comprehensive benefits.