ML Engineer L4, Consumer Inference

Netflix is one of the world's leading entertainment services with 278 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages.
$100,000 - $464,000
Machine Learning
Mid-Level Software Engineer
In-Person
5,000+ Employees
4+ years of experience
AI · Enterprise SaaS · Entertainment

Description For ML Engineer L4, Consumer Inference

Netflix is seeking a Machine Learning Engineer to join their Machine Learning Platform (MLP) team. This role will focus on bridging the gap between ML research and productization. Key responsibilities include:

  • Developing customer-facing libraries and services for efficient and scalable ML model inference
  • Building and maintaining online inference services for real-time predictions
  • Optimizing and deploying large language models (LLMs) for production environments
  • Maintaining and improving a model registry for ML model governance
  • Participating in ML Platform incident management and support

The ideal candidate will have:

  • Strong programming skills in Python and Java
  • Experience with ML libraries like TensorFlow and PyTorch
  • Familiarity with GPU inference optimization tools (e.g., Triton Inference Server, TensorRT)
  • Knowledge of containerization (Docker) and orchestration (Kubernetes)
  • Experience in large-scale build, release, CI/CD, and observability techniques
  • Strong customer focus and excellent communication skills

Netflix offers a unique culture with true transparency and autonomy. The role provides opportunities for impact, responsibility, and continuous learning in a collaborative environment. The company provides comprehensive benefits, including health plans, mental health support, 401(k) with employer match, stock options, and paid time off.

Netflix is committed to diversity and inclusion, providing equal opportunities to all candidates regardless of background.

Last updated 2 months ago

Responsibilities For ML Engineer L4, Consumer Inference

  • Develop customer-facing libraries and services for ML model inference
  • Build and maintain online inference services for real-time predictions
  • Optimize and deploy large language models (LLMs) for production
  • Maintain and improve a model registry for ML model governance
  • Participate in ML Platform incident management and support

Requirements For ML Engineer L4, Consumer Inference

Python
Java
Kubernetes
  • Strong programming skills in Python and Java
  • Familiarity with ML libraries like TensorFlow and PyTorch
  • Experience with GPU inference optimization tools (e.g., Triton Inference Server, TensorRT)
  • Knowledge of containerization (Docker) and orchestration (Kubernetes)
  • Experience in large-scale build, release, CI/CD, and observability techniques
  • Strong customer focus and communication skills

Benefits For ML Engineer L4, Consumer Inference

401k
Medical Insurance
Mental Health Assistance
Parental Leave
Equity
  • Health Plans
  • Mental Health support
  • 401(k) Retirement Plan with employer match
  • Stock Option Program
  • Disability Programs
  • Health Savings and Flexible Spending Accounts
  • Family-forming benefits
  • Life and Serious Injury Benefits
  • Paid leave of absence programs
  • 35 days annually for paid time off (for hourly employees)
  • Flexible time off (for salaried employees)

Interested in this job?

Jobs Related To Netflix ML Engineer L4, Consumer Inference

Software Engineer II

Mid-level Software Engineer position at Microsoft's Azure ML team, building large-scale model serving platform for AI inference, including OpenAI models.

Machine Learning Engineer

Machine Learning Engineer role at Apple, focusing on developing ML solutions for the Apple Online Store, including search, recommendations, and personalization systems.

Software Engineer

Software Engineer role at Microsoft focusing on Azure Machine Learning infrastructure and large-scale AI model serving.

Field Service AI Solution Architect

Field Service AI Solution Architect position at Salesforce, focusing on implementing AI solutions for field service operations with 3+ years of experience required.

Deep Learning Engineer, Datacenters

Deep Learning Engineer position at NVIDIA focusing on datacenter optimization, AI infrastructure, and performance analysis for large-scale machine learning systems.