Software Engineer L5, Model Observability & Lifecycle Management, Machine Learning Platform

Netflix is one of the world's leading entertainment services, with 283 million paid memberships in over 190 countries enjoying TV series, films and games.
$100,000 - $720,000
Machine Learning
Staff Software Engineer
Remote
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS · Entertainment

Description For Software Engineer L5, Model Observability & Lifecycle Management, Machine Learning Platform

Netflix, a global entertainment leader with 283 million paid memberships, is seeking a Staff Software Engineer for their Machine Learning Platform team. This role focuses on Model Observability & Lifecycle Management, working within the centralized MLOps platform that enhances productivity across Netflix's ML practitioners.

The position involves building comprehensive systems for managing ML models, including visualization, observability, and performance benchmarking. You'll be working on cutting-edge projects supporting bandits, multi-task learning models, Large Language Models (LLMs), and other foundation models. The role is highly cross-functional, requiring collaboration with engineers, product managers, and data scientists.

Key projects include developing observability dashboards, implementing model registry systems, creating anomaly detection mechanisms, and building cost monitoring solutions. The team's work directly impacts hundreds of ML practitioners developing business-critical models across personalization, growth, commerce, ads, and studio algorithms.

The ideal candidate should have strong experience in backend distributed systems, full-stack development, and cloud platforms. Knowledge of MLOps best practices and model lifecycle management is crucial. The position offers a flexible compensation structure where you can choose between salary and stock options annually, with a competitive range of $100,000 - $720,000.

Netflix values inclusion and diversity, providing equal opportunities to all candidates. The role offers the opportunity to work remotely while contributing to one of the world's leading entertainment platforms, making a significant impact on how millions of users experience content globally.

Last updated 7 days ago

Responsibilities For Software Engineer L5, Model Observability & Lifecycle Management, Machine Learning Platform

  • Develop and expand model observability and visualization workflows
  • Build observability dashboard and backend system for ML practitioners
  • Implement model registry to catalog ML models and their versions
  • Implement anomaly and drift detection on models, features, and embeddings
  • Create cost monitoring and chargeback dashboards
  • Enhance user interfaces for ML practitioners

Requirements For Software Engineer L5, Model Observability & Lifecycle Management, Machine Learning Platform

Java
React
  • Experience building backend distributed systems and full-stack systems using object-oriented programming (preferably Java)
  • Experience with web API frameworks (preferably Spring Boot) and UI frameworks like React
  • Experience working with public cloud like AWS, Azure, or GCP
  • Knowledge of ML model lifecycle management and MLOps best practices
  • Proactive communication skills with cross-functional teams
  • BS/MS in Computer Science, Applied Math, Engineering, or related field

Benefits For Software Engineer L5, Model Observability & Lifecycle Management, Machine Learning Platform

Equity
  • Flexible compensation structure with choice between salary and stock options
  • Remote work opportunity
  • Equal opportunity employer

Interested in this job?

Jobs Related To Netflix Software Engineer L5, Model Observability & Lifecycle Management, Machine Learning Platform

Research Scientist 4 - Content and Studio

Senior Research Scientist role at Netflix focusing on computer vision and machine learning for content promotion and studio operations.

Research Scientist 4 - Content and Studio

Senior Research Scientist position at Netflix focusing on generative speech technologies and ML research for content localization, offering competitive compensation and comprehensive benefits.

Engineering Manager, Training Platform, Machine Learning Platform

Lead Netflix's Machine Learning Platform team, building cutting-edge training infrastructure for global entertainment innovation.

Engineering Manager, for Foundation Models Development

Lead Netflix's Foundation Models Development team, driving innovation in LLMs and generative AI for personalization and content discovery systems.

Engineering Manager, Machine Learning and Asset Optimization

Lead Netflix's ML team in optimizing content presentation through innovative personalization algorithms, managing research and implementation of cutting-edge ML solutions.