Platform engineer, MLOps

Writer

AI-powered writing platform company focused on enterprise solutions

San Francisco, CA, USA

Machine Learning

Senior Software Engineer

Remote

5+ years of experience

AI · Enterprise SaaS

Description For Platform engineer, MLOps

Writer is seeking a Platform Engineer specializing in MLOps to join their team. This role is crucial for deploying and managing cutting-edge AI/ML infrastructure. The position involves working closely with AI/ML engineers and researchers to develop robust CI/CD pipelines, manage monitoring systems, and maintain large Kubernetes clusters with GPU workloads. The ideal candidate should have 5+ years of experience building core infrastructure, with expertise in model training, Huggingface Transformers, PyTorch, and cloud platforms. The role offers a comprehensive benefits package including healthcare, paid parental leave, and various stipends for professional development and wellness. This is an excellent opportunity for an experienced engineer passionate about ML infrastructure and scalable systems to make a significant impact in a dynamic environment. The position offers flexibility with a remote work option while being based in San Francisco.

Last updated 2 days ago

Responsibilities For Platform engineer, MLOps

Work with AI/ML engineers to design and deploy CI/CD pipeline for reproducible experiments
Set up and manage monitoring, logging, and alerting systems for training runs and APIs
Manage training environments across multiple clusters
Develop and manage containerization and orchestration systems
Operate large Kubernetes clusters with GPU workloads
Improve reliability, quality, and time-to-market of software solutions
Optimize system performance
Provide operational support for large-scale distributed software applications

Requirements For Platform engineer, MLOps

Python

Kubernetes

Experience with model training
Experience with Huggingface Transformers
Experience with Pytorch, vLLM, TensorRT
Knowledge of infrastructure as code tools like Terraform
Proficiency in Python or Bash
Experience with cloud platforms (Google Cloud, AWS or Azure)
Experience with Git and GitHub workflows
Experience with tracing and monitoring
Familiar with high-performance, large-scale ML systems
5+ years building core infrastructure
Experience running inference clusters at scale
Experience with Kubernetes at scale

Benefits For Platform engineer, MLOps

Medical Insurance

Dental Insurance

Vision Insurance

401k

Parental Leave

Education Budget

Generous PTO and company holidays
Medical, dental, and vision coverage
12 weeks paid parental leave
Fertility and family planning support
Early-detection cancer testing
Flexible spending account and dependent FSA options
Health savings account with company contribution
Home office setup stipend
Wellness stipend
Learning and development stipend
Company-wide off-sites
Competitive compensation
Stock options
401k

Writer

AI-powered writing platform company focused on enterprise solutions

San Francisco, CA, USA

Machine Learning

Senior Software Engineer

Remote

5+ years of experience

AI · Enterprise SaaS

Interested in this job?

Jobs Related To Writer Platform engineer, MLOps

Platform engineer, MLOps (UK)

Writer

Senior Platform Engineer MLOps position at Writer, focusing on AI/ML infrastructure and Kubernetes management, offering hybrid work in London with comprehensive benefits.

Senior Machine Learning Engineer

Microsoft

Senior Machine Learning Engineer role at Microsoft focusing on building evaluation frameworks for cutting-edge deep learning models and AI platforms.

Machine Learning Research Engineer – Speech for On-Device Agentic AI

Qualcomm

Senior Machine Learning Research Engineer role at Qualcomm Korea, focusing on speech recognition and conversational AI development for on-device applications.

Platform engineer, MLOps (UK)

Writer

Senior Platform Engineer MLOps position at Writer, focusing on AI/ML infrastructure and Kubernetes management, offering hybrid work in London with comprehensive benefits.

Senior Software Engineer, AI/ML GenAI, Google Cloud AI

Google

Senior Software Engineer position at Google Cloud AI focusing on GenAI development, offering $166K-$244K salary plus benefits, requiring 5 years of software development experience.