Platform engineer, MLOps

AI-powered writing platform company focused on enterprise solutions
Machine Learning
Senior Software Engineer
Remote
5+ years of experience
AI · Enterprise SaaS

Description For Platform engineer, MLOps

Writer is seeking a Platform Engineer specializing in MLOps to join its team. This role is crucial for deploying and managing cutting-edge AI/ML infrastructure. The position involves working closely with AI/ML engineers and researchers to develop robust CI/CD pipelines, manage monitoring systems, and maintain large Kubernetes clusters with GPU workloads. The ideal candidate has 5+ years of experience building core infrastructure, with expertise in model training, Hugging Face Transformers, PyTorch, and cloud platforms. The role offers a comprehensive benefits package including healthcare, paid parental leave, and stipends for professional development and wellness. This is an opportunity for an experienced engineer passionate about ML infrastructure and scalable systems to make a significant impact. The position is based in San Francisco, with a remote work option.

Last updated 2 days ago

Responsibilities For Platform engineer, MLOps

  • Work with AI/ML engineers to design and deploy CI/CD pipelines for reproducible experiments
  • Set up and manage monitoring, logging, and alerting systems for training runs and APIs
  • Manage training environments across multiple clusters
  • Develop and manage containerization and orchestration systems
  • Operate large Kubernetes clusters with GPU workloads
  • Improve reliability, quality, and time-to-market of software solutions
  • Optimize system performance
  • Provide operational support for large-scale distributed software applications

Requirements For Platform engineer, MLOps

Python
Kubernetes
  • Experience with model training
  • Experience with Hugging Face Transformers
  • Experience with PyTorch, vLLM, TensorRT
  • Knowledge of infrastructure as code tools like Terraform
  • Proficiency in Python or Bash
  • Experience with cloud platforms (Google Cloud, AWS or Azure)
  • Experience with Git and GitHub workflows
  • Experience with tracing and monitoring
  • Familiarity with high-performance, large-scale ML systems
  • 5+ years building core infrastructure
  • Experience running inference clusters at scale
  • Experience with Kubernetes at scale

Benefits For Platform engineer, MLOps

  • Generous PTO and company holidays
  • Medical, dental, and vision coverage
  • 12 weeks paid parental leave
  • Fertility and family planning support
  • Early-detection cancer testing
  • Flexible spending account and dependent FSA options
  • Health savings account with company contribution
  • Home office setup stipend
  • Wellness stipend
  • Learning and development stipend
  • Company-wide off-sites
  • Competitive compensation
  • Stock options
  • 401k
