Platform engineer, MLOps

Writer is the full-stack generative AI platform delivering transformative ROI for enterprises, named top 50 in AI by Forbes.
DevOps
Senior Software Engineer
Hybrid
101 - 500 Employees
5+ years of experience
AI · Enterprise SaaS

Description For Platform engineer, MLOps

Writer, a leading enterprise AI platform ranked among Forbes' top 50 AI companies, is seeking a Platform Engineer, MLOps for their London office. This role is crucial for deploying and managing cutting-edge AI/ML infrastructure. You'll work with AI/ML engineers to develop robust CI/CD pipelines, manage monitoring systems, and oversee large Kubernetes clusters with GPU workloads. The ideal candidate brings 5+ years of core infrastructure experience, with expertise in model training, Huggingface Transformers, PyTorch, and cloud platforms. Writer offers a comprehensive benefits package including medical coverage, parental leave, and professional development opportunities. As part of a 250+ employee team across global hubs, you'll contribute to transforming the future of work for major enterprises like Accenture, Intuit, and Salesforce. The role combines technical depth with the excitement of working in a fast-paced, innovative environment, making it perfect for those passionate about scaling AI infrastructure.

Last updated 25 minutes ago

Responsibilities For Platform engineer, MLOps

  • Work closely with AI/ML engineers to design and deploy CI/CD pipeline
  • Set up and manage monitoring, logging, and alerting systems
  • Ensure training environments availability across multiple clusters
  • Develop and manage containerization and orchestration systems
  • Operate and oversee large Kubernetes clusters with GPU workloads
  • Improve reliability, quality, and time-to-market of software solutions
  • Measure and optimize system performance
  • Provide operational support for large-scale distributed software applications

Requirements For Platform engineer, MLOps

Python
Kubernetes
  • Experience with model training
  • Experience with Huggingface Transformers
  • Experience with Pytorch
  • Experience with vLLM
  • Experience with TensorRT
  • Knowledge of infrastructure as code tools like Terraform
  • Proficiency in Python or Bash
  • Experience with cloud platforms (Google Cloud, AWS or Azure)
  • Experience with Git and GitHub workflows
  • Experience with tracing and monitoring
  • Familiar with high-performance, large-scale ML systems
  • 5+ years building core infrastructure
  • Experience running inference clusters at scale
  • Experience operating Kubernetes at scale

Benefits For Platform engineer, MLOps

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
Education Budget
  • Generous PTO and company holidays
  • Medical, dental, and vision coverage for family
  • Paid parental leave (12 weeks)
  • Fertility and family planning support
  • Early-detection cancer testing
  • Flexible spending account and dependent FSA options
  • Health savings account with company contribution
  • Home office setup, cell phone, internet stipend
  • Wellness stipend
  • Learning and development stipend
  • Company-wide off-sites and team off-sites
  • Competitive compensation
  • Company stock options
  • 401k

Interested in this job?

Jobs Related To Writer Platform engineer, MLOps

Platform engineer, MLOps

Senior Platform Engineer role at Writer, focusing on MLOps and AI infrastructure, requiring 5+ years experience in DevOps and ML systems.

Platform engineer, MLOps

Join Writer as a Platform engineer, MLOps in London, UK. Deploy and manage cutting-edge AI/ML infrastructure in a dynamic, fast-paced environment.

Senior Platform Engineer

Senior Platform Engineer position at Adverity focusing on cloud infrastructure, DevOps practices, and platform optimization with remote work options across Europe.

Senior Cloud DevOps Engineer

Senior Cloud DevOps Engineer position at Qode, working with cloud technologies and DevOps practices in an international environment.

DevOps Engineer (Hybrid)

Senior DevOps Engineer position at Wyetech, focusing on AWS infrastructure and automation for federal government projects, offering hybrid work and exceptional benefits.