Member of Technical Staff, Training Infra Engineer

AI company training and deploying frontier models for developers and enterprises to power AI systems for content generation, semantic search, RAG, and agents.
Machine Learning
Senior Software Engineer
Remote
AI

Description For Member of Technical Staff, Training Infra Engineer

Cohere is at the forefront of AI development, focusing on training and deploying frontier models for developers and enterprises. This role as a Member of Technical Staff, Training Infra Engineer presents an exciting opportunity to work with cutting-edge AI technology and infrastructure.

The position involves designing and implementing high-performance training systems for large-scale AI models. You'll be working with state-of-the-art infrastructure and contributing to both engineering and research efforts. The role offers significant autonomy and the chance to work with some of the best researchers in the field.

What makes this role particularly attractive is Cohere's impressive compute-to-engineer ratio and their commitment to bridging the gap between research and production. You'll be working on critical infrastructure that powers next-generation AI models, with access to substantial computational resources and data.

The company culture emphasizes diversity, inclusion, and work-life balance, offering attractive benefits including 6 weeks of vacation, health and dental coverage, and mental health support. The role is remote-friendly with offices in major tech hubs, providing flexibility in work location.

This is an ideal position for a senior engineer passionate about machine learning infrastructure, distributed systems, and high-performance computing. You'll be contributing to Cohere's mission of scaling intelligence to serve humanity while working alongside world-class talent in the AI field.

The role requires expertise in Python, ML frameworks, and distributed systems, but Cohere encourages applications even if candidates don't perfectly match all requirements. They value diverse perspectives and offer comprehensive support for professional growth and development.

Last updated 9 hours ago

Responsibilities For Member of Technical Staff, Training Infra Engineer

  • Design and write high-performant and scalable software for training
  • Improve training setup from infrastructure and codebase performance standpoint
  • Craft and implement tools to speed up training cycles
  • Research and experiment with ideas on supercompute and data infrastructure
  • Work with researchers in the field

Requirements For Member of Technical Staff, Training Infra Engineer

Python
Kubernetes
  • Extremely strong software engineering skills
  • Proficiency in Python and ML frameworks (JAX, Pytorch, XLA/MLIR)
  • Experience with distributed training infrastructures (Kubernetes, Slurm) and frameworks (Ray)
  • Experience using large-scale distributed training strategies
  • Hands on experience training large models at scale

Benefits For Member of Technical Staff, Training Infra Engineer

Dental Insurance
Medical Insurance
Mental Health Assistance
Parental Leave
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits
  • Mental health budget
  • 100% Parental Leave top-up for 6 months (Canada, US, UK)
  • Personal enrichment benefits
  • Remote-flexible work
  • Co-working stipend
  • 6 weeks of vacation

Interested in this job?

Jobs Related To Cohere Member of Technical Staff, Training Infra Engineer

Software Development Engineer - Machine Learning, Sponsored Products

Senior Machine Learning Software Engineer role at Amazon Advertising, focusing on Sponsored Products search relevance and ad serving systems.

Machine Learning / Computer Vision Engineer - Generative AI

Senior Machine Learning Engineer role at Apple focusing on Generative AI and Computer Vision, offering competitive salary and comprehensive benefits.

AIML - Senior Software Engineer, ML Systems and Evaluation Engineering

Senior Software Engineer role at Apple focusing on ML Systems and Evaluation Engineering, working on Siri and AI technologies.

Applied Machine Learning Engineer - Customer Feedback

Senior ML Engineer role at Apple focusing on customer feedback analysis using ML and AI, offering competitive salary and comprehensive benefits.

Software Developer in Test, Applied ML Analytics

Senior Software Developer in Test position focusing on ML Analytics for Apple Pay and Wallet services, combining test automation with machine learning expertise.