Cohere is on a mission to scale intelligence to serve humanity, focusing on training and deploying frontier models for AI systems. As a Senior Software Engineer in MLOps and Infrastructure, you'll join a team responsible for building critical infrastructure that underpins all of Cohere's success. The role demands expertise in designing and managing large-scale distributed systems, particularly with Kubernetes and GPU workloads. You'll work with cutting-edge cloud technologies across multiple platforms (GCP, Azure, AWS, OCI) and build automation systems for deployment and monitoring.
The position requires participation in a 24x7 on-call rotation (with compensation) and targets candidates based in EMEA. You'll be instrumental in building self-service systems, custom Kubernetes operators, and ensuring high availability of mission-critical infrastructure. The role emphasizes both technical excellence and team collaboration, with opportunities to mentor others and influence the infrastructure roadmap.
Working at Cohere means joining a diverse team of world-class professionals passionate about advancing AI technology. The company offers comprehensive benefits including health coverage, parental leave, enrichment benefits, and flexible work arrangements. With offices in major tech hubs and a hybrid work model, you'll have the opportunity to shape the future of AI infrastructure while maintaining work-life balance.
If you're experienced in production infrastructure, passionate about automation and scalability, and want to contribute to cutting-edge AI development, this role offers an exciting opportunity to make a significant impact in the field of artificial intelligence.