Replicate is revolutionizing machine learning accessibility by enabling cloud-based model deployment. Founded by veterans from Docker, Spotify, Dropbox, and other tech giants, they're transforming AI deployment to match web deployment's simplicity.
The Models team maintains a comprehensive public model library of cutting-edge generative AI models, ensuring optimal performance and usability. As a Machine Learning Engineer, you'll be at the forefront of deploying and optimizing image, audio, and video models, while implementing the latest research developments.
The role requires a strong foundation in software engineering (5+ years experience) combined with expertise in media models. While a PhD isn't mandatory, you'll need solid mathematical understanding and research paper comprehension skills. You'll be responsible for maintaining model libraries, developing training code for LoRA fine-tuning, and transforming academic research into practical applications.
The position offers flexibility with remote work options across the United States, though there's a preference for PST timezone alignment. For those near San Francisco, there's an opportunity to work from the office three days a week. The company culture emphasizes technical excellence, community involvement, and open-source contributions.
Replicate stands out for its mission to democratize machine learning, making it accessible to everyone without requiring advanced degrees. The team comprises hackers, engineers, researchers, and artists who prioritize API design, infrastructure reliability, and community engagement. With leadership from engineers who've created technologies like Docker Compose and OpenAPI, Replicate is positioned to make significant impacts in AI accessibility and deployment.