Hugging Face, a leading AI platform with over 5 million users and 100k organizations, is seeking a Machine Learning Engineer focused on Fast Optimized Inference. This role is perfect for engineers passionate about AI and proficient in Python, Rust, and specialized CUDA kernels and frameworks.
As a Machine Learning Engineer, you'll be at the forefront of creating specialized libraries for real-world ML use cases, building on top of our open-source foundation to develop industrial-grade solutions. You'll work on projects similar to text-generation-inference, focusing on scalability, reliability, and performance optimization.
The role offers an opportunity to work with some of the smartest people in the industry, in a culture that values diversity, equity, and inclusivity. We're a distributed team with offices in NYC and Paris, offering flexible remote work options and comprehensive benefits including health coverage, equity, and professional development support.
Join a company that's actively democratizing good AI, with a proven track record of open-source success (400k+ GitHub stars) and a commitment to supporting the ML/AI community. You'll be part of a progressive, nimble team developing real-world solutions while enjoying the benefits of a supportive, growth-oriented environment.