Machine Learning Engineer, Fast Optimized Inference

AI platform builder with over 5 million users & 100k organizations, sharing 1M+ models, 300k datasets & apps, with 400k+ GitHub stars.
$120,000 - $200,000
Machine Learning
Mid-Level Software Engineer
Remote
501 - 1,000 Employees
3+ years of experience
AI

Description For Machine Learning Engineer, Fast Optimized Inference

Hugging Face, a leading AI platform with over 5 million users and 100k organizations, is seeking a Machine Learning Engineer focused on Fast Optimized Inference. This role is perfect for engineers passionate about AI and proficient in Python, Rust, and specialized CUDA kernels.

As a Machine Learning Engineer, you'll be at the forefront of creating specialized libraries for real-world ML use cases, building on top of our open-source foundation to develop industrial-grade solutions. You'll work on projects similar to text-generation-inference, focusing on scalability, reliability, and performance optimization.

The role offers an opportunity to work with some of the smartest people in the industry, in a culture that values diversity, equity, and inclusivity. We're a distributed team with offices in NYC and Paris, offering flexible remote work options and comprehensive benefits including health coverage, equity, and professional development support.

Join a company that's actively democratizing good AI, with a proven track record of open-source success (400k+ GitHub stars) and a commitment to supporting the ML/AI community. You'll be part of a progressive, nimble team developing real-world solutions while enjoying the benefits of a supportive, growth-oriented environment.

Last updated a month ago

Responsibilities For Machine Learning Engineer, Fast Optimized Inference

  • Develop specialized software for specific machine learning use cases with broad applications
  • Create scalable software solutions for industrial purposes, building on existing libraries and frameworks
  • Enhance the reliability, quality, and time-to-market of the software suite
  • Monitor and manage production environment availability and system health
  • Measure and optimize system performance

Requirements For Machine Learning Engineer, Fast Optimized Inference

  • Proficiency in Python and Rust
  • Experience with specialized CUDA kernels
  • Knowledge of Transformers, Keras or PyTorch
  • Passion for AI and Machine Learning

Benefits For Machine Learning Engineer, Fast Optimized Inference

  • Flexible working hours
  • Remote work options
  • Health, dental, and vision benefits
  • Flexible parental leave
  • Paid time off
  • Company equity
  • Conference and education reimbursement
  • Opportunity to visit our offices
  • Workstation support
