Machine Learning Engineer, Fast Optimized Inference

AI platform builder with over 5 million users & 100k organizations, sharing 1M+ models, 300k datasets & apps, with 400k+ GitHub stars.
$120,000 - $200,000
Machine Learning
Mid-Level Software Engineer
Remote
501 - 1,000 Employees
3+ years of experience
AI

Description For Machine Learning Engineer, Fast Optimized Inference

Hugging Face, the fastest-growing AI platform, is seeking a Machine Learning Engineer to focus on Fast Optimized Inference. As part of our mission to democratize good AI, you'll join a platform serving 5M+ users and 100k+ organizations. The role involves creating specialized libraries for real-world ML use cases, building on our open-source foundation to develop industrial-grade solutions.

You'll work on developing specialized software similar to text-generation-inference, focusing on scalability and performance optimization. The position requires proficiency in Python and Rust, experience with specialized CUDA kernels, and knowledge of frameworks such as Transformers, Keras, or PyTorch. You'll be responsible for enhancing software reliability, monitoring system health, and driving innovation in our production environment.

We offer a collaborative, inclusive environment with offices in NYC and Paris, though we're largely distributed. Our benefits include flexible working hours, comprehensive health coverage, parental leave, equity compensation, and professional development support. We're committed to building a diverse, equitable workplace where all team members can thrive.

Join us in advancing AI technology while working with industry-leading professionals. You'll be part of a community that values open collaboration and scientific advancement in the ML/AI field. If you're passionate about creating impactful AI solutions and want to contribute to a company that's actively shaping the future of machine learning, this role offers an excellent opportunity to make a difference.

Last updated 2 days ago

Responsibilities For Machine Learning Engineer, Fast Optimized Inference

  • Develop specialized software for specific machine learning use cases with broad applications
  • Create scalable software solutions for industrial purposes using existing library frameworks
  • Enhance the reliability, quality, and time-to-market of the software suite
  • Monitor and manage production environment availability and system health
  • Measure and optimize system performance

Requirements For Machine Learning Engineer, Fast Optimized Inference

  • Proficiency in Python and Rust
  • Experience with specialized CUDA kernels
  • Knowledge of frameworks such as Transformers, Keras, or PyTorch
  • Passion for AI and Machine Learning

Benefits For Machine Learning Engineer, Fast Optimized Inference

  • Flexible working hours
  • Remote work options
  • Health, dental, and vision benefits
  • Flexible parental leave
  • Paid time off
  • Company equity
  • Conference and education reimbursement
  • Opportunity to visit our offices
  • Workstation support
