Machine Learning Engineer, Fast Optimized Inference - US Remote

Building the fastest growing platform for AI builders with over 5 million users & 100k organizations sharing 1M+ models, 300k datasets & apps.
Machine Learning
Mid-Level Software Engineer
Remote
501 - 1,000 Employees
3+ years of experience
AI

Description For Machine Learning Engineer, Fast Optimized Inference - US Remote

Hugging Face, the fastest growing AI platform with over 5 million users and 100k organizations, is seeking a Machine Learning Engineer focused on Fast Optimized Inference. This role is perfect for passionate engineers interested in creating specialized ML libraries for real-world applications. You'll work on developing software similar to text-generation-inference, focusing on industrial-level usage and scalability. The position involves creating specialized code building upon their open-source foundation, with 400k+ Github stars across their libraries.

The role combines hands-on development with performance optimization and production management. You'll be responsible for developing ML-specific software, ensuring system reliability, and monitoring production environments. The ideal candidate should be proficient in Python, Rust, and specialized Cuda kernels Frameworks, including transformers, Keras, or PyTorch.

Hugging Face offers an inclusive, development-focused environment where you'll work with industry-leading professionals. They provide comprehensive benefits including flexible remote work, health/dental/vision coverage, parental leave, and equity participation. The company strongly values diversity and community contribution, supporting the broader ML/AI ecosystem through collaborative scientific advancement.

This position offers a unique opportunity to impact AI democratization while working with cutting-edge technologies. You'll be part of a progressive, decentralized team developing solutions that enhance user experiences and push the boundaries of AI applications. The role combines technical expertise with real-world impact, making it ideal for engineers passionate about advancing AI technology while maintaining practical applications.

Last updated 3 days ago

Responsibilities For Machine Learning Engineer, Fast Optimized Inference - US Remote

  • Develop specialized software for specific machine learning use cases with broad applications
  • Create scalable software solutions for industrial purposes using existing library frameworks
  • Enhance reliability, quality, and time-to-market of software suite
  • Measure and optimize system performance
  • Manage production environment by monitoring availability and system health

Requirements For Machine Learning Engineer, Fast Optimized Inference - US Remote

Python
  • Proficiency in Python
  • Experience with Rust
  • Knowledge of specialized Cuda kernels Frameworks
  • Experience with transformers, Keras or PyTorch

Benefits For Machine Learning Engineer, Fast Optimized Inference - US Remote

Medical Insurance
Dental Insurance
Vision Insurance
Parental Leave
Education Budget
Equity
  • Flexible working hours
  • Remote work options
  • Health, dental, and vision benefits
  • Flexible parental leave
  • Paid time off
  • Company equity
  • Conference and training reimbursement
  • Office visits opportunity
  • Workstation equipment support

Interested in this job?

Jobs Related To Hugging Face Machine Learning Engineer, Fast Optimized Inference - US Remote

Open-Source Machine Learning Engineer - International Remote

Open-Source Machine Learning Engineer position at Hugging Face, working remotely to improve ML ecosystem through open-source development and community engagement.

Software Engineer III, Machine Learning, Search

Software Engineer III position at Google focusing on machine learning and search, requiring 2 years of experience in software development and ML algorithms.

Software Engineer III, Core Machine Learning, Google Cloud

Software Engineer III position at Google Cloud focusing on core machine learning infrastructure and systems, offering competitive compensation and opportunity to work on large-scale ML systems.

Software Development Engineer, Predictive Targeting

Software Development Engineer role at Amazon focusing on machine learning and predictive targeting to customize customer experiences across all Amazon platforms.

Software Engineer III, Core Machine Learning, Google Cloud

Software Engineer III position at Google focusing on core machine learning infrastructure and Cloud AI development, offering competitive compensation and opportunities to work on cutting-edge ML technologies.