Machine Learning Engineer, Fast Optimized Inference - US Remote

Hugging Face

Building the fastest growing platform for AI builders with over 5 million users & 100k organizations sharing 1M+ models, 300k datasets & apps.

New York, NY, USA

Machine Learning

Mid-Level Software Engineer

Remote

501 - 1,000 Employees

3+ years of experience

Description For Machine Learning Engineer, Fast Optimized Inference - US Remote

Hugging Face, the fastest growing AI platform with over 5 million users and 100k organizations, is seeking a Machine Learning Engineer focused on Fast Optimized Inference. This role is perfect for passionate engineers interested in creating specialized ML libraries for real-world applications. You'll work on developing software similar to text-generation-inference, focusing on industrial-level usage and scalability. The position involves creating specialized code building upon their open-source foundation, with 400k+ Github stars across their libraries.

The role combines hands-on development with performance optimization and production management. You'll be responsible for developing ML-specific software, ensuring system reliability, and monitoring production environments. The ideal candidate should be proficient in Python, Rust, and specialized Cuda kernels Frameworks, including transformers, Keras, or PyTorch.

Hugging Face offers an inclusive, development-focused environment where you'll work with industry-leading professionals. They provide comprehensive benefits including flexible remote work, health/dental/vision coverage, parental leave, and equity participation. The company strongly values diversity and community contribution, supporting the broader ML/AI ecosystem through collaborative scientific advancement.

This position offers a unique opportunity to impact AI democratization while working with cutting-edge technologies. You'll be part of a progressive, decentralized team developing solutions that enhance user experiences and push the boundaries of AI applications. The role combines technical expertise with real-world impact, making it ideal for engineers passionate about advancing AI technology while maintaining practical applications.

Last updated 3 days ago

Responsibilities For Machine Learning Engineer, Fast Optimized Inference - US Remote

Develop specialized software for specific machine learning use cases with broad applications
Create scalable software solutions for industrial purposes using existing library frameworks
Enhance reliability, quality, and time-to-market of software suite
Measure and optimize system performance
Manage production environment by monitoring availability and system health

Requirements For Machine Learning Engineer, Fast Optimized Inference - US Remote

Python

Proficiency in Python
Experience with Rust
Knowledge of specialized Cuda kernels Frameworks
Experience with transformers, Keras or PyTorch

Benefits For Machine Learning Engineer, Fast Optimized Inference - US Remote

Medical Insurance

Dental Insurance

Vision Insurance

Parental Leave

Education Budget

Equity

Flexible working hours
Remote work options
Health, dental, and vision benefits
Flexible parental leave
Paid time off
Company equity
Conference and training reimbursement
Office visits opportunity
Workstation equipment support

Hugging Face

Building the fastest growing platform for AI builders with over 5 million users & 100k organizations sharing 1M+ models, 300k datasets & apps.

New York, NY, USA

Machine Learning

Mid-Level Software Engineer

Remote

501 - 1,000 Employees

3+ years of experience

Interested in this job?

Jobs Related To Hugging Face Machine Learning Engineer, Fast Optimized Inference - US Remote

Open-Source Machine Learning Engineer - International Remote

Hugging Face

Open-Source Machine Learning Engineer position at Hugging Face, working remotely to improve ML ecosystem through open-source development and community engagement.

Software Engineer III, Machine Learning, Search

Google

Software Engineer III position at Google focusing on machine learning and search, requiring 2 years of experience in software development and ML algorithms.

Software Engineer III, Core Machine Learning, Google Cloud

Google

Software Engineer III position at Google Cloud focusing on core machine learning infrastructure and systems, offering competitive compensation and opportunity to work on large-scale ML systems.

Software Development Engineer, Predictive Targeting

Amazon

Software Development Engineer role at Amazon focusing on machine learning and predictive targeting to customize customer experiences across all Amazon platforms.

Software Engineer III, Core Machine Learning, Google Cloud

Google

Software Engineer III position at Google focusing on core machine learning infrastructure and Cloud AI development, offering competitive compensation and opportunities to work on cutting-edge ML technologies.