Machine Learning Systems Engineer

CentML develops solutions to reduce the cost of developing and deploying ML models, enabling widespread AI adoption.
Machine Learning
Mid-Level Software Engineer
Hybrid
2+ years of experience
AI

Description For Machine Learning Systems Engineer

CentML is on a mission to revolutionize AI accessibility by significantly reducing the costs associated with ML model development and deployment. Led by a distinguished team of AI, compiler, and ML hardware experts from industry giants like Amazon, Google, Microsoft Research, and Nvidia, the company is at the forefront of ML systems innovation. Their co-founder and CEO, Gennady Pekhimenko, brings world-renowned expertise in ML systems with multiple academic and industry research awards.

As a Machine Learning Systems Engineer, you'll be instrumental in developing high-performance, power-efficient datacenter solutions for Deep Learning. The role combines cutting-edge work in GPU architecture, networking, CPU, and IO systems, particularly focusing on next-generation inference and training frameworks. You'll work with advanced generative AI capabilities and help optimize systems that are reshaping the deep learning industry.

The position requires a strong background in ML/DL systems, excellent coding abilities in Python or C++, and deep understanding of computer architecture and GPU programming. You'll be joining a company that values diversity, offers competitive benefits, and provides an environment where you can make a significant impact on the future of AI technology. The hybrid work model offers flexibility while maintaining collaborative opportunities in either Toronto or San Francisco Bay Area offices.

Last updated 32 minutes ago

Responsibilities For Machine Learning Systems Engineer

  • Communicate with product teams and profile ML/DL workloads
  • Design and implement novel solutions to solve problems
  • Survey and reproduce state-of-the-art research work
  • Write unit tests and benchmarks to validate and evaluate solutions

Requirements For Machine Learning Systems Engineer

Python
  • 2+ years of experience in ML/DL systems and frameworks
  • Excellent communication skills and ability to work in a team
  • Strong coding skills in Python and/or C++
  • Solid fundamentals in machine learning and deep learning
  • Knowledge of algorithms, data structures, operating systems, and computer architecture
  • Experience with GPU architecture and programming (CUDA, cuDNN, cuBLAS, CUTLASS)
  • Experience with TPU
  • Strong academic records or publication records in ML/DL or computer systems

Benefits For Machine Learning Systems Engineer

Equity
Medical Insurance
Dental Insurance
Parental Leave
Education Budget
  • Open and inclusive work environment
  • Employee stock options
  • Best-in-class medical and dental benefits
  • Parental Leave top-up
  • Professional development budget
  • Flexible vacation time

Interested in this job?

Jobs Related To CentML Machine Learning Systems Engineer

Software Engineer III, AI/ML, Google Cloud

Software Engineer III position at Google Cloud focusing on AI/ML development, requiring 2 years of software development experience and expertise in machine learning infrastructure.

Software Engineer, Machine Learning

Machine Learning Engineer position at Imbue working on cutting-edge deep learning research and infrastructure for general human-like machine intelligence.

Research Engineer

Research Engineer position at Waabi, developing AI algorithms for self-driving vehicles, offering $122K-$215K salary with hybrid work options in Toronto, San Francisco, or Dallas.

Machine Learning Engineer

Machine Learning Engineer position at EvolutionaryScale, focusing on developing and deploying AI models for biological research, offering $150K-$350K salary with hybrid work in NY or SF.

Machine Learning Engineer

Machine Learning Engineer position at Anlatan, focusing on developing and training cutting-edge LLMs and Image models in a remote-first environment.