CentML is on a mission to revolutionize AI accessibility by significantly reducing the costs associated with ML model development and deployment. Led by a distinguished team of AI, compiler, and ML hardware experts from industry giants like Amazon, Google, Microsoft Research, and Nvidia, the company is at the forefront of ML systems innovation. Their co-founder and CEO, Gennady Pekhimenko, brings world-renowned expertise in ML systems with multiple academic and industry research awards.
As a Machine Learning Systems Engineer, you'll be instrumental in developing high-performance, power-efficient datacenter solutions for Deep Learning. The role combines cutting-edge work in GPU architecture, networking, CPU, and IO systems, particularly focusing on next-generation inference and training frameworks. You'll work with advanced generative AI capabilities and help optimize systems that are reshaping the deep learning industry.
The position requires a strong background in ML/DL systems, excellent coding abilities in Python or C++, and deep understanding of computer architecture and GPU programming. You'll be joining a company that values diversity, offers competitive benefits, and provides an environment where you can make a significant impact on the future of AI technology. The hybrid work model offers flexibility while maintaining collaborative opportunities in either Toronto or San Francisco Bay Area offices.