Technical Lead, ML Acceleration

Stack

Stack develops revolutionary AI and advanced autonomous systems for enhancing safety, reliability, and efficiency in modern trucking operations.

Machine Learning

Staff Software Engineer

5+ years of experience

AI · Automotive

Description For Technical Lead, ML Acceleration

Stack is at the forefront of developing revolutionary AI and advanced autonomous systems for the trucking industry. Our technology integrates cutting-edge AI, robotics, machine learning, and cloud technologies to create innovative solutions that address the unique challenges of modern trucking operations.

We're seeking a visionary, hands-on technical lead for the ML acceleration team within the ML Platform org. In this role you will design the architecture and lead a team that automates the optimization and deployment of complex ML models, including transformer-based models such as vision-language models (VLMs), for next-generation autonomous vehicle applications.

The ideal candidate will have a deep understanding of GPUs and optimization, coupled with excellent leadership skills. You'll be responsible for analyzing and profiling ML models, implementing optimizations using CUDA, Triton, and custom kernels, and automating the process of exporting models to optimized formats like TensorRT.

Key responsibilities include collaborating with ML researchers to balance model accuracy and speed, leading cross-team projects related to model optimization and deployment, and setting a culture of engineering excellence within the team.

This role offers an exciting opportunity to work at the intersection of AI, autonomous vehicles, and high-performance computing. You'll be part of a team dedicated to pushing the boundaries of what's possible in autonomous trucking, working with cutting-edge technologies and solving complex challenges that have real-world impact.

Join Stack and be part of a diverse, inclusive team that values entrepreneurship and innovation. Together, we're shaping the future of transportation with advanced AI and autonomous systems.

Last updated 5 months ago

Responsibilities For Technical Lead, ML Acceleration

Analyze and profile ML models to identify performance bottlenecks
Use OSS tooling to extend our platform so that ML engineers can profile and optimize their models
Automate the process of exporting models to optimized formats and deploying them
Implement optimizations using CUDA, Triton, and custom kernels
Collaborate with ML researchers to balance model accuracy and speed
Lead efforts within the team as well as cross-team projects related to model optimization and deployment
Collaborate with cross-functional teams to understand data requirements and design appropriate solutions
Stay updated with the latest technologies and trends in ML inference and ML accelerators
Resolve the performance bottlenecks found in models
Set a culture of engineering excellence within the team

Requirements For Technical Lead, ML Acceleration

Bachelor's or Master's degree in Computer Science, Engineering, or a related field
5+ years of experience in GPU programming and optimization
Strong programming skills in C++ and Python
Familiarity with deep learning frameworks, especially PyTorch
CUDA programming
Triton language for GPU kernels
PyTorch optimization techniques
TensorRT implementation
ONNX model conversion and deployment
Custom GPU kernel development
Deep understanding of GPU architectures and performance optimization
Proven ability to lead and mentor a team, manage projects, and drive technical initiatives
Strong analytical and problem-solving skills
Excellent verbal and written communication skills
