Technical Lead, ML Acceleration

Stack develops revolutionary AI and advanced autonomous systems for enhancing safety, reliability, and efficiency in modern trucking operations.
Machine Learning
Staff Software Engineer
Contact Company
5+ years of experience
AI · Automotive

Description For Technical Lead, ML Acceleration

Stack is at the forefront of developing revolutionary AI and advanced autonomous systems for the trucking industry. Our technology integrates cutting-edge AI, robotics, machine learning, and cloud technologies to create innovative solutions that address the unique challenges of modern trucking operations.

We're seeking a visionary and hands-on technical lead for our ML acceleration team within the ML Platform org. This role will be crucial in designing the architecture and leading a team to automate the optimization and deployment of complex ML models, including transformer-based models like VLMs, for next-gen AI Autonomous Vehicle applications.

The ideal candidate will have a deep understanding of GPUs and optimization, coupled with excellent leadership skills. You'll be responsible for analyzing and profiling ML models, implementing optimizations using CUDA, Triton, and custom kernels, and automating the process of exporting models to optimized formats like TensorRT.

Key responsibilities include collaborating with ML researchers to balance model accuracy and speed, leading cross-team projects related to model optimization and deployment, and setting a culture of engineering excellence within the team.

This role offers an exciting opportunity to work at the intersection of AI, autonomous vehicles, and high-performance computing. You'll be part of a team dedicated to pushing the boundaries of what's possible in autonomous trucking, working with cutting-edge technologies and solving complex challenges that have real-world impact.

Join Stack and be part of a diverse, inclusive team that values entrepreneurship and innovation. Together, we're shaping the future of transportation with advanced AI and autonomous systems.

Last updated 2 months ago

Responsibilities For Technical Lead, ML Acceleration

  • Analyze and profile ML models to identify performance bottlenecks
  • Use OSS tooling to enhance our platform to enable ML engineers to profile models and optimize them
  • Automate the process of exporting the model to optimized format and deploying them
  • Implement optimizations using CUDA, Triton, and custom kernels
  • Collaborate with ML researchers to balance model accuracy and speed
  • Lead efforts within the team as well as cross-team projects related to model optimization and deployment
  • Collaborate with cross-functional teams to understand data requirements and design appropriate solutions
  • Stay updated with the latest technologies and trends in ML inference and ML accelerators
  • Identify and resolve performance bottlenecks in models
  • Set a culture of engineering excellence within the team

Requirements For Technical Lead, ML Acceleration

Python
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field
  • 5+ years of experience in GPU programming and optimization
  • Strong programming skills in C++ and Python
  • Proven experience in GPU programming and optimization
  • Familiarity with deep learning frameworks, especially PyTorch
  • CUDA programming
  • Triton language for GPU kernels
  • PyTorch optimization techniques
  • TensorRT implementation
  • ONNX model conversion and deployment
  • Custom GPU kernel development
  • Deep understanding of GPU architectures and performance optimization
  • Proven ability to lead and mentor a team, manage projects, and drive technical initiatives
  • Strong analytical and problem-solving skills
  • Excellent verbal and written communication skills

Interested in this job?

Jobs Related To Stack Technical Lead, ML Acceleration

Staff Engineer/Senior Staff Engineer, Static World Perception

Lead the development of autonomous vehicle perception systems at Stack, focusing on mapping algorithms and real-time robotic applications.

Staff Software Engineer - Tracking

Staff Software Engineer role at Stack, developing ML solutions for tracking in autonomous vehicles.

Staff Software Engineer

Staff Software Engineer role at Stack, developing ML solutions for tracking in autonomous vehicles.

Research Scientist 4 - Content and Studio

Senior Research Scientist role at Netflix focusing on computer vision and machine learning for content promotion and studio operations.

Senior Staff Machine Learning Engineer, Security

Senior Staff ML Engineer role at Airbnb focusing on security, combining advanced machine learning expertise with information security to protect platform and users.