ML Training and DevEx Engineer

Stack develops revolutionary AI and advanced autonomous systems to enhance safety, reliability, and efficiency in the trucking industry.
Machine Learning
AI · Automotive

Description For ML Training and DevEx Engineer

Stack is at the forefront of developing revolutionary AI and advanced autonomous systems for the trucking industry. Our mission is to enhance the safety, reliability, and efficiency of modern operations. The ML Training and DevEx team provides a reliable, scalable, and user-friendly training framework for Stack AV's modeling needs. The team also improves the developer experience for ML engineers by building tools for testing, validating, and understanding models and their training data, and it handles model optimization and deployment.

As a member of this team, you'll work on cutting-edge technology incorporating advancements in artificial intelligence, robotics, machine learning, and cloud technologies. You'll be part of a dedicated team with decades of experience in creating and deploying real-world systems for demanding environments.

Key responsibilities include:

  • Developing scalable and reliable infrastructure in a fast-paced environment
  • Collaborating across teams to build ML infrastructure for multiple customer teams
  • Making and articulating design tradeoffs to achieve alignment with other teams
  • Working on model training, optimization, and large data processing pipelines

The ideal candidate will have experience with both ML platforms and building ML-based applications, with bonus points for modeling experience. Experience in autonomous vehicles, perception, and decision-making domains is desirable but not required.

Preferred qualifications include:

  • Expertise in optimizing GPU performance at every level, from Python down to CUDA kernels
  • Experience building inference or training loops for large models (ideally LLM-style models)
  • Track record of shipping ML products at scale with business impact
  • Ability to build low-latency / high-throughput batch or stream processing pipelines
  • Proficiency in writing readable, high-performance C++
  • Prior experience in the autonomous vehicle industry

Stack is committed to diversity and inclusion, fostering a culture of entrepreneurship and innovation that welcomes people of all identities. Join us in shaping the future of autonomous transportation technology!

Last updated 5 months ago

Responsibilities For ML Training and DevEx Engineer

  • Provide a reliable, scalable, and easy-to-use training framework for Stack AV's modeling needs
  • Build tools for testing, validation, and understanding models and the data used to train them
  • Handle model optimization and deployment

Requirements For ML Training and DevEx Engineer

  • Python
  • Experience with both ML Platforms and building ML-based applications
  • Experience building scalable, reliable infrastructure in a fast-paced environment
  • Ability to work across teams
  • Experience building or using ML infrastructure that serves a large number of customer teams
  • Deep understanding of design tradeoffs and ability to articulate those tradeoffs
  • Experience with model training, model optimization, or large data processing pipelines
