Senior Machine Learning Platform Engineer

Wayve

Leading developer of Embodied AI technology for autonomous vehicles

Sunnyvale, CA, USA

Machine Learning

Senior Software Engineer

Hybrid

5+ years of experience

AI · Automotive

This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Senior Machine Learning Platform Engineer

Wayve, founded in 2017, is at the forefront of Embodied AI technology for autonomous vehicles. Our advanced AI software and foundation models enable vehicles to perceive, understand, and navigate complex environments, enhancing the safety and usability of automated driving systems. We're seeking skilled engineers to join our Machine Learning Platform team, focusing on optimizing large-scale training jobs as we scale our models.

The role involves maximizing the MFU of large-scale training jobs, profiling and identifying bottlenecks in training code, implementing GPU kernels to improve training throughput, and working closely with Research teams. You'll also own and improve our GPU training clusters.

The ideal candidate should have 5+ years of experience in performance optimization or ML engineering, experience in optimizing large-scale training jobs on GPU compute clusters, and working with platform and research teams. Strong Python coding skills and a BS or MS in a relevant technical discipline are essential.

At Wayve, we value diversity and foster an inclusive work environment. We operate a hybrid working policy, combining office time to fuel innovation and culture with the flexibility of working from home. Join us in creating autonomy that propels the world forward and make Wayve the defining experience of your career!

Last updated 8 months ago

Responsibilities For Senior Machine Learning Platform Engineer

Maximising the MFU of our large scale training jobs
Profiling and identifying bottlenecks in training code
Implementing GPU kernels to improve training throughput
Working closely with Research teams to integrate and test training efficiency improvements
Owning and improving our GPU training clusters

Requirements For Senior Machine Learning Platform Engineer

Python

5+ years experience in performance optimization or ML engineering
Experience optimize large scale training jobs on GPU compute clusters
Experience in working in platform teams and working with research teams
Experience in reporting and tracking over time benchmarked performance in an open and accessible way
Ability to write high quality, well-structured and tested Python code
BS or MS in Machine Learning, Computer Science, Engineering, or a related technical discipline or equivalent experience

Benefits For Senior Machine Learning Platform Engineer

Inclusive work environment
Hybrid working policy
Flexible core working hours