Wayve, founded in 2017, is at the forefront of Embodied AI technology for autonomous vehicles. Our advanced AI software and foundation models enable vehicles to perceive, understand, and navigate complex environments, enhancing the safety and usability of automated driving systems. We're seeking skilled engineers to join our Machine Learning Platform team, focusing on optimizing large-scale training jobs as we scale our models.
The role involves maximizing the MFU of large-scale training jobs, profiling and identifying bottlenecks in training code, implementing GPU kernels to improve training throughput, and working closely with Research teams. You'll also own and improve our GPU training clusters.
The ideal candidate should have 5+ years of experience in performance optimization or ML engineering, experience in optimizing large-scale training jobs on GPU compute clusters, and working with platform and research teams. Strong Python coding skills and a BS or MS in a relevant technical discipline are essential.
At Wayve, we value diversity and foster an inclusive work environment. We operate a hybrid working policy, combining office time to fuel innovation and culture with the flexibility of working from home. Join us in creating autonomy that propels the world forward and make Wayve the defining experience of your career!