Tesla is seeking a Machine Learning Software Engineer to join their Dojo team, focusing on their custom-built supercomputer designed for training autonomous driving models. This role represents a unique opportunity to work at the intersection of artificial intelligence and hardware acceleration, specifically designed for Tesla's Full Self-Driving (FSD) technology.
The position involves collaborating with world-class Machine Learning Researchers, Compiler Engineers, and Hardware Engineers to optimize and scale neural network training infrastructure. You'll be responsible for bridging the gap between Tesla's cutting-edge Dojo training accelerator and the neural networks developed by the Autopilot ML team.
Key responsibilities include profiling and optimizing training workloads, maintaining high cluster availability, and integrating training software into continuous integration systems. You'll work directly with FSD models on Tesla's proprietary ML training accelerator, focusing on improving throughput and reducing convergence time.
The ideal candidate should have a strong background in both software engineering and machine learning, with practical experience in Python/C++ programming and familiarity with training frameworks like PyTorch. System-level software expertise and understanding of hardware-software interactions are crucial for this role.
Tesla offers a comprehensive benefits package including competitive salary, equity opportunities, and extensive health coverage. The company's mission to accelerate the world's transition to sustainable energy, combined with the opportunity to work on cutting-edge AI technology, makes this an exciting opportunity for engineers passionate about pushing the boundaries of what's possible in autonomous driving and machine learning infrastructure.