Software Engineer, ML Infra, Dojo

Tesla is an automotive and clean energy company that develops electric vehicles and AI-powered autonomous driving technology.
$120,000 - $318,000
Machine Learning
Mid-Level Software Engineer
In-Person
5,000+ Employees
3+ years of experience
AI · Automotive

Description For Software Engineer, ML Infra, Dojo

Tesla is seeking a Machine Learning Software Engineer to join their Dojo team, focusing on their custom-built supercomputer designed for training autonomous driving models. This role represents a unique opportunity to work at the intersection of artificial intelligence and hardware acceleration, specifically designed for Tesla's Full Self-Driving (FSD) technology.

The position involves collaborating with world-class Machine Learning Researchers, Compiler Engineers, and Hardware Engineers to optimize and scale neural network training infrastructure. You'll be responsible for bridging the gap between Tesla's cutting-edge Dojo training accelerator and the neural networks developed by the Autopilot ML team.

Key responsibilities include profiling and optimizing training workloads, maintaining high cluster availability, and integrating training software into continuous integration systems. You'll work directly with FSD models on Tesla's proprietary ML training accelerator, focusing on improving throughput and reducing convergence time.

The ideal candidate should have a strong background in both software engineering and machine learning, with practical experience in Python/C++ programming and familiarity with training frameworks like PyTorch. System-level software expertise and understanding of hardware-software interactions are crucial for this role.

Tesla offers a comprehensive benefits package including competitive salary, equity opportunities, and extensive health coverage. The company's mission to accelerate the world's transition to sustainable energy, combined with the opportunity to work on cutting-edge AI technology, makes this an exciting opportunity for engineers passionate about pushing the boundaries of what's possible in autonomous driving and machine learning infrastructure.

Last updated 2 months ago

Responsibilities For Software Engineer, ML Infra, Dojo

  • Work with Machine Learning Researchers and Engineers to run FSD models on in-house ML training accelerator
  • Profile performance of training workloads and optimize throughput and scalability
  • Coordinate with hardware cluster team to maintain high availability
  • Integrate training software into continuous integration cluster
  • Support metrics persistence across experiments and neural network builds

Requirements For Software Engineer, ML Infra, Dojo

Python
  • Degree in Engineering, Computer Science, or equivalent experience
  • Practical experience programming in Python and/or C++
  • Experience working with training frameworks, ideally PyTorch
  • Proficient in system-level software
  • Understanding of modern machine learning concepts
  • Experience in profiling and optimizing CPU-accelerator interactions
  • Devops experience with training clusters and large-scale filesystems

Benefits For Software Engineer, ML Infra, Dojo

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
Commuter Benefits
  • Medical plans with $0 payroll deduction
  • Family-building, fertility, adoption and surrogacy benefits
  • Dental and vision plans
  • Company Paid HSA Contribution
  • Healthcare and Dependent Care FSA
  • 401(k) with employer match
  • Employee Stock Purchase Plans
  • Company paid Basic Life, AD&D, disability insurance
  • Employee Assistance Program
  • Sick and Vacation time
  • Back-up childcare
  • Commuter benefits
  • Employee discounts

Interested in this job?

Jobs Related To Tesla Software Engineer, ML Infra, Dojo

Software Engineer, Image Processing, Cell Qualification

Software Engineer position at Tesla focusing on developing computer vision algorithms for battery cell analysis and qualification.

Software Engineer, Scenario Creation, Simulation

Software Engineer position at Tesla focusing on developing simulation scenarios and validation tools for Autopilot system, requiring 3+ years of software development experience.

Software Validation Engineer, Integration, AI Platforms

Software Validation Engineer role at Tesla, focusing on AI platform integration for autonomous vehicles and robotics systems.

Software Engineer, Generalist, AI Infrastructure

Software Engineer position at Tesla focusing on AI infrastructure development and optimization for autonomous driving systems.

Software Engineer, Machine Learning Infrastructure, Tesla Bot

Build and maintain machine learning infrastructure for Tesla Bot, focusing on neural network architecture, training optimization, and model deployment.