Software Engineer, Performance, AI Infrastructure

Tesla is an automotive and technology company leading in electric vehicles and AI development.
$104,000 - $360,000
Machine Learning
Senior Software Engineer
In-Person
5+ years of experience
AI · Automotive · Robotics
This job posting may no longer be active. You may be interested in these related jobs instead:
Developer Technology Engineer - HPC and AI

Senior Developer Technology Engineer position at NVIDIA focusing on HPC and AI, requiring 3+ years experience and advanced degree, based in Seoul.

Sr. Software Development Engineer, Demand Science Optimization (DSO)

Senior Software Engineering role at Amazon focusing on machine learning and big data analytics for device demand forecasting and supply chain optimization.

Delivery Consultant - Machine Learning Engineer, WWPS ProServe

Senior ML Engineering role at AWS Professional Services, focusing on implementing machine learning solutions for enterprise customers using AWS cloud services.

Senior Software Engineer, LLM Inference

Senior Software Engineer position at NVIDIA focusing on LLM Inference development, requiring expertise in C++, deep learning, and AI technologies.

Generative AI Engineer - Model Optimization & Evaluation

Senior AI Engineering role focused on optimizing and evaluating transformer-based models, requiring expertise in model compression, quantization, and deployment across various computing environments.

Description For Software Engineer, Performance, AI Infrastructure

Tesla is seeking a Senior Software Engineer to join their AI Infrastructure team, focusing on performance optimization for neural network training systems. This role is crucial for both the Autopilot and Humanoid robot initiatives, working with state-of-the-art GPU clusters and Tesla's supercomputer, Dojo. The position demands expertise in CUDA programming, deep learning frameworks, and high-performance computing. You'll be responsible for optimizing training workflows, reducing model convergence time, and maximizing hardware efficiency. Tesla offers a comprehensive benefits package and the opportunity to work on cutting-edge AI applications in autonomous driving and robotics. The role combines deep technical expertise with practical implementation in one of the most advanced AI infrastructure environments. This is an excellent opportunity for experienced engineers passionate about pushing the boundaries of AI performance and scalability.

Last updated 4 months ago

Responsibilities For Software Engineer, Performance, AI Infrastructure

  • Reduce wall clock time to convergence of training jobs by identifying bottlenecks in the ML stack
  • Integrate efficient, low-level code with the overall high-level training framework
  • Profile workloads and implement solutions to increase training efficiency
  • Optimize workloads for efficient hardware utilization (CPU, GPU compute, data throughput, networking)

Requirements For Software Engineer, Performance, AI Infrastructure

Python
Linux
  • Extensive experience in CUDA kernel programming and pushing GPUs to their limits
  • Experience programming in Python
  • Experience with at least one deep learning framework (ideally in PyTorch)
  • Demonstrated experience in profiling CPU/GPU code
  • Proficient in system-level software, hardware-software interactions and resource utilization
  • Good knowledge of CUDA kernels used in training state-of-the-art deep learning models
  • Experience with high-performance networking (Infiniband, RDMA, NCCL)
  • Experience with Triton (preferred)

Benefits For Software Engineer, Performance, AI Infrastructure

Medical Insurance
Dental Insurance
Vision Insurance
401k
Mental Health Assistance
Parental Leave
Commuter Benefits
  • Medical plans with $0 payroll deduction
  • Family-building, fertility, adoption and surrogacy benefits
  • Dental and vision plans with $0 paycheck contribution
  • Company Paid HSA Contribution
  • Healthcare and Dependent Care FSA
  • 401(k) with employer match
  • Employee Stock Purchase Plans
  • Company paid Basic Life, AD&D, short-term and long-term disability insurance
  • Employee Assistance Program
  • Sick and Vacation time
  • Back-up childcare and parenting support
  • Commuter benefits
  • Employee discounts and perks program

Interested in this job?