Sr. Site Reliability Engineer, Dojo

Tesla is a leading electric vehicle and clean energy company pioneering sustainable transportation and energy solutions.
$120,000 - $228,000
Site Reliability
Senior Software Engineer
In-Person
3+ years of experience
AI

Description For Sr. Site Reliability Engineer, Dojo

Tesla is seeking a Senior Site Reliability Engineer to join its Dojo cluster infrastructure team. This role combines technical expertise with customer support, focusing on maintaining and optimizing critical infrastructure systems. The position offers a competitive salary range of $120,000-$228,000 plus additional benefits.

As an SRE, you'll be responsible for ensuring the reliability and performance of Tesla's Dojo cluster infrastructure. Your daily activities will involve troubleshooting complex systems, implementing automation solutions, and collaborating with various teams and vendors. The role requires strong Linux knowledge, networking expertise, and experience with modern monitoring tools.

The ideal candidate brings 3+ years of SRE experience and excels in problem-solving and communication. You'll work in Tesla's innovative environment, contributing to the company's AI infrastructure while enjoying comprehensive benefits including medical coverage, a 401(k) with employer match, and an employee stock purchase plan.

This position offers an exciting opportunity to work with cutting-edge technology in Tesla's AI division, making a direct impact on the company's autonomous driving capabilities. You'll be part of a team that values technical excellence, innovation, and collaborative problem-solving, while enjoying the stability and benefits of working for a leading technology company.

Responsibilities For Sr. Site Reliability Engineer, Dojo

  • Respond to customer inquiries and resolve issues in a timely manner
  • Manage and prioritize change requests for cluster operations
  • Collaborate with third-party storage vendors to resolve issues and outages
  • Troubleshoot and debug storage-related problems
  • Work with network vendors to debug and resolve issues
  • Create visibility into network issues and implement monitoring tools
  • Collaborate with facility and operations teams for maintenance and upgrades
  • Ensure seamless communication during planned and unplanned outages
  • Develop and implement automation scripts for hardware monitoring

Requirements For Sr. Site Reliability Engineer, Dojo

Python
Linux
  • 3+ years of experience in an SRE or infrastructure engineering role
  • Strong understanding of Linux, networking, and storage systems
  • Excellent problem-solving and troubleshooting skills
  • Experience with automation tools such as Ansible and Python
  • Strong communication and collaboration skills
  • Ability to work in a fast-paced environment
  • Familiarity with monitoring tools like Prometheus, Grafana, or ELK preferred
  • Experience with cloud-based infrastructure preferred

Benefits For Sr. Site Reliability Engineer, Dojo

401k
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Assistance
Parental Leave
Commuter Benefits
  • Aetna PPO and HSA plans with $0 payroll deduction
  • Family-building, fertility, adoption and surrogacy benefits
  • Dental and vision plans with $0 paycheck contribution
  • Company Paid HSA Contribution
  • Healthcare and Dependent Care FSA
  • 401(k) with employer match
  • Employee Stock Purchase Plans
  • Company paid Basic Life, AD&D, disability insurance
  • Employee Assistance Program
  • Sick and Vacation time
  • Back-up childcare and parenting support
  • Commuter benefits
  • Employee discounts and perks program

Jobs Related To Tesla Sr. Site Reliability Engineer, Dojo

Sr. Site Reliability Engineer, Energy

Senior Site Reliability Engineer position at Tesla, focusing on energy IoT applications and infrastructure, offering competitive salary and comprehensive benefits.

Sr. Site Reliability Engineer, Simulation Cluster Infrastructure

Senior SRE position at Tesla leading simulation infrastructure, focusing on Kubernetes, distributed systems, and cloud architecture with competitive compensation.

Site Reliability Engineer, AI Infrastructure

Senior Site Reliability Engineer position at Tesla, focusing on AI infrastructure maintenance and optimization for autonomous driving and robotics projects.

Sr. Site Reliability Engineer, Vehicle Software

Senior SRE position at Tesla leading simulation infrastructure initiatives for vehicle software, offering competitive compensation and comprehensive benefits.
