Tesla's Simulations Team is seeking a Senior Site Reliability Engineer to lead its Simulation Cluster Infrastructure. The role is central to managing the large-scale software infrastructure that simulates billions of miles of vehicle driving every day. The position offers an opportunity to work on cutting-edge technology in the electric vehicle industry.
The role involves leading major initiatives such as implementing a new-generation Processor-in-the-Loop cluster infrastructure, managing Bazel/Buck Remote Execution clusters, and establishing distributed observability at scale. You'll work with Kubernetes and Tesla's internal orchestration systems to build robust, reliable infrastructure.
As a Senior SRE, you'll join a team of expert engineers dedicated to revolutionizing electric vehicle production through advanced software development. The position requires strong experience with Linux internals and Kubernetes, and proficiency in a language such as Python, Rust, or Go. You'll be responsible for designing and implementing observability infrastructure, automating deployment processes, and ensuring a scalable, cloud-first architecture.
The compensation package ranges from $140,000 to $300,000 annually, plus additional cash and stock awards. Tesla offers comprehensive benefits, including medical, dental, and vision coverage, 401(k) matching, stock purchase plans, and various family-friendly benefits. This is an opportunity for an experienced SRE to make a significant impact in the automotive industry.