Production Systems Engineer, Sustaining

Meta builds technologies that help people connect, find communities, and grow businesses through social platforms like Facebook, Instagram, WhatsApp, and virtual reality experiences.
$132,000 - $191,000
DevOps
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS

Description For Production Systems Engineer, Sustaining

Meta is seeking an experienced Production Systems Engineer to join their Release to Production (RTP) team, focusing on the hardware lifecycle of Meta's server infrastructure. This role sits at the intersection of hardware and software, requiring expertise in AI/HPC systems and large-scale infrastructure support. The position involves working with cutting-edge technology in Meta's data centers, collaborating with various teams including hardware designers, system manufacturers, and component vendors.

The role demands a strong background in both hardware systems and software development, with particular emphasis on AI/HPC infrastructure. You'll be responsible for developing and implementing testing practices, troubleshooting complex system issues, and ensuring the reliability of Meta's server infrastructure. The position offers competitive compensation ranging from $132,000 to $191,000 annually, plus bonus and equity opportunities.

This is an excellent opportunity for experienced engineers who want to work at the forefront of technology infrastructure at one of the world's leading tech companies. You'll be part of Meta's mission to build technologies that connect people and push the boundaries of social technology, including ventures into augmented and virtual reality. The role offers the chance to work on challenging technical problems at scale while collaborating with top talent in the industry.

The ideal candidate will bring at least 5 years of experience in hardware systems or production hardware support, with strong programming skills and a deep understanding of server and network datacenter systems. This position is perfect for someone who enjoys working at the intersection of hardware and software, has a passion for system optimization, and wants to make an impact on infrastructure that serves billions of users worldwide.

Last updated 11 hours ago

Responsibilities For Production Systems Engineer, Sustaining

  • Develop robust, industry leading practices for supporting AI/HPC infrastructure at scale
  • Interface with external vendors and internal teams to develop and execute test suites
  • Create experiments and tooling to detect and diagnose hardware/firmware/software health issues
  • Implement sustaining workflows across hardware and software stacks
  • Troubleshoot, diagnose and root cause system failures
  • Drive discussions on test specification and methodologies improvement

Requirements For Production Systems Engineer, Sustaining

Python
Linux
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 5+ years experience in hardware systems technologies or supporting production hardware at scale
  • Experience in deploying and productionizing AI/HPC systems at scale
  • Experience in software and hardware co-design for hyperscale systems
  • Experience in object oriented programming (e.g., Python, C/C++)
  • Engineering for different server and network datacenter systems

Benefits For Production Systems Engineer, Sustaining

Medical Insurance
Equity
  • Base salary
  • Bonus
  • Equity
  • Benefits package

Interested in this job?

Jobs Related To Meta Production Systems Engineer, Sustaining

Network Operations Engineer

Senior Network Operations Engineer position at Meta focusing on improving operations efficiency and reliability of large-scale network infrastructure through automation and technical solutions.

Production Engineer

Production Engineer position at Meta focusing on maintaining and scaling infrastructure and services used by billions of users worldwide.

Production Engineering

Senior Production Engineer role at Meta combining software development and systems engineering to maintain and scale infrastructure serving billions of users.

Production Engineer

Senior Production Engineer role at Meta, combining software development with infrastructure management, offering $177K-$251K plus benefits.

Production Engineer

Senior Production Engineer position at Meta, combining software and systems engineering to maintain and scale massive infrastructure serving billions of users.