Meta is seeking an experienced Production Systems Engineer to join its Release to Production (RTP) team, focusing on the hardware lifecycle of Meta's server infrastructure. This role combines hardware engineering with cloud infrastructure management, requiring expertise in both systems architecture and large-scale deployment. The position offers an opportunity to work with cutting-edge technology in Meta's data centers and to collaborate with hardware designers, system manufacturers, and component vendors.
The role involves leading the development and execution of test suites for various architectures, creating tooling for hardware health monitoring, and implementing large-scale automation solutions. You'll be responsible for troubleshooting complex system failures, developing visibility tools, and establishing industry-leading practices for hardware infrastructure support at scale.
This Staff Engineer-level position gives you significant impact on Meta's infrastructure as you work with both internal teams and external vendors. The role offers competitive compensation ($163,000-$225,000/year) plus bonus and equity, and is based in either Bellevue, WA or Menlo Park, CA.
This is an ideal opportunity for experienced engineers who want to work at the intersection of hardware and software, particularly those interested in AI systems and large-scale infrastructure. The role requires strong technical leadership, problem-solving ability, and excellent communication skills to work effectively across multiple teams and stakeholders.