Production Systems Engineer, Fleet AI Systems Lead

Meta builds technologies that help people connect, find communities, and grow businesses through social technology and immersive experiences like AR and VR.
$163,000 - $225,000
Cloud
Staff Software Engineer
In-Person
5,000+ Employees
7+ years of experience
Enterprise SaaS · AI

Description For Production Systems Engineer, Fleet AI Systems Lead

Meta is seeking an experienced Production Systems Engineer to join their Release to Production (RTP) team, focusing on the hardware lifecycle of Meta's server infrastructure. This role combines hardware engineering with cloud infrastructure management, requiring expertise in both systems architecture and large-scale deployment. The position offers an opportunity to work with cutting-edge technology in Meta's data centers, collaborating with various teams including hardware designers, system manufacturers, and component vendors.

The role involves leading the development and execution of test suites for various architectures, creating tooling for hardware health monitoring, and implementing large-scale automation solutions. You'll be responsible for troubleshooting complex system failures, developing visibility tools, and establishing industry-leading practices for hardware infrastructure support at scale.

As a Staff Engineer level position, you'll have significant impact on Meta's infrastructure, working with both internal teams and external vendors. The role offers competitive compensation ($163,000-$225,000/year) plus bonus and equity, and is based in either Bellevue, WA or Menlo Park, CA.

This is an ideal opportunity for experienced engineers who want to work at the intersection of hardware and software, particularly those interested in AI systems and large-scale infrastructure. The role requires strong technical leadership, problem-solving abilities, and excellent communication skills to work effectively across multiple teams and stakeholders.

Last updated 2 days ago

Responsibilities For Production Systems Engineer, Fleet AI Systems Lead

  • Lead interfacing with external vendors and internal teams to develop and execute test suites
  • Contribute as a Tech Lead, creating experiments and tooling to detect hardware/firmware/software health issues
  • Develop test framework for large-scale test automation
  • Implement remediations across software and hardware stack
  • Develop and publish updates on resolutions
  • Troubleshoot, diagnose and root cause system failures
  • Develop visibility through data visualization
  • Drive discussions on test specification and methodologies
  • Develop robust practices for supporting hardware infrastructure at scale

Requirements For Production Systems Engineer, Fleet AI Systems Lead

Python
Linux
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 7+ years of experience in hardware server system support, troubleshooting server architecture and components
  • Expertise with Linux and scripting (Python or similar)
  • 5+ years of experience in changing system configurations and measuring change impact
  • 3+ years of experience engineering innovations in support of different server system/data center products

Benefits For Production Systems Engineer, Fleet AI Systems Lead

  • bonus
  • equity
  • benefits

Interested in this job?

Jobs Related To Meta Production Systems Engineer, Fleet AI Systems Lead

SiteOps Area Capacity Engineer

Meta is seeking an experienced SiteOps Area Capacity Engineer to lead data center capacity planning and operations.

Systems Integration Engineer

Lead Systems Integration Engineer role at Meta, focusing on hardware platform integration and infrastructure technologies in data centers, with emphasis on AI/ML systems.

Technical Sourcing Manager, Advanced Thermal Technologies

Technical Sourcing Manager for Advanced Thermal Technologies at Meta, developing strategies for AI & Non-AI HW Platforms cooling solutions.

Senior Technical Program Manager I, Google Cloud

Senior Technical Program Manager position at Google Cloud, leading complex technical projects with 8+ years experience required, offering competitive compensation $183K-$271K+benefits.

Field Sales Manager, Startups and Corporate, Google Cloud

Lead Google Cloud's sales team focusing on startups and corporate clients, managing Field Sales Representatives and driving cloud solution adoption.