Network Production Engineer - Network.AI

Meta builds technologies that help people connect, find communities, and grow businesses through social platforms like Facebook, Instagram, WhatsApp, and immersive AR/VR experiences.
$147,000 - $208,000
Backend
Senior Software Engineer
In-Person
5+ years of experience
AI · Enterprise SaaS

Description For Network Production Engineer - Network.AI

Meta is seeking a Network Production Engineer to join their Network AI team, focusing on maintaining and optimizing the backend datacenter networks that support their GPU-based AI Training Clusters. This role combines software engineering and network engineering expertise to ensure the robust performance and reliability of Meta's AI infrastructure.

The position requires a deep understanding of network architecture, hardware, and software systems, with responsibilities ranging from writing code and developing documentation to troubleshooting complex network issues in real-time. You'll be working with some of the largest and most complex networks in the world, directly impacting Meta's AI capabilities and infrastructure reliability.

As a Network Production Engineer, you'll be responsible for building tools and implementing automation to efficiently scale network impact mitigation, investigate performance trends, and drive innovative monitoring solutions. The role involves participating in on-call rotations, performing deep technical analysis, and contributing to team growth through mentorship.

The ideal candidate should have extensive experience in both software development and network engineering, with proven expertise in programming languages like Python or Go, and a strong understanding of datacenter networking concepts. You'll be working in an environment that offers new challenges daily, requiring both technical depth and the ability to influence and collaborate across teams.

This position offers competitive compensation ranging from $147,000 to $208,000 annually, plus bonus and equity opportunities. You'll be working at Meta's Menlo Park location, contributing to the company's mission of connecting people and advancing AI technology. The role provides an opportunity to work on cutting-edge AI infrastructure while solving complex technical challenges at scale.

Meta provides a comprehensive benefits package and maintains a strong commitment to diversity, equality, and inclusion. This role offers significant growth potential and the chance to work with leading-edge technology in the AI and networking space.

Last updated 15 hours ago

Responsibilities For Network Production Engineer - Network.AI

  • Write and review code, develop documentation and capacity plans
  • Participate in weekly on-call rotation and handle service incidents
  • Perform deep dives on complex technical issues across networks
  • Analyze data to diagnose and identify root causes to network issues
  • Define, develop, and optimize automated network monitoring systems
  • Proactively find gaps that impact multiple teams and drive projects
  • Contribute to team growth through peer mentorship

Requirements For Network Production Engineer - Network.AI

Python
Go
  • Bachelor's degree in Computer Science, Computer Engineering, or equivalent practical experience
  • 4+ years experience coding in higher-level languages (Python, C++, Go, etc.)
  • 5+ years experience understanding and mitigating network hardware and topology failures
  • Experience in configuration and maintenance of network devices and NMS systems
  • Experience learning software, frameworks and APIs
  • Experience developing network device configuration for at least one vendor
  • Knowledge in routing and switching
  • Expert knowledge of data center networking concepts

Benefits For Network Production Engineer - Network.AI

Medical Insurance
Equity
  • bonus
  • equity
  • benefits

Interested in this job?

Jobs Related To Meta Network Production Engineer - Network.AI

Performance and Capacity Engineer

Senior Performance and Capacity Engineer role at Meta focusing on infrastructure scaling and performance optimization.

Production Systems Engineer, AI Systems

Meta is hiring a Production Systems Engineer for AI Systems to work on network technologies for large-scale AI training and inference.

Software Engineer

Meta is hiring a Senior Software Engineer in Bellevue, WA to work on large-scale infrastructure applications and build new features for their suite of products.

ASIC Engineer, Design Verification

ASIC Design Verification Engineer at Meta, developing innovative solutions for data center applications.

Software Engineer, Systems

Meta is hiring a Software Engineer, Systems to build next-gen systems for Facebook's products, creating web apps for millions and designing core backend components.