Taro Logo

Network Production Engineer - Network.AI

Meta builds technologies that help people connect, find communities, and grow businesses through social platforms like Facebook, Instagram, WhatsApp, and immersive AR/VR experiences.
$147,000 - $208,000
Backend
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS
This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Network Production Engineer - Network.AI

Meta is seeking a Network Production Engineer to join their Network AI team, focusing on maintaining and optimizing the backend datacenter networks supporting GPU-based AI Training Clusters. This role combines software engineering and network engineering expertise to ensure the robust performance and reliability of Meta's critical AI infrastructure.

The position offers an exciting opportunity to work on some of the largest and most complex networks in the world, directly impacting Meta's AI capabilities that drive content recommendations, ads relevance, and user experience improvements. You'll be responsible for building tools, implementing automation, and conducting deep technical investigations to enhance network performance and reliability.

As a Network Production Engineer, you'll be part of a team that handles real-time network issues, analyzes long-term performance trends, and develops innovative monitoring solutions. The role requires strong technical skills in both networking and software development, with opportunities to work on challenging problems at unprecedented scale.

The ideal candidate should have extensive experience in network infrastructure, coding proficiency in languages like Python or Go, and a deep understanding of datacenter networking concepts. You'll be working in Meta's collaborative environment, contributing to the company's AI infrastructure while having access to cutting-edge technology and resources.

This role offers competitive compensation, including base salary, bonus, equity, and comprehensive benefits. Join Meta's Network.AI team to shape the future of AI infrastructure while working on technically challenging problems that impact billions of users globally.

Last updated 7 months ago

Responsibilities For Network Production Engineer - Network.AI

  • Write and review code, develop documentation and capacity plans
  • Participate in a weekly on-call rotation and be an escalation contact for service incidents
  • Perform deep dives on complex technical issues across networks
  • Analyze data to diagnose and identify root causes to network issues
  • Define, develop, and optimize automated network monitoring systems
  • Proactively find gaps that impact multiple teams and drive projects
  • Contribute to team growth and development through peer mentorship

Requirements For Network Production Engineer - Network.AI

Python
Go
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 4+ years experience coding in higher-level languages (Python, C++, Go, etc.)
  • 5+ years experience understanding and mitigating network hardware and topology failures
  • Experience in configuration and maintenance of network devices and NMS systems
  • Experience learning software, frameworks and APIs
  • Experience developing and understanding network device configuration
  • Knowledge in routing and switching - hardware design and knowledge of forwarding and data planes
  • Expert knowledge of data center networking concepts

Benefits For Network Production Engineer - Network.AI

Medical Insurance
Equity
  • bonus
  • equity
  • benefits

Interested in this job?