Meta is seeking a Network Production Engineer to join its Network AI team, with a focus on maintaining and optimizing the backend datacenter networks that support GPU-based AI Training Clusters. This role combines software engineering and network engineering expertise to ensure the robust performance and reliability of Meta's critical AI infrastructure.
The position offers an exciting opportunity to work on some of the largest and most complex networks in the world, directly impacting Meta's AI capabilities that drive content recommendations, ads relevance, and user experience improvements. You'll be responsible for building tools, implementing automation, and conducting deep technical investigations to enhance network performance and reliability.
As a Network Production Engineer, you'll be part of a team that handles real-time network issues, analyzes long-term performance trends, and develops innovative monitoring solutions. The role requires strong technical skills in both networking and software development, with opportunities to work on challenging problems at unprecedented scale.
The ideal candidate has extensive experience in network infrastructure, coding proficiency in languages such as Python or Go, and a deep understanding of datacenter networking concepts. You'll work in Meta's collaborative environment and contribute to the company's AI infrastructure, with access to cutting-edge technology and resources.
This role offers competitive compensation, including base salary, bonus, equity, and comprehensive benefits. Join Meta's Network.AI team to shape the future of AI infrastructure while working on technically challenging problems that impact billions of users globally.