Meta is seeking a Software Engineer, Infrastructure to join their MRS ML Infra team, focusing on ML infrastructure performance and efficiency for large-scale AI training and inference workflows in the recommendation domain. This role combines distributed systems expertise with ML infrastructure optimization, requiring both technical depth and leadership skills.
The position involves working on optimizing end-to-end stacks for model training and inference for large-scale recommendation models, with opportunities in distributed systems, model/system co-design, and GPU optimizations. You'll be responsible for identifying and leading short/mid-term efficiency optimization initiatives while also driving long-term strategies for performance automation and regression detection.
As a senior technical leader, you'll guide cross-functional teams, mentor other engineers, and shape the technical direction of the team. The ideal candidate brings 5+ years of AI infrastructure experience, strong system optimization skills, and a proven track record of technical leadership.
Working at Meta offers the opportunity to impact billions of users through their suite of applications including Facebook, Instagram, WhatsApp, and their emerging AR/VR technologies. The company is at the forefront of developing next-generation social technologies, pushing beyond traditional digital connections into immersive experiences.
This role offers the chance to work on cutting-edge ML infrastructure at massive scale, collaborate with world-class engineers, and shape the future of AI systems at one of the world's leading technology companies. Join Meta to tackle complex technical challenges while growing your career in a dynamic, innovative environment.