The Generative AI Org at Meta is seeking a strong technical leader to join our team and work on the next generation of large language models, particularly focusing on building new capabilities for Llama inspired by internal use cases. As a technical leader, you will play a critical role in building our series of efficient Llama models and building new capabilities on top of them. You will work with internal clients to understand their needs and push the boundaries of text LLMs via breakthroughs in several capabilities.
Responsibilities:
- Drive efficiency gains on training and deployment of LLMs through novel techniques
- Drive end-to-end development of LLMs models, including data sourcing and curation, filtering, experiment design, evaluation and more
- Lead a team of applied researchers to democratize Llama for Meta's users
- Communicate, collaborate, and build relationships with clients and peer teams to facilitate cross-functional projects
- Remain up-to-date on ongoing research and software development activities in the team, help work through technical challenges, and be involved in design decisions
- Remain deeply involved in the research community, both understanding trends, and setting them
Minimum Qualifications:
- 5+ years of hands-on experience in large language model, NLP, and Transformer modeling, in the setting of both research and engineering development
- Experience and track of recording in landing large research and/or product impacts in a fast-paced environment
- 3+ years of hands-on supporting and leading teams of research scientists and software engineers
- Proven technical vision in where the field of generative AI will go
- Experience of and knowledge of model efficiency techniques (quantization, distillation, etc.)
- Experience with cross functional collaboration with product and platform teams, as well as non-engineering functions
- Demonstrated experience recruiting, building, structuring, leading technical organizations, including performance management
Preferred Qualifications:
- PhD in deep learning, artificial intelligence, and/or related technical field
- Experience and knowledge of ML frameworks like PyTorch, TensorFlow, etc.
- Experience and knowledge of large-scale data platforms such as Spark, Hive, etc.
- Experience and knowledge of working with LLM frameworks like LangChain
- Experience and knowledge of training LLMs, fine-tuning on datasets, especially LLaMa
Meta is committed to providing reasonable accommodations for candidates with disabilities, long term conditions, mental health conditions or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support.