Principal Staff Software Engineer, AI Training Platform

LinkedIn is the world's largest professional network, built to help members of all backgrounds and experiences achieve more in their careers.
$207,000 - $340,000
Machine Learning
Principal Software Engineer
Hybrid
5,000+ Employees
7+ years of experience
AI · Enterprise SaaS

Description For Principal Staff Software Engineer, AI Training Platform

LinkedIn is seeking a Principal Staff Software Engineer to join their AI Platform group, specifically focusing on the AI Training team. This role is pivotal in developing and maintaining highly available and scalable deep learning training solutions that power LinkedIn's expanding AI initiatives. The team is responsible for scaling AI model training with hundreds of billions of parameters across recommendation systems, large language models (Generative AI), and computer vision models.

The position involves working with cutting-edge technologies and optimizing training performance across multiple dimensions: algorithms, AI frameworks, infrastructure software, and hardware. The team manages thousands of latest GPU cards and collaborates closely with the open source community, with many team members being active contributors to projects like TensorFlow, Horovod, Ray, and Hadoop.

As a Principal Staff Software Engineer, you'll lead the development of next-generation training infrastructure, focusing on high-performance AI training pipelines, data I/O optimization, and working with popular libraries like Huggingface, Horovod, and PyTorch. You'll be responsible for debugging and optimizing deep learning training, implementing advanced features like model parallelism, data parallelism, Zero, and automatic mixed precision.

The role offers the opportunity to work with state-of-the-art AI technologies, including LLMs, GNNs, and advanced LLM Agents. You'll be instrumental in developing containerized pipeline orchestration infrastructure and maintaining deep learning frameworks. The position combines technical leadership with hands-on development, requiring both architectural vision and practical implementation skills.

LinkedIn offers a collaborative environment where innovation is encouraged, and your work will directly impact millions of users worldwide. The company provides competitive compensation, comprehensive benefits, and the opportunity to work with leading experts in AI and distributed systems. This role is perfect for someone passionate about large-scale AI infrastructure who wants to shape the future of professional networking through advanced machine learning technologies.

Last updated 21 hours ago

Responsibilities For Principal Staff Software Engineer, AI Training Platform

  • Owning the technical strategy for broad or complex requirements with insightful approaches
  • Designing and implementing large-scale distributed training for recommendation and language models
  • Improving observability and developer productivity of various systems
  • Mentoring other engineers and defining technical culture
  • Working with open-source community on cutting edge projects
  • Functioning as tech-lead for key Training Infrastructure initiatives

Requirements For Principal Staff Software Engineer, AI Training Platform

Python
Java
Go
Rust
Scala
Kubernetes
  • BS/BA in Computer Science or related technical field or equivalent technical experience
  • 7+ years of industry experience in software design, development, and algorithm related solutions
  • 7+ years of programming in object-oriented languages such as Python, C++, Java, Go, Rust, Scala
  • 5+ years of experience as an architect, or technical leadership position
  • 5+ years of experience in the industry with leading / building deep learning systems
  • Hands-on experience developing distributed systems or other large-scale systems

Benefits For Principal Staff Software Engineer, AI Training Platform

Medical Insurance
Vision Insurance
Dental Insurance
401k
Parental Leave
Commuter Benefits
  • Medical insurance
  • Vision insurance
  • Dental insurance
  • 401(k)
  • Parental leave
  • Commuter benefits
  • Hybrid work options

Interested in this job?

Jobs Related To LinkedIn Principal Staff Software Engineer, AI Training Platform

Principal Product Manager, Search

Principal Product Manager position at LinkedIn leading Search team, focusing on AI and Infrastructure to enhance search experience for 1B+ members.

Principal Staff Software Engineer, AI Training Platform

Principal Staff Software Engineer position at LinkedIn, focusing on AI Training Platform development and leadership in Mountain View, CA with competitive compensation $207K-$340K.

Principal Staff Engineer - Video AI

Principal Staff Engineer position focused on Video AI technologies at LinkedIn, leading technical initiatives and architectural decisions for video understanding and personalization.

Director of Engineering - AI

Lead LinkedIn's Marketing Solutions AI Engineering Team, directing AI/ML initiatives for $6B+ advertising platform with competitive compensation $218K-$357K.

Principal Staff Engineer - Video AI

Principal Staff Engineer position leading Video AI initiatives at LinkedIn, focusing on developing cutting-edge video understanding and recommendation systems.