LinkedIn is seeking a Principal Staff Software Engineer to join their AI Platform group, specifically focusing on the AI Training team. This role is crucial in developing and maintaining highly available and scalable deep learning training solutions that power LinkedIn's growing AI initiatives. The position involves working with cutting-edge technologies including large language models, recommendation systems, and computer vision models.
The role combines deep technical expertise with leadership responsibilities, requiring the ability to optimize training performance across algorithms, AI frameworks, infrastructure software, and hardware. You'll be working with a state-of-the-art GPU fleet and collaborating with the open source community, particularly on projects involving TensorFlow, Horovod, Ray, and Hadoop.
As a Principal Staff Engineer, you'll be responsible for leading the technical strategy for complex requirements, implementing large-scale distributed training systems, and mentoring other engineers. The position offers the opportunity to work with advanced technologies like LLMs, GNNs, and Flash Attention, while contributing to the open-source community.
The ideal candidate will have extensive experience in software development, deep learning systems, and technical leadership. The role offers competitive compensation, comprehensive benefits, and the opportunity to work in a hybrid environment at LinkedIn's offices in Mountain View, CA, San Francisco, CA, or Bellevue, WA.
This is an excellent opportunity for someone passionate about AI infrastructure who wants to make a significant impact on LinkedIn's AI capabilities while working with cutting-edge technologies and leading technical initiatives. The role combines technical depth with leadership opportunities, making it ideal for those looking to advance their careers in AI infrastructure development.