LinkedIn, the world's largest professional network, is seeking a Principal Staff Software Engineer to join their AI Platform group's Training team. This role is pivotal in developing and maintaining highly scalable deep learning training solutions that power LinkedIn's growing AI initiatives.
The position offers a hybrid work arrangement with locations in Mountain View, San Francisco, or Bellevue, allowing flexibility to work from home while maintaining team collaboration. As part of the AI Training team, you'll be responsible for scaling LinkedIn's AI model training capabilities to handle hundreds of billions of parameters across various use cases, from recommendation systems to large language models and computer vision applications.
The role involves working with cutting-edge technologies including LLMs, GNNs, and advanced LLM Agents. You'll optimize training performance across multiple dimensions - algorithms, AI frameworks, infrastructure software, and hardware - to maximize the potential of LinkedIn's extensive GPU fleet. The team has strong ties to the open source community, with many team members being active contributors to projects like TensorFlow, Horovod, and Ray.
Key responsibilities include leading technical strategy development, implementing large-scale distributed training systems, improving system observability, mentoring team members, and collaborating with the open-source community. The ideal candidate will have extensive experience in software development, deep learning systems, and technical leadership.
The position offers competitive compensation ($207,000 - $340,000) and comprehensive benefits. This is an exceptional opportunity to work on cutting-edge AI infrastructure at scale, influence the direction of LinkedIn's AI capabilities, and contribute to open-source projects that shape the industry's future.
The role requires a blend of technical expertise in distributed systems, deep learning, and leadership skills. You'll work with technologies like PyTorch, TensorFlow, and various deep learning frameworks while helping build and maintain the infrastructure that powers LinkedIn's AI initiatives. This position offers the chance to tackle complex technical challenges while working with a talented team at the forefront of AI technology.