Distinguished Software Architect - Deep Learning and HPC Communications

World leader in accelerated computing, pioneering AI and digital twins technology.
$308,000 - $471,500
Distributed Systems
Principal Software Engineer
In-Person
5,000+ Employees
15+ years of experience
AI · Enterprise SaaS

Description For Distinguished Software Architect - Deep Learning and HPC Communications

NVIDIA, the pioneer in GPU technology and accelerated computing, is seeking a Distinguished Software Architect to join their GPU Communications Libraries and Networking team. This role represents a unique opportunity to shape the future of deep learning and high-performance computing communications infrastructure.

The position involves working on cutting-edge communication libraries like NCCL, NVSHMEM, and UCX for Deep Learning and HPC applications that operate at massive scale, spanning thousands of GPUs. The successful candidate will be responsible for co-designing next-generation data center platforms, pushing the boundaries of what's possible in GPU communications and networking technology.

The role requires deep expertise in HPC, parallel programming models, and high-performance networking, combined with strong understanding of GPU architecture and CUDA. The ideal candidate will have 15+ years of experience and be recognized as an industry leader in HPC/DL communications, with a track record of patents, publications, and conference presentations.

Working at NVIDIA means joining one of technology's most desirable employers, with access to some of the industry's brightest minds. The position offers competitive compensation, including a base salary range of $308,000 - $471,500, plus equity and comprehensive benefits. This is an opportunity to directly impact the future of AI and HPC while working with state-of-the-art technology in a collaborative, innovation-driven environment.

The role is based in Santa Clara, CA, where you'll be at the heart of NVIDIA's technical innovation. You'll collaborate with diverse teams across hardware and software, driving the adoption of new communication technologies and helping to realize NVIDIA's vision of transforming computing and artificial intelligence.

Last updated 5 days ago

Responsibilities For Distinguished Software Architect - Deep Learning and HPC Communications

  • Research new communication technologies and design new features for communication libraries
  • Propose innovative solutions in HW and SW for next-gen platforms
  • Co-design solutions with GPU, Networking, and SW architects
  • Inspire changes based on quantitative data and technical analysis
  • Drive adoption of new communication technologies across application verticals
  • Collaborate with DL researchers and customers

Requirements For Distinguished Software Architect - Deep Learning and HPC Communications

Python
  • PhD in Computer Science, Computer Engineering or related field or equivalent experience
  • 15+ years of relevant experience in academia or industry
  • Expert in HPC, parallel programming models, communication runtime
  • Deep understanding of high performance networking
  • Strong knowledge of ML/DL fundamentals
  • Programming fluency with C or C++
  • Ability to work across different HW/SW teams and timezones

Benefits For Distinguished Software Architect - Deep Learning and HPC Communications

Equity
Medical Insurance
  • Equity compensation
  • Comprehensive benefits package
  • Opportunity to work on cutting-edge technology

Interested in this job?

Jobs Related To NVIDIA Distinguished Software Architect - Deep Learning and HPC Communications

Senior System Software Architect, HPC Networking

Senior System Software Architect role at NVIDIA focusing on HPC Networking, developing next-gen InfiniBand and NVLink systems for advanced compute clusters.

Senior Software Research Architect

Lead the advancement of AI data centers and networks at NVIDIA as a Senior Software Research Architect, focusing on cloud solutions and networking technologies.

Distinguished Engineer – Data Center System Software Architect

Distinguished Engineer role focusing on data center system software architecture at NVIDIA, leading technical innovation for enterprise solutions.

Senior Software Architect, AI Networking

Senior Software Architect position at NVIDIA focusing on AI Networking and data center technologies, requiring expertise in network technologies and system architecture.

Principal Software Architect, GPU Networking Research

Principal Software Architect position at NVIDIA focusing on GPU Networking Research, requiring 15+ years of experience in systems architecture and expertise in AI data center technologies.