Distinguished Software Architect - Deep Learning and HPC Communications

World leader in accelerated computing, pioneering AI and digital twin technology.
$308,000 - $471,500
Distributed Systems
Principal Software Engineer
In-Person
5,000+ Employees
15+ years of experience
AI · Enterprise SaaS

Description For Distinguished Software Architect - Deep Learning and HPC Communications

NVIDIA, the pioneer in GPU technology and leader in AI and HPC, is seeking a Distinguished Software Architect for its GPU Communications Libraries and Networking team. This role focuses on delivering communication libraries such as NCCL, NVSHMEM, and UCX for Deep Learning and HPC applications. The position involves working on systems that scale to thousands of GPUs and optimizing communication performance between GPUs connected via high-speed interconnects. The ideal candidate will be an industry-recognized leader in HPC/DL communications with extensive experience in parallel programming, system architecture, and high-performance networking. They will be responsible for co-designing next-generation data center platforms, researching new communication technologies, and driving their adoption across application verticals. This is a unique opportunity to shape the future of AI and HPC communications at one of technology's most innovative companies. The role offers competitive compensation, including equity, and the chance to work with cutting-edge technology that powers everything from artificial intelligence to autonomous vehicles.

Last updated a month ago

Responsibilities For Distinguished Software Architect - Deep Learning and HPC Communications

  • Research new communication technologies and design features for communication libraries
  • Propose innovative solutions in HW and SW for next-gen platforms
  • Inspire changes based on quantitative data and technical analysis
  • Drive adoption of new communication technologies
  • Collaborate with DL researchers and customers
  • Keep up with the latest DL research

Requirements For Distinguished Software Architect - Deep Learning and HPC Communications

  • PhD in Computer Science, Computer Engineering, or a related field, or equivalent experience
  • 15+ years of relevant experience in academia or industry
  • Expert in HPC and parallel programming models (MPI, SHMEM)
  • Deep understanding of high performance networking
  • Strong knowledge of ML/DL fundamentals
  • Programming fluency with C or C++ for systems software development
  • Ability to work across different HW/SW teams and timezones

Benefits For Distinguished Software Architect - Deep Learning and HPC Communications

  • Equity

Jobs Related To NVIDIA Distinguished Software Architect - Deep Learning and HPC Communications

Distinguished Engineer – Data Center System Software Architect

Lead data center system software architecture at NVIDIA, overseeing end-to-end software stack for enterprise and cloud platforms.

Senior Software Architect - Data Center Systems

Senior Software Architect role at NVIDIA leading data center systems development for GPU-accelerated deep learning applications.

Query Engine Architect - Accelerated Apache Spark

Principal Software Engineer role at NVIDIA leading query optimization for Apache Spark GPU acceleration, offering competitive compensation and cutting-edge technology work.

Senior Networking Architect

Senior Networking Architect position at NVIDIA focusing on developing next-generation network solutions for AI and accelerated computing.

Senior Software Architect, Advanced Development

Senior Software Architect role at NVIDIA focusing on advanced development in network programmability, distributed systems, and data center technologies.