Distinguished Software Architect - Deep Learning and HPC Communications

World leader in accelerated computing, pioneering AI and digital twins technology.
$308,000 - $471,500
Distributed Systems
Principal Software Engineer
In-Person
5,000+ Employees
15+ years of experience
AI · Enterprise SaaS

Description For Distinguished Software Architect - Deep Learning and HPC Communications

NVIDIA, the pioneer in GPU technology and accelerated computing, is seeking a Distinguished Software Architect to join their GPU Communications Libraries and Networking team. This role focuses on developing cutting-edge communication libraries for Deep Learning and HPC applications that scale to thousands of GPUs. The position combines expertise in high-performance computing, deep learning, and system architecture to push the boundaries of what's possible in data center platforms.

The role involves working with state-of-the-art technologies including NVLink, PCIe, and high-speed networking solutions like Infiniband and Ethernet. The successful candidate will be instrumental in co-designing next-generation platforms, collaborating with GPU, Networking, and Software architects to deliver unprecedented solutions. They will drive innovation in communication technologies, particularly expanding the GPUDirect technology portfolio.

This is an exceptional opportunity for an industry leader with deep expertise in HPC and deep learning communications. The position requires a blend of technical excellence, strategic thinking, and collaborative skills to work across different hardware and software teams. The role offers competitive compensation, including a substantial base salary range and equity benefits, reflecting NVIDIA's position as one of technology's most desirable employers.

Candidates should bring extensive experience in parallel programming models, communication runtimes, and system architecture, along with a proven track record of industry leadership through patents, publications, and conference presentations. The role represents a unique chance to shape the future of AI and HPC communications at a company that's driving innovation in artificial intelligence, autonomous vehicles, and high-performance computing.

Last updated 3 days ago

Responsibilities For Distinguished Software Architect - Deep Learning and HPC Communications

  • Research new communication technologies and design features for communication libraries
  • Propose innovative solutions in HW and SW for next-gen platforms
  • Drive adoption of new communication technologies across application verticals
  • Keep up with latest DL research and collaborate with diverse teams
  • Co-design solutions with GPU, Networking, and SW architects
  • Inspire changes based on quantitative data and technical analysis

Requirements For Distinguished Software Architect - Deep Learning and HPC Communications

Python
Linux
Kubernetes
  • PhD in Computer Science, Computer Engineering or related field or equivalent experience
  • 15+ years of relevant experience in academia or industry
  • Expert in HPC, parallel programming models (MPI, SHMEM)
  • Deep understanding of high performance networking
  • Strong knowledge of ML/DL fundamentals
  • Programming fluency with C or C++
  • Experience with network technologies (Infiniband, Ethernet)
  • Ability to work across different HW/SW teams and timezones

Benefits For Distinguished Software Architect - Deep Learning and HPC Communications

Equity
Medical Insurance
  • Equity
  • Medical Insurance

Interested in this job?

Jobs Related To NVIDIA Distinguished Software Architect - Deep Learning and HPC Communications

Distinguished Engineer – Data Center System Software Architect

Lead system software architecture for NVIDIA's data center systems, working with cutting-edge GPU and AI technologies.

Senior Networking Architect

Senior Networking Architect role at NVIDIA focusing on developing next-generation network solutions for AI computing, involving ASIC design and advanced networking protocols.

Senior Software Architect - Data Center Systems

Senior Software Architect role at NVIDIA leading data center systems development for GPU-accelerated deep learning applications.

Query Engine Architect - Accelerated Apache Spark

Principal Software Engineer role at NVIDIA leading query optimization for Apache Spark GPU acceleration, offering competitive compensation and cutting-edge technology work.

Senior Networking Architect

Senior Networking Architect position at NVIDIA focusing on developing next-generation network solutions for AI and accelerated computing.