Senior High-Performance System Architect

NVIDIA is the world leader in accelerated computing, pioneering accelerated computing to tackle challenges no one else can solve.
Distributed Systems
Senior Software Engineer
Contact Company
5+ years of experience
AI · Enterprise SaaS
This job posting may no longer be active. You may be interested in these related jobs instead:
Senior HPC Performance Engineer

Senior HPC Performance Engineer role at NVIDIA focusing on GPU Communications Libraries and Networking, optimizing performance for deep learning and HPC applications.

Senior System Software Engineer, NCCL - Partner Enablement

Senior System Software Engineer position at NVIDIA focusing on NCCL partner enablement, combining distributed systems expertise with customer support for AI and HPC applications.

Senior Software Engineer, Google Cloud Dataproc

Senior Software Engineer position at Google Cloud Dataproc focusing on distributed systems, Apache Spark, and data analytics infrastructure.

Senior Software Developer

Senior Software Developer position at Oracle focusing on cloud infrastructure and distributed systems development, requiring 4+ years of experience and strong technical expertise.

Senior Software Engineer, Infrastructure Storage, Google Cloud

Senior Software Engineer position at Google Cloud focusing on infrastructure storage systems, requiring expertise in distributed systems and 5+ years of software development experience.

Description For Senior High-Performance System Architect

NVIDIA is seeking a highly motivated Senior High-Performance System Architect to join their team of experts and help shape the future of high-performance and ML / AI computing. The role involves defining the Infiniband and NVL system architecture end-to-end, researching solutions for next-generation large-scale high-performance computing clusters, and collaborating with cross-functional teams.

Key responsibilities include:

  • Defining Infiniband and NVL system architecture throughout all product life cycles
  • Researching solutions for large-scale high-performance computing clusters
  • Collaborating with cross-functional teams to ensure successful project execution

Requirements:

  • B.Sc, M.Sc, or Ph.D in Computer Science, Computer Engineering, or Electrical Engineering
  • 5+ years of industry or research experience in computer networks
  • Excellent understanding of large-scale networks behavior and distributed computing workloads
  • Experience in developing simulation environments
  • Strong managerial, problem-solving, and critical thinking skills

Preferred qualifications:

  • Knowledge of network protocols (InfiniBand, IP, TCP, RoCE) and network topologies
  • Proficiency in Python and C++
  • Familiarity with HPC environments, routing algorithms, and simulation environments
  • Experience with AI workloads and communication libraries

NVIDIA offers the opportunity to work on cutting-edge technology and drive innovation in next-generation networks used by top researchers and engineers worldwide. The company is committed to fostering a diverse work environment and is an equal opportunity employer.

Last updated 7 months ago

Responsibilities For Senior High-Performance System Architect

  • Define Infiniband and NVL system architecture end-to-end
  • Research solutions for next-generation large-scale high-performance computing clusters
  • Collaborate with cross-functional teams to ensure successful project execution

Requirements For Senior High-Performance System Architect

Python
  • B.Sc, M.Sc, or Ph.D in Computer Science, Computer Engineering, or Electrical Engineering
  • 5+ years of industry or research experience in computer networks
  • Excellent understanding of large-scale networks behavior and distributed computing workloads
  • Experience in development of simulation environments
  • Strong managerial, problem-solving, and critical thinking skills
  • Ability to work in a highly dynamic environment

Interested in this job?