Senior Architect - Server Performance

World leader in accelerated computing, pioneering AI and digital twins technology.
Backend
Principal Software Engineer
In-Person
10+ years of experience
AI · Enterprise SaaS
This job posting may no longer be active. You may be interested in these related jobs instead:
Principal Software Engineer, RTL Optimization Tools

Principal Software Engineer position at NVIDIA developing RTL optimization tools, combining parallel computing, machine learning, and algorithm development in C++.

Principal Planning Software Engineer - Autonomous Vehicles

Principal Software Engineer role at NVIDIA focusing on autonomous vehicle planning systems, requiring 12+ years of experience and strong C++ skills.

Principal Software Engineer - Enterprise AI Platform

Principal Software Engineer role at NVIDIA focusing on Enterprise AI Platform development, requiring 15+ years of experience in cloud and distributed systems.

Principal Software Engineer - Microsoft Threat Protection

Principal Software Engineer role at Microsoft Security, focusing on cyber defense solutions and cloud-native services, requiring 12+ years of experience in distributed systems.

Principal Software Engineer

Principal Software Engineer role at Microsoft's Turing Team, building core AI systems for Microsoft 365 Chat Copilot and enterprise-scale language models.

Description For Senior Architect - Server Performance

NVIDIA, the pioneer in GPU technology and accelerated computing, is seeking a Senior Architect for Server Performance. This role is crucial for shaping the next generation of AI server systems. The position combines hardware architecture expertise with deep understanding of AI workloads, requiring a seasoned professional with 10+ years of experience. The ideal candidate will work at the intersection of hardware and software, analyzing complex workloads, optimizing system architecture, and driving performance improvements.

The role offers an opportunity to work with cutting-edge technology in AI and high-performance computing. You'll be responsible for evaluating and enhancing server performance for AI training, inference, and HPC applications. This position requires expertise in hardware architecture, strong programming skills, and deep understanding of performance optimization.

Working at NVIDIA means joining one of technology's most desirable employers, known for innovation in AI, gaming, and parallel computing. The company values creativity, autonomy, and technical excellence. You'll be part of a forward-thinking team that's revolutionizing computing technology and driving advancements in artificial intelligence.

This is an ideal opportunity for an experienced architect who wants to impact the future of AI computing infrastructure. You'll collaborate with world-class engineers and scientists, working on technologies that are transforming industries and advancing scientific discovery.

Last updated 5 months ago

Responsibilities For Senior Architect - Server Performance

  • Analyze workloads of interest on existing NVIDIA server platforms
  • Collaborate with cross-functional teams to define performance metrics and benchmarks
  • Conduct thorough performance evaluations and identify bottlenecks
  • Optimize system architecture to meet performance and scalability goals
  • Work closely with software and hardware teams
  • Act as a subject matter expert on system performance

Requirements For Senior Architect - Server Performance

Python
  • BE/BTech or MS/MTech in relevant area (PhD is a plus)
  • 10+ years of relevant experience in hardware architecture (CPU, GPU, caches, memory subsystem, PCIe)
  • Experience in deep learning training/inference, high performance computing, or cloud computing
  • Familiarity with performance tools and methodologies
  • Programming experience in C/C++/Python
  • Ability to work with large and unfamiliar software repositories
  • Experience in high performance networking (Infiniband, RoCE)
  • Familiarity with CUDA and GPU programming model

Interested in this job?