NVIDIA, the world leader in accelerated computing, is seeking a Deep Learning Engineer for its Datacenter team. The role is central to optimizing datacenter deployments and establishing data-driven approaches to hardware design and system software development. It offers an opportunity to work with cutting-edge AI technologies and influence the development of high-performance datacenters.
The role involves collaborating with diverse teams, from DL research to CUDA kernel development and silicon architecture. You'll characterize and analyze Deep Learning applications, develop cost-efficient datacenter architectures for Large Language Models, and build analysis tools for performance metrics.
The ideal candidate has a strong background in Computer Science or Electrical Engineering, with experience in system software, GPU kernels, or DL frameworks. Proficiency in C/C++ and Python is essential, and familiarity with containerization platforms like Docker and workload managers like Slurm is valuable.
At NVIDIA, you'll be part of a forward-thinking team that's shaping the future of AI infrastructure. The company offers a collaborative environment where you can work with experts in various domains and contribute to next-generation systems and Deep Learning Software Stack development. This is an excellent opportunity for someone passionate about system architecture, performance optimization, and artificial intelligence.