NVIDIA is seeking a Senior Performance Software Engineer to join their Deep Learning Libraries team. This role focuses on developing optimized code to accelerate linear algebra and deep learning operations on NVIDIA GPUs. The position involves working with cutting-edge technologies like cuDNN, cuBLAS, and TensorRT libraries to accelerate deep learning models. The successful candidate will be part of a team that enables breakthroughs in image classification, speech recognition, and natural language processing.
The role requires expertise in performance tuning parallel algorithms and developing highly optimized code for GPU efficiency. You'll be working at the lower levels of the deep learning software stack, directly interfacing with GPU hardware. The position offers the opportunity to collaborate across multiple NVIDIA teams, including the CUDA compiler team, deep learning performance teams, and hardware architecture teams.
NVIDIA is recognized as one of the most desirable employers in the technology sector, offering competitive compensation including a base salary range of $184,000 - $356,500 USD, plus equity and benefits. The company values creativity, autonomy, and problem-solving abilities, making it an ideal environment for those passionate about pushing the boundaries of deep learning and GPU computing.
The role is perfect for candidates with strong C++ programming skills, experience in parallel programming, and a deep understanding of computer architecture. Additional expertise in CUDA/OpenCL GPU programming, numerical methods, and linear algebra would be particularly valuable. Join NVIDIA in building the future of AI and accelerated computing technology.