Deep Learning Engineer, Datacenters

NVIDIA is the world leader in accelerated computing, pioneering solutions in AI and digital twins.
Machine Learning
Mid-Level Software Engineer
Hybrid
3+ years of experience
AI · Enterprise SaaS

Description For Deep Learning Engineer, Datacenters

NVIDIA, the world leader in accelerated computing, is seeking a Deep Learning Engineer for their Datacenter team. This role is crucial in optimizing datacenter deployments and establishing data-driven approaches to hardware design and system software development. The position offers an opportunity to work with cutting-edge AI technologies and influence the development of high-performance datacenters.

The role involves collaborating with diverse teams, from DL research to CUDA Kernel development and Silicon Architecture. You'll be working on characterizing and analyzing Deep Learning applications, developing cost-efficient datacenter architectures for Large Language Models, and creating analysis tools for performance metrics.

The ideal candidate should have a strong background in Computer Science or Electrical Engineering, with experience in system software, GPU kernels, or DL frameworks. Proficiency in C/C++ and Python is essential, and familiarity with containerization platforms like Docker and workload managers like Slurm is valuable.

At NVIDIA, you'll be part of a forward-thinking team that's shaping the future of AI infrastructure. The company offers a collaborative environment where you can work with experts in various domains and contribute to next-generation systems and Deep Learning Software Stack development. This is an excellent opportunity for someone passionate about system architecture, performance optimization, and artificial intelligence.

Last updated 13 minutes ago

Responsibilities For Deep Learning Engineer, Datacenters

  • Develop software infrastructure to characterize and analyze Deep Learning applications
  • Evolve cost-efficient datacenter architectures for Large Language Models (LLMs)
  • Develop analysis and profiling tools in Python, bash and C++ to measure performance metrics
  • Analyze system and software characteristics of DL applications
  • Develop analysis tools and methodologies to measure key performance metrics

Requirements For Deep Learning Engineer, Datacenters

Python
Linux
Kubernetes
  • Bachelor's degree in Electrical Engineering or Computer Science with 3+ years experience (Masters or PhD preferred)
  • Experience in System Software (Linux, Compilers, GPU kernels, DL Frameworks) or Silicon Architecture
  • Programming experience in C/C++ and Python
  • Ability to work in virtual environments
  • Experience with containerization platforms and datacenter workload managers is a plus

Interested in this job?

Jobs Related To NVIDIA Deep Learning Engineer, Datacenters

Applied Machine Learning Engineer

Applied Machine Learning Engineer position at Design Pickle, focusing on AI-driven applications and data solutions for creative services.

Prompt Engineer /Python

Remote Prompt Engineer/Python position at Oowlish, focusing on LLM development, prompt engineering, and API development with competitive benefits and international project exposure.

Machine Learning Engineer

Join Ideogram as a Machine Learning Engineer to build state-of-the-art AI models for creative expression, working with world-renowned AI experts in Toronto.

Machine Learning Ops Engineer

Machine Learning Ops Engineer position at Callsign in Abu Dhabi, focusing on ML infrastructure and operations with Python, Kubernetes, and AWS.

Software Engineer (AI focused)

AI-focused Software Engineer role at ResQ, developing LLM solutions for restaurant maintenance platform, requiring 3-5 years experience with emphasis on AI/ML technologies.