Senior DevTech Engineer - Windows LLM and GenAI Open-Source Ecosystem

NVIDIA is the world leader in accelerated computing, pioneering visual computing and GPU technology.
Machine Learning
Senior Software Engineer
Hybrid
5+ years of experience
AI · Enterprise SaaS

Description For Senior DevTech Engineer - Windows LLM and GenAI Open-Source Ecosystem

NVIDIA, a pioneer in visual computing and GPU technology, is seeking a Senior DevTech Engineer to join their team focusing on Windows LLM and GenAI Open-Source Ecosystem. This role sits at the intersection of cutting-edge AI technology and GPU computing, where you'll work on enabling Windows AI enthusiasts and developers with innovative models and functionality.

The position involves working with large language models (LLMs) and Generative AI, contributing to open-source projects like PyTorch and llama.cpp, and optimizing performance on NVIDIA RTX platforms. You'll be responsible for improving user experience, solving deployment challenges, and working closely with both internal teams and external partners.

As an ideal candidate, you bring 5+ years of experience in GPU deployment and optimization, strong programming skills in C/C++ and Python, and deep understanding of transformer architectures and LLMs. Your role will be crucial in shaping the future of AI deployment on Windows platforms and influencing next-generation GPU features.

NVIDIA offers a competitive compensation package and a work environment that promotes diversity, inclusion, and flexibility. You'll be part of a team that's driving innovation in AI, High-Performance Computing, and Visualization, working on technology that's transforming industries and society.

This position offers an exciting opportunity to work at the forefront of AI technology, combining technical expertise with practical application while collaborating with industry leaders and innovative partners. Join NVIDIA to help shape the future of AI computing and make a significant impact in the field of machine learning and generative AI.

Last updated a month ago

Responsibilities For Senior DevTech Engineer - Windows LLM and GenAI Open-Source Ecosystem

  • Improve Windows LLM & GenAI user experience on NVIDIA RTX
  • Engage with internal product teams and external OSS maintainers
  • Work on solving local end-to-end LLM & Generative AI GPU deployment challenges
  • Apply profiling and debugging tools for analyzing GPU-accelerated AI applications
  • Conduct trainings, develop sample code and host presentations
  • Guide developers on efficient adoption of DL frameworks
  • Collaborate with GPU driver and architecture teams

Requirements For Senior DevTech Engineer - Windows LLM and GenAI Open-Source Ecosystem

Python
Linux
  • 5+ years of professional experience in local GPU deployment, profiling and optimization
  • BS or MS degree in Computer Science, Engineering, or related degree
  • Strong proficiency in C/C++, Python, software design
  • Familiarity with Windows operating system
  • Understanding of Transformer architectures and LLMs
  • Experience with open-source LLM and GenAI software
  • Experience with CUDA and NVIDIA's Nsight GPU profiling
  • Strong verbal and written communication skills in English
  • Excellent interpersonal skills
  • Willingness to travel for conferences and partner visits

Benefits For Senior DevTech Engineer - Windows LLM and GenAI Open-Source Ecosystem

  • Competitive salaries
  • Extensive benefits package

Interested in this job?

Jobs Related To NVIDIA Senior DevTech Engineer - Windows LLM and GenAI Open-Source Ecosystem

Senior Software Engineer - Conversational AI

Senior Software Engineer position at NVIDIA focusing on building next-generation Conversational AI systems and Digital Human solutions using advanced Speech and LLM models.

Senior Software Engineer, Deep Learning Inference

Senior Software Engineer role at NVIDIA focusing on optimizing deep learning inference performance and implementing AI runtime solutions.

Senior System Software Engineer, Deep Learning Accelerator

Senior System Software Engineer role at NVIDIA focusing on Deep Learning Accelerator development, requiring 7+ years of experience in low-level software development and system architecture.

Deep Learning Engineer, End-to-end - Autonomous Driving

Senior Deep Learning Engineer position at NVIDIA focusing on end-to-end autonomous driving solutions, combining AI expertise with automotive technology.

Senior Software Engineer, TensorRT-LLM

Senior Software Engineer position at NVIDIA focusing on TensorRT-LLM development, requiring expertise in C++, deep learning, and AI inferencing optimization.