NVIDIA, a pioneer in visual computing and GPU technology, is seeking a Senior DevTech Engineer to join its team focused on the Windows LLM and GenAI open-source ecosystem. This role sits at the intersection of cutting-edge AI technology and GPU computing, where you'll work on bringing innovative models and functionality to Windows AI enthusiasts and developers.
The position involves working with large language models (LLMs) and generative AI, contributing to open-source projects like PyTorch and llama.cpp, and optimizing performance on NVIDIA RTX platforms. You'll be responsible for improving the user experience, solving deployment challenges, and working closely with both internal teams and external partners.
As an ideal candidate, you bring 5+ years of experience in GPU deployment and optimization, strong programming skills in C/C++ and Python, and a deep understanding of transformer architectures and LLMs. Your role will be crucial in shaping the future of AI deployment on Windows platforms and influencing next-generation GPU features.
NVIDIA offers a competitive compensation package and a work environment that promotes diversity, inclusion, and flexibility. You'll be part of a team that's driving innovation in AI, High-Performance Computing, and Visualization, working on technology that's transforming industries and society.
This position offers an opportunity to work at the forefront of AI technology, combining technical expertise with practical application while collaborating with industry leaders and innovative partners. Join NVIDIA to make a significant impact in machine learning and generative AI.