NVIDIA, the pioneer in accelerated computing and AI technology, is seeking a Senior Software Engineer specialized in Deep Learning Inference. This role is at the forefront of AI innovation, working with the latest generative AI models and LLMs. The position involves optimizing performance at all stack levels, from server-level request batching to GPU kernel fusion. You'll be working with cutting-edge technology, collaborating with research teams to implement and optimize AI runtimes, and developing sophisticated software solutions. The ideal candidate combines strong software engineering principles with deep ML knowledge and performance optimization expertise. NVIDIA offers a collaborative environment working with some of the most forward-thinking professionals in the technology field. The role provides an opportunity to impact the future of AI computing while working with state-of-the-art hardware and software. The company is committed to diversity and inclusion, fostering an environment where creativity and autonomy are highly valued. This position is perfect for those passionate about pushing the boundaries of AI technology and performance optimization.