Senior Software Engineer - Conversational AI

NVIDIA is the world leader in accelerated computing, pioneering solutions in AI and digital twins.
Machine Learning
Senior Software Engineer
In-Person
10+ years of experience
AI · Enterprise SaaS

Description For Senior Software Engineer - Conversational AI

NVIDIA, a leader in High-Performance Computing and AI, is seeking a Senior Software Engineer to join their Conversational AI team. This role focuses on building next-generation Multimodal Conversational AI systems powered by advanced Speech and LLM models. The position involves developing Digital Human solutions and creating seamless experiences through AI Agents.

The ideal candidate will work on architecting and implementing conversation pipelines, speech-to-speech models, and multi-modal conversation frameworks. They will be responsible for analyzing and improving RAG systems and conversational AI agents while ensuring optimal performance across platforms.

This is an opportunity to work with cutting-edge technology in AI and contribute to groundbreaking developments in digital human interactions. The role requires expertise in both speech technologies and large language models, combined with strong software engineering skills.

NVIDIA offers a collaborative environment at the forefront of technological advancement, where you'll work with world-class teams on projects that impact various industries. The company is committed to fostering diversity and maintains an inclusive work environment, making it an ideal place for innovative minds to thrive.

Join NVIDIA to be part of a team that's shaping the future of AI and human-computer interaction, while working on technologies that power everything from self-driving cars to robotics and intelligent assistants.

Last updated 4 months ago

Responsibilities For Senior Software Engineer - Conversational AI

  • Architect and implement reliable low latency full duplex conversation pipelines and dialog systems
  • Build and benchmark cascaded and unified speech-to-speech models and systems
  • Design and implement domain specific agents and workflows for multi-turn, multi-modal conversations
  • Analyze RAG and conversational AI agent end to end accuracy
  • Characterize performance and quality metrics across platforms
  • Collaborate on new product features and improvements
  • Participate in code review, design documents, and test plan reviews

Requirements For Senior Software Engineer - Conversational AI

Python
Kubernetes
  • Bachelor's or Master's degree in Computer Science, Electrical Engineering, AI, or Applied Math
  • 10+ years of experience in Speech, LLM, RAG and Agents
  • Excellent programming skills in Python and/or C++
  • Deep understanding of Speech technologies (VAD, ASR, TTS, Translation)
  • Experience with RAG and LLM based applications
  • Experience with microservices deployment and protocols (HTTP REST, gRPC, WebSockets)
  • Familiarity with Docker, helm, kubernetes
  • Experience with Software lifecycle and CI/CD pipeline
  • Knowledge of version control and code review tools

Interested in this job?

Jobs Related To NVIDIA Senior Software Engineer - Conversational AI

Senior Applied LLM Engineer, AI – Chip Design

Senior Applied LLM Engineer position at NVIDIA focusing on AI and chip design, developing and optimizing retrieval and generation algorithms for enterprise data.

Senior Performance Engineer - Deep Learning

Senior Performance Engineer position at NVIDIA focusing on optimizing Deep Learning tools and frameworks.

Senior Math Libraries Engineer – AI and HPC

Senior Math Libraries Engineer role at NVIDIA focusing on AI and HPC, developing GPU-accelerated mathematical libraries for cutting-edge applications.

Senior AI Training Performance Engineer

Senior AI Training Performance Engineer role at NVIDIA, focusing on optimizing AI training workloads and implementing solutions across the hardware/software stack.

Senior Deep Learning Performance Architect

Senior Deep Learning Performance Architect position at NVIDIA focusing on developing and analyzing next-generation architectures for AI and high-performance computing applications.