We are seeking a Principal Software Engineer to develop and deploy NVIDIA Inference Microservice (NIM) blueprints. NVIDIA NIM™ Agent Blueprints are reference workflows for canonical generative AI use cases. Enterprises can build and operationalize custom AI applications, creating data-driven AI flywheels, using NIM Agent Blueprints along with NVIDIA NIM microservices and the NVIDIA NeMo framework, all part of the NVIDIA AI Enterprise Platform.
This role offers an outstanding opportunity to craft the future of AI at a fast-growing company at the forefront of the AI revolution. Join our team of world-class software engineers and partners to deliver the most sophisticated models and solutions with lightning-fast inference. You'll work on the most powerful, enterprise-grade GPU clusters capable of hundreds of PetaFLOPS and gain early access to unreleased hardware, making a direct impact on NVIDIA's roadmap and the broader AI landscape.
Responsibilities:
- Build AI applications and services
- Design, build, and deploy NIM blueprints using NVIDIA NeMo and NVIDIA NIM microservices in a cloud-native environment
- Drive adoption and scale blueprint development by building reusable foundation blocks
- Apply cloud-native development and deployment expertise to create optimized patterns for NIM blueprints
- Collaborate, brainstorm, and improve the designs of NIM blueprints with stakeholders from across the organization
- Collaborate effectively with peers and partners to find opportunities and distill requirements, propose technical strategy and architecture, and tackle exciting technical problems
- Mentor and collaborate with team members and other teams to foster growth and development
Requirements:
- Experience building AI applications and services
- Cloud-native software development and deployment experience
- A degree in Computer Science, Computer Engineering, or a related field (BS or MS) or equivalent experience
- 15+ years of relevant, proven experience
- Strong background in software design and implementation
- Passion for building scalable and performant inference applications
- Hands-on experience developing and deploying high-quality, highly distributed cloud-based RESTful web services
- Passion for extending your technical knowledge into new areas
- Strong analytical skills and proven success in problem-solving and achieving performance objectives
- Mentorship experience and the ability to grow teams and team members
Preferred Qualifications:
- MS or PhD in Computer Science or an equivalent technical field
- 10+ years of experience building end-to-end AI services and deploying them into production
- 5+ years of experience with cloud-native technologies such as Kubernetes
NVIDIA offers competitive salaries and a generous benefits package, and is widely considered one of the technology world's most desirable employers. Join us to be part of the AI revolution!