Senior Software Engineer (ML Inference and Performance)

Baseten is a growing team of builders backed by top-tier investors, providing ML infrastructure for enterprises and AI-native companies.
$150,000 - $250,000
Machine Learning
Senior Software Engineer
Hybrid
11 - 50 Employees
3+ years of experience

Description For Senior Software Engineer (ML Inference and Performance)

Baseten is seeking a Senior Software Engineer focused on ML inference and performance to join their dynamic team. This role is ideal for someone who is passionate about advancing AI frontiers and who thrives in a fast-paced startup environment.

Key responsibilities include:

  • Implementing and refining cutting-edge techniques for ML model inference and infrastructure
  • Deep diving into codebases of TensorRT, PyTorch, Transformers, CUDA, and other libraries
  • Applying and scaling optimization techniques across various ML models, especially large language models
  • Collaborating with a diverse team to design and implement innovative solutions
  • Owning projects from idea to production

The ideal candidate should have:

  • 3-10 years of experience with programming languages like Python, C++, or Go
  • Deep understanding of software engineering principles and AI/ML inference solutions
  • Strong familiarity with ML libraries, especially PyTorch, TensorRT, or TensorRT-LLM
  • Experience with Docker and Kubernetes
  • Deep understanding of GPU architecture

Bonus points for:

  • Proficiency in enhancing LLM performance
  • Familiarity with LLM optimization techniques
  • Experience with CUDA or similar technologies

Baseten offers a competitive compensation package, including unlimited PTO, 401k, and covered healthcare premiums. They provide a unique opportunity to be part of a rapidly growing startup in an exciting engineering field, fostering an inclusive work culture that supports learning and growth.

The company's tech stack includes AWS, Kubernetes, Istio, Knative, and the Prometheus stack for infrastructure; Python, Django, Postgres, and Redis for the backend; and React, TypeScript, and GraphQL for the frontend.

Baseten is committed to fostering a diverse and inclusive workplace, providing equal employment opportunities to all employees and applicants.

Last updated 5 months ago

Responsibilities For Senior Software Engineer (ML Inference and Performance)

  • Implement, refine, and productionize cutting-edge techniques for ML model inference and infrastructure
  • Deep dive into underlying codebases of TensorRT, PyTorch, Transformers, CUDA, and other libraries to debug ML performance issues
  • Apply and scale optimization techniques across a wide range of ML models, particularly large language models
  • Collaborate with a diverse team to design and implement innovative solutions
  • Own projects from idea to production, from writing project specs to managing end-to-end feature implementation

Requirements For Senior Software Engineer (ML Inference and Performance)

  • 3-10 years of experience with programming languages like Python, C++, or Go
  • Deep understanding of software engineering principles and AI/ML inference solutions
  • Strong familiarity with ML libraries, especially PyTorch, TensorRT, or TensorRT-LLM
  • Experience with Docker and Kubernetes
  • Deep understanding of GPU architecture

Benefits For Senior Software Engineer (ML Inference and Performance)

  • Unlimited PTO
  • 401k
  • Covered healthcare premiums
  • Equity
