Baseten, a leading AI deployment infrastructure company, is seeking a Senior Machine Learning Engineer specializing in Fine-Tuning. Recently securing $75 million in Series C funding, Baseten is trusted by prominent enterprises like Descript, Bland.ai, Patreon, and Writer for their production AI workloads.
The role combines deep technical expertise in foundation model adaptation with customer-facing responsibilities. You'll be creating value by leveraging Baseten's infrastructure to fine-tune large language models and other modalities, working directly with customers to achieve their specific goals. This position requires both technical prowess in model training and the ability to translate customer requirements into effective solutions.
Key responsibilities include designing comprehensive fine-tuning strategies, developing tools for non-ML experts, implementing scalable pipelines, and utilizing state-of-the-art parameter-efficient techniques like LoRA and QLoRA. You'll also help shape the product roadmap by identifying common patterns in customer requirements and developing reusable components.
The ideal candidate brings 3+ years of ML engineering experience, strong background in model training and fine-tuning, and excellent communication skills. Experience with advanced frameworks like Axolotl, Transformers, and PyTorch Lightning is essential. Knowledge of distributed training systems, RLHF, and other emerging alignment methods is highly valued.
Benefits include competitive compensation ($150K-$225K), equity, comprehensive healthcare coverage, flexible PTO, and the opportunity to work with cutting-edge AI technology in a rapidly growing startup. Join Baseten to be part of shaping the future of AI deployment and accessibility across all products.