AI Research Engineer – Model Compression (all seniority levels)

Axelera AI

European Series B startup revolutionizing AI with in-memory computing platform for high-performance inference

Milan, Metropolitan City of Milan, Italy • Florence, Metropolitan City of Florence, Italy • Bologna, Metropolitan City of Bologna, Italy

$80,000 - $150,000

Machine Learning

Senior Software Engineer

Hybrid

51 - 100 Employees

5+ years of experience

AI · Enterprise SaaS

Description For AI Research Engineer – Model Compression (all seniority levels)

Axelera AI, a European Series B startup, is revolutionizing the AI landscape with their innovative in-memory computing platform. They are seeking an AI Research Engineer specialized in model compression to join their dynamic team. This role focuses on developing cutting-edge compression techniques for Generative AI models, optimizing them for real-time inference across various environments, from edge computing to server-side deployments.

The position offers a unique opportunity to work at the intersection of advanced machine learning, in-memory computing, and high-performance AI inference. The ideal candidate will be responsible for developing and implementing sophisticated model compression techniques while maintaining or improving model accuracy. They will work closely with cross-functional teams to integrate optimizations into the AI platform.

Key responsibilities include designing compression techniques like pruning and quantization, performance tuning for high-throughput inference, and staying current with the latest research developments. The role requires expertise in deep learning frameworks, experience with model optimization, and strong understanding of AI/ML concepts.

The position is based in Italy, with options to work from Milan, Florence, or Bologna. Axelera AI offers competitive compensation, including equity options, and supports relocation for international talent. The company promotes a diverse, inclusive environment and provides significant growth opportunities as part of a fast-growing Series B startup.

Last updated 19 days ago

Responsibilities For AI Research Engineer – Model Compression (all seniority levels)

Design and implement advanced model compression techniques such as pruning, quantization, weight sharing, and knowledge distillation
Optimize compressed models for high-throughput and low-latency inference
Collaborate with AI researchers, software engineers, and hardware engineers
Stay current with latest AI and model compression research developments
Implement best practices for model testing, deployment, and continuous improvement

Requirements For AI Research Engineer – Model Compression (all seniority levels)

Python

Experience in model compression, including pruning, quantization, low-rank factorization, and knowledge distillation
Expertise in deep learning frameworks (TensorFlow, PyTorch, or JAX)
Experience optimizing models for resource-constrained environments
Strong understanding of deep learning algorithms and neural networks
Strong understanding of AI/ML research advancements in compression and distillation
Ability to work in collaborative, fast-paced startup environment
PhD or advanced degree in Computer Science, Machine Learning, AI, or related fields preferred

Benefits For AI Research Engineer – Model Compression (all seniority levels)

Equity

Relocation Benefits

Competitive salary
Equity options
Relocation Benefits

Axelera AI

European Series B startup revolutionizing AI with in-memory computing platform for high-performance inference

Milan, Metropolitan City of Milan, Italy • Florence, Metropolitan City of Florence, Italy • Bologna, Metropolitan City of Bologna, Italy

$80,000 - $150,000

Machine Learning

Senior Software Engineer

Hybrid

51 - 100 Employees

5+ years of experience

AI · Enterprise SaaS

Interested in this job?

Jobs Related To Axelera AI AI Research Engineer – Model Compression (all seniority levels)

AI Research Engineer – Data Generation & Optimization (all seniority levels)

Axelera AI

AI Research Engineer position at Axelera AI focusing on data generation, selection, and optimization for cutting-edge AI models and systems.

AI Research Engineer – Data Generation & Optimization

Axelera AI

AI Research Engineer role at Axelera AI focusing on model compression and optimization for high-performance inference systems.

Sr. Machine Learning Engineer, Routing and Planning

Amazon

Senior Machine Learning Engineer position at Amazon focusing on AI solutions for last-mile delivery optimization and routing

Solution Engineer - IS&T AI & Data Platforms

Apple

Senior Solutions Engineer role at Apple focusing on enterprise GenAI strategy and AI/ML platforms, offering competitive compensation and benefits.

AIML - Senior ML Engineer - Siri & Information Intelligence

Apple

Senior ML Engineer position at Apple working on Siri's Geo domain team, focusing on search ranking and query understanding using deep learning models.