World Labs is at the forefront of artificial intelligence innovation, developing Large World Models that extend AI capabilities beyond traditional language and 2D visual processing into complex 3D environments. As a Model Optimization Engineer, you'll play a crucial role in bridging the gap between cutting-edge research and production-ready systems. The position demands expertise in PyTorch, CUDA programming, and deep learning model optimization techniques.
You'll be responsible for transforming research code into highly optimized, low-latency inference solutions, working with state-of-the-art models and technologies. The role requires both technical excellence and collaborative skills, as you'll work closely with research teams to implement optimizations while maintaining model accuracy.
The ideal candidate brings 3+ years of experience in deep learning model optimization, with expert-level knowledge in PyTorch and CUDA programming. You should be comfortable with model quantization, inference frameworks, and multi-GPU deployment strategies. Additional experience with custom CUDA kernel development and high-performance serving frameworks like Triton or vLLM is highly valued.
World Labs offers an environment where fearless innovation, resilience, and collaboration are celebrated. You'll be part of a global team working on technology that will fundamentally change how machines perceive and interact with the world. This is an opportunity to make a significant impact in the field of AI while working with some of the brightest minds in the industry.
If you're passionate about pushing the boundaries of AI technology and want to contribute to developing spatially intelligent AI systems that will shape the future, this role at World Labs could be your next career-defining move.