dltHub is a company based in Berlin and New York City that develops an open-source library for turning messy, unstructured data into organized datasets. The library works with popular SQL and vector stores, data lakes, storage buckets, and local engines such as DuckDB, Arrow, and delta-rs. As a Senior Data Engineer, you'll work directly with the CTO as part of the core product team, focusing on the Composable Data Stack ecosystem.
The role involves designing and implementing features that bridge the traditional Modern Data Stack and the emerging Pythonic Composable Data Stack. You'll be responsible for integrating query engines, transformation frameworks, and table formats while ensuring user satisfaction. The position requires deep expertise in Python data processing libraries and a strong understanding of both the Modern Data Stack and Composable Data Stack ecosystems.
The company offers a collaborative environment that balances office work with flexibility for deep focus. Benefits include learning opportunities, equity through an ESOP, and other perks. The company is backed by notable investors and by technical founders from companies such as Datadog, Hugging Face, and MotherDuck.
This role suits someone who is passionate about data engineering, has strong Python skills, and wants to contribute to the evolution of data processing tools while working with a talented team in Berlin.