Senior Data Engineer (Python)

An open-source library company that automatically creates datasets from messy, unstructured data sources, backed by Foundation Capital and Dig Ventures.
$80,000 - $120,000
Data
Senior Software Engineer
In-Person
11 - 50 Employees
5+ years of experience
Enterprise SaaS

Description For Senior Data Engineer (Python)

dltHub is an innovative company based in Berlin and New York City, focused on developing an open-source library that transforms messy, unstructured data into organized datasets. The library works with popular SQL and vector stores, data lakes, storage buckets, and local engines like DuckDB, Arrow, or delta-rs. As a Senior Data Engineer, you'll be working directly with the CTO as part of the core product team, focusing on the Composable Data Stack ecosystem.

The role involves designing and implementing features that bridge traditional Modern Data Stack with the emerging Pythonic Composable Data Stack. You'll be responsible for integrating various query engines, transformation frameworks, and table formats while ensuring user satisfaction. The position requires deep expertise in Python data processing libraries and a strong understanding of both modern data stack and composable data stack ecosystems.

The company offers a collaborative environment with a balance between office work and flexibility for deep focus. They provide various benefits including learning opportunities, equity through ESOP, and various perks. The company is backed by notable investors and technical founders from companies like Datadog, Hugging Face, and MotherDuck.

This role is perfect for someone who is passionate about data engineering, has strong Python skills, and wants to contribute to the evolution of data processing tools while working with a talented team in Berlin.

Last updated 20 days ago

Responsibilities For Senior Data Engineer (Python)

  • Design and implement OSS features integrating query engines, transformation frameworks, table formats
  • Listen to users and focus on their production needs with dlt
  • Work with customers in commercial projects combining dlt with modern data stack infrastructure
  • Maintain the open source project with the team (review PRs, resolve issues, communicate with community)

Requirements For Senior Data Engineer (Python)

Python
  • Knowledge of duckdb, arrow, datafusion, lancedb, delta-rs, ibis, pyiceberg, sqlglot, kedro, hamilton and similar Python libraries
  • Experience in building data apps or products based on composable data stack
  • Knowledge of Modern Data Stack
  • Fluency in Python coding (typing, unit testing, docstrings)
  • Degree in computer science, data science, or equivalent experience
  • Familiarity with GitHub workflows
  • Based in Berlin and willing to work in office regularly

Benefits For Senior Data Engineer (Python)

Education Budget
Equity
  • Office-first company with flexibility for deep work and WFH
  • Dedicated no meeting days
  • Public transportation ticket coverage
  • Annual budget for learning and development
  • Subsidized team lunches
  • Urban Sports Club membership
  • ESOP plan for employees

Interested in this job?

Jobs Related To dltHub Senior Data Engineer (Python)

Senior Data Scientist, YouTube Premium

Senior Data Scientist position at YouTube Premium focusing on analytics, modeling, and data-driven product strategy, offering competitive compensation and benefits.

Senior Data Scientist, Research, Ads Metrics

Senior Data Scientist position at Google focusing on ads metrics research, offering competitive salary and opportunity to shape advertising products through data-driven decisions.

Senior Data Scientist, Research

Senior Data Scientist position at Google, working on product analytics and development using Python and SQL, with 5+ years of experience required.

Senior Data Scientist, Search (Real World Journeys)

Senior Data Scientist position at Google, focusing on Search analytics and Real World Journeys, requiring expertise in data analysis, programming, and fluency in English and Portuguese.

Customer Engineer, Data Analytics, Google Cloud

Senior Data Engineer role at Google Cloud focusing on data analytics and customer engineering, offering competitive compensation and benefits.