Senior Data Engineer

Spotify is a leading music and podcast streaming platform innovating in audio technology.
London, United Kingdom
Data
Senior Software Engineer
Hybrid

Description For Senior Data Engineer

Spotify is seeking a Senior Data Engineer to join its Speak team, the in-house text-to-speech (TTS) team supporting products like DJ, AI Voice Translation, and new unreleased products. The role focuses on building world-class speech technologies that can power the next generation of personalized generative voice products at scale.

As a Senior Data Engineer, you'll be responsible for building large-scale speech and audio data pipelines using tools such as Google Cloud Platform and Apache Beam. You'll work on machine learning projects powering new generative AI experiences and help build state-of-the-art text-to-speech models. The role involves learning and contributing to the team's understanding of best practices and techniques for building data pipelines for large-scale generative models, including cleaning, filtering, classifying, and labelling.

You'll collaborate with other engineers, researchers, product managers, and stakeholders, taking on learning and leadership opportunities. The ideal candidate has strong Data Engineering experience, particularly with high-volume, heterogeneous data and distributed systems. Proficiency in Python and experience with data processing frameworks like Beam, Dataflow, or Spark are essential.

This position offers the opportunity to work on cutting-edge speech technology projects in a dynamic, collaborative environment. You'll be part of a team that values quality, agile processes, and responsible experimentation. If you're passionate about data engineering, machine learning, and building innovative voice products, this role at Spotify could be an excellent fit for your career growth.

Last updated 7 months ago

Responsibilities For Senior Data Engineer

  • Build large-scale speech and audio data pipelines using tools such as Google Cloud Platform and Apache Beam
  • Work on machine learning projects powering new generative AI experiences and help build state-of-the-art text-to-speech models
  • Learn and contribute to the team's understanding of best practices and techniques for building data pipelines for large-scale generative models, including cleaning, filtering, classifying, and labelling
  • Collaborate with other engineers, researchers, product managers, and stakeholders, taking on learning and leadership opportunities that arise
  • Deliver scalable, testable, maintainable, and high-quality code
  • Share knowledge and promote standard methodologies, making your team the best version of itself through mentorship and constructive accountability

Requirements For Senior Data Engineer

Python
Java
Cassandra
  • Data Engineering experience with high-volume, heterogeneous data, preferably with distributed systems such as Hadoop, BigTable, Cassandra, GCP, AWS, or Azure
  • Experience with one or more higher-level Python- or Java-based data processing frameworks such as Beam, Dataflow, Crunch, Scalding, Storm, Spark, or Flink
  • Strong Python programming abilities
  • Experience using pre-trained ML models is a plus
  • Experience with Docker, Luigi, Airflow, or similar tools
  • Care about quality and know what it means to ship high-quality code
  • Experience managing data retention policies
  • Care about agile software processes, data-driven development, reliability, and responsible experimentation
  • Understand the value of collaboration and partnership within teams
