Senior Data Engineer

Spotify is a leading music and podcast streaming platform innovating in audio technology.
London, United Kingdom
Data
Senior Software Engineer
Hybrid

Description For Senior Data Engineer

Spotify is seeking a Senior Data Engineer to join their Speak team, which is the in-house text-to-speech (TTS) team supporting products like DJ, AI Voice Translation, and exciting new unreleased products. The role focuses on building world-class speech technologies that can power the next generation of personalized generative voice products at scale.

As a Senior Data Engineer, you'll be responsible for building large-scale speech and audio data pipelines using tools like Google Cloud Platform and Apache Beam. You'll work on machine learning projects powering new generative AI experiences and help build state-of-the-art text-to-speech models. The role involves learning and contributing to the team's understanding of best practices and techniques for building data pipelines for large-scale generative models, including cleaning, filtering, classifying, and labelling data.

You'll collaborate with other engineers, researchers, product managers, and stakeholders, taking on learning and leadership opportunities as they arise. The ideal candidate has strong data engineering experience, particularly with high-volume, heterogeneous data and distributed systems. Proficiency in Python and experience with data processing frameworks like Beam, Dataflow, or Spark are essential.

This position offers the opportunity to work on cutting-edge speech technology projects in a dynamic, collaborative environment. You'll be part of a team that values quality, agile processes, and responsible experimentation. If you're passionate about data engineering, machine learning, and building innovative voice products, this role at Spotify could be an excellent fit for your career growth.

Last updated 5 months ago

Responsibilities For Senior Data Engineer

  • Build large-scale speech and audio data pipelines using tools like Google Cloud Platform and Apache Beam
  • Work on machine learning projects powering new generative AI experiences and help build state-of-the-art text-to-speech models
  • Learn and contribute to the team's understanding of best practices and techniques for building data pipelines for large-scale generative models, including cleaning, filtering, classifying and labelling data
  • Collaborate with other engineers, researchers, product managers and stakeholders, taking on learning and leadership opportunities that arise
  • Deliver scalable, testable, maintainable, and high-quality code
  • Share knowledge and promote standard methodologies, making your team the best version of itself through mentorship and constructive accountability

Requirements For Senior Data Engineer

Python
Java
Cassandra
  • Data Engineering experience with high-volume, heterogeneous data, preferably with distributed systems such as Hadoop, BigTable, Cassandra, GCP, AWS or Azure
  • Experience with one or more higher-level Python- or Java-based data processing frameworks such as Beam, Dataflow, Crunch, Scalding, Storm, Spark, or Flink
  • Strong Python programming abilities
  • Experience using pre-trained ML models is a plus
  • Experience with Docker, Luigi, Airflow, or similar tools
  • Care about quality and know what it means to ship high-quality code
  • Experience managing data retention policies
  • Care about agile software processes, data-driven development, reliability, and responsible experimentation
  • Understand the value of collaboration and partnership within teams

Jobs Related To Spotify Senior Data Engineer

Data Engineer, Personalization

Join Spotify as a Data Engineer in Personalization, building large-scale data pipelines and recommendation systems for millions of users.

Sr. Business Intelligence Engineer, EU FBA

Senior Business Intelligence Engineer role at Amazon's FBA team, focusing on analytics and optimization for high-value items in e-commerce fulfillment.

Business Intelligence Engineer III, Supply Chain

Senior Business Intelligence Engineer role at Amazon focusing on supply chain analytics and optimization through data engineering and visualization.

Senior Business Intelligence Engineer, DCC Communities

Senior Business Intelligence Engineer role at AWS, focusing on data warehouse development and analytics for global data center infrastructure operations.