Senior Data Engineer

Spotify is a leading music and podcast streaming platform innovating in audio technology.
London, United Kingdom
Data
Senior Software Engineer
Hybrid

Description For Senior Data Engineer

Spotify is seeking a Senior Data Engineer to join their Speak team, which is the in-house text-to-speech (TTS) team supporting products like DJ, AI Voice Translation, and exciting new unreleased products. The role focuses on building world-class speech technologies that can power the next generation of personalized generative voice products at scale.

As a Senior Data Engineer, you'll be responsible for building large-scale speech and audio data pipelines using technologies such as Google Cloud Platform and Apache Beam. You'll work on machine learning projects powering new generative AI experiences and help build state-of-the-art text-to-speech models. The role involves learning and contributing to the team's understanding of best practices and techniques for building data pipelines for large-scale generative models, including cleaning, filtering, classifying, and labelling.

You'll collaborate with other engineers, researchers, product managers, and stakeholders, taking on learning and leadership opportunities. The ideal candidate has strong Data Engineering experience, particularly with high-volume, heterogeneous data and distributed systems. Proficiency in Python and experience with data processing frameworks like Beam, Dataflow, or Spark is essential.

This position offers the opportunity to work on cutting-edge speech technology projects in a dynamic, collaborative environment. You'll be part of a team that values quality, agile processes, and responsible experimentation. If you're passionate about data engineering, machine learning, and building innovative voice products, this role at Spotify could be an excellent fit for your career growth.

Last updated 4 months ago

Responsibilities For Senior Data Engineer

  • Build large-scale speech and audio data pipelines using technologies such as Google Cloud Platform and Apache Beam
  • Work on machine learning projects powering new generative AI experiences and help build state-of-the-art text-to-speech models
  • Learn and contribute to the team's understanding of best practices and techniques for building data pipelines for large-scale generative models, including cleaning, filtering, classifying, and labelling
  • Collaborate with other engineers, researchers, product managers and stakeholders, taking on learning and leadership opportunities that arise
  • Deliver scalable, testable, maintainable, and high-quality code
  • Share knowledge and promote standard methodologies, making your team the best version of itself through mentorship and constructive accountability

Requirements For Senior Data Engineer

Skills: Python, Java, Cassandra
  • Data Engineering experience with high-volume, heterogeneous data, preferably with distributed systems such as Hadoop, BigTable, Cassandra, GCP, AWS or Azure
  • Experience with one or more higher-level Python- or Java-based data processing frameworks such as Beam, Dataflow, Crunch, Scalding, Storm, Spark, or Flink
  • Strong Python programming abilities
  • Experience using pre-trained ML models is a plus
  • Experience with Docker, Luigi, Airflow, or similar tools
  • Care about quality and know what it means to ship high-quality code
  • Experience managing data retention policies
  • Care about agile software processes, data-driven development, reliability, and responsible experimentation
  • Understand the value of collaboration and partnership within teams
