Presentation: Scaling the Data Infrastructure @Spotify

Duration

Duration: 
11:50am - 12:40pm

Level:

Persona:

Abstract

Spotify is currently one of the most popular music streaming services in the world with over 100 million monthly active users. We have over the last few years have a phenomenal growth that now has pushed our backend infrastructure out from our data centers and into the cloud. Earlier this year we announced that we are transitioning all of our backend into Google Cloud Platform, GCP. 

In this talk we are going to give an brief overview of what our Data Infrastructure tribe provides at Spotify. Then we are going to do a bit deeper dive into some of the Data Infrastructure components:

  • Event Delivery - Our event delivery system is a key component in our data infrastructure, that delivers complete data with predictable latency and well defined interface for our developers. This data is used to produce Discover Weekly, Spotify Party, Year in music and many more Spotify features. Here we will focus on the evolution of the event delivery service and the lessons learned and some of the reasoning for moving to Google Cloud Pub/Sub and into the cloud.
  • Datamon - Another key component of our data infrastructure is Datamon. Datamon provides and easy overview of data delivered, not just by our event delivery system but for all systems producing data into our central storages. Datamon also integrates with PagerDuty to help with our Data Operations.
  • Styx - In any data infrastructure there is a need for scheduling applications. Styx enables distributed and scalable scheduling of Docker containers. Styx has evolved out of our extensive use of Luigi and the need to get more specialized tools in our infrastructure. Styx is built using the Spotify Apollo framework and uses Kubernetes for container invocations.

Speaker: Mārtiņš Kalvāns

Big Data Engineer @Spotify

Graduated University of Latvia masters degree in Computer Science. Since 2013 working in the Spotify Data Infrastructure Tribe.

Find Mārtiņš Kalvāns at

Speaker: Matti Pehrs

Software Engineer @Spotify

Matti works as a back-end developer in the Spotify Data Infrastructure Tribe. He has worked in the IT industry for more than 20 years and with Java almost from its inception.

Find Matti Pehrs at

Similar Talks

Developer @ThoughtWorks Inc
Tech Lead of Manhattan Team @Twitter
Staff Engineer, JVM Team @Twitter
Technical Manager Aurora / Mesos Team @Twitter
Provisioning Engineering SE @Twitter
Senior Software Engineer @Apple

.

Tracks

Monday Nov 7

Tuesday Nov 8

Wednesday Nov 9

Conference for Professional Software Developers