Presentation: "Data Architecture at Twitter Scale"

Time: Wednesday 10:35 - 11:35

Location: Franciscan I & II

Abstract: One of the biggest opportunities for innovation on the web today is mining the huge amount of data from Twitter and similar platforms. Unfortunately, supporting these datasets is labor-intensive due to immature storage solutions. We will discuss Twitter's experience with its four fundamental data types and query patterns: tweets, timelines, social graphs, and search indices. For each of these, Twitter implemented custom datastores because existing solutions were insufficient. Based on that experience, we will propose requirements for a general-purpose distributed database that would foster broad innovation on the web.

Nick Kallen, Twitter Engineer; FlockDb co-creator

 Nick  Kallen Nick Kallen is a Systems Engineer at Twitter. He is the author of Arel, NamedScope, Cache Money, and Screw.Unit; and a co-creator FlockDb, Twitter's distributed graph database.