Samza: Real-time Stream Processing at LinkedIn
Samza: Real-time Stream Processing at LinkedIn
Location:
Grand Ballroom A
Track:
Time:
Wednesday, 11:45am - 12:35pm
Abstract:
Apache Samza is a distributed stream processing framework. Samza provides a familiar and easy to use MapReduce style API that allows developers to process messages and events in realtime. The framework integrates with Apache Kafka for its messaging layer, and Apache Hadoop YARN to manage fault tolerance, processor isolation, resource management, and security. Samza also manages processor state, and will recover to a consistent snapshot when failures occur. This talk will cover Samza's feature set, how Samza integrates with YARN and Kafka, how it's used at LinkedIn, and what's next on the roadmap.
Chris is a Staff Software Engineer at LinkedIn, where he's is currently working as a committer and PMC member for Apache Samza. He's been involved in a wide range of projects at LinkedIn, including, "People You May Know", REST.li, Hadoop, engineering tooling, and OLAP systems. Prior to LinkedIn, he worked on data visualization and fraud modeling at PayPal.