Chris.Riccomini
Chris Riccomini
Staff Software Engineer, LinkedInChris is a Staff Software Engineer at LinkedIn, where he's is currently working as a committer and PMC member for Apache Samza. He's been involved in a wide range of projects at LinkedIn, including, "People You May Know", REST.li, Hadoop, engineering tooling, and OLAP systems. Prior to LinkedIn, he worked on data visualization and fraud modeling at PayPal.
-
Samza: Real-time Stream Processing at LinkedIn
Track: Hadoop : Beyond Map-ReduceLocation:Grand Ballroom AAbstract:
Apache Samza is a distributed stream processing framework. Samza provides a familiar and easy to use MapReduce style API that allows developers to process messages and events in realtime. The framework integrates with Apache Kafka for its messaging layer, and Apache Hadoop YARN to manage fault tolerance, processor isolation, resource management, and security. Samza also manages processor state, and will recover to a consistent snapshot when failures occur. This talk will cover Samza's feature set, how Samza integrates with YARN and Kafka, how it's used at LinkedIn, and what's next on the roadmap.