Presentation: "Hadoop"

Time: Wednesday 15:00 - 16:00

Location: City

Abstract: Apache Hadoop is an open-source platform for storing and processing large volumes of data by using clusters of commodity machines. We will briefly cover the architecture of Hadoop's two main components: Hadoop Distributed File System and Hadoop MapReduce. Then, we'll delve into some common problems well-suited to Hadoop. We'll also discuss the larger Hadoop open-source ecosystem, including powerful tools like Pig, Hive, and HBase.

Philip Zeyliger, Cloudera

 Philip  Zeyliger Philip Zeyliger is an engineer working on and around Hadoop at Cloudera, which offers support and services for Hadoop. Previously, he worked at Google on scalable storage for user-facing applications, and, before that, he worked at financial firm D.E. Shaw. Philip holds a bachelor's degree in mathematics from Harvard University.