Presentation: "Hadoop"
Track:
Cool Stuff with Java
Time: Wednesday 15:00 - 16:00
Location: City
Abstract: Apache Hadoop is an open-source platform for storing and processing large volumes of data by using clusters of commodity machines. We will briefly cover the architecture of Hadoop's two main components: Hadoop Distributed File System and Hadoop MapReduce. Then, we'll delve into some common problems well-suited to Hadoop. We'll also discuss the larger Hadoop open-source ecosystem, including powerful tools like Pig, Hive, and HBase.