Workshop: Stream Processing with Apache Flink®

Location:

Level: 
Beginner

When:

1:00pm - 5:00pm

Prerequisites

Programming experience with Java or Scala using an IDE such as IntelliJ IDEA or Eclipse.
Maven knowledge is helpful, but not required.
Basic knowledge of distributed data processing.
For the development environment, there are two options:
a) Use your own IDE to load a sample Flink project (recommended)
b) Use a VirtualBox image provided by the instructor

This Apache Flink® workshop mixes short introductory talks on the basic concepts of Apache Flink with hands-on exercises, so participants can immediately try out what they learned in each talk. The exercises are self-paced, allowing participants to work at their own speed. Solutions are available in both Java and Scala, so attendees are free to choose between these languages. Two instructors will be available to help with questions and problems, including for participants who choose to work on their own Flink-related project during the workshop.

After setting up the development environment and hearing the first introductory talk, participants will implement their first streaming application, analyzing data from a data generator. We’ll send the analysis results to Grafana to visualize them. In the second exercise, we’ll implement a job that uses sliding event-time windows to analyze the data. Next, we’ll connect Apache Flink with Apache Kafka to write events to Kafka topics and to consume them from there.
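
As a rough illustration of the sliding event-time windows used in the second exercise, the sketch below shows in plain Java how a timestamped event can belong to several overlapping windows at once. It is not the workshop's exercise code and does not use the Flink API; the class and method names (SlidingWindowSketch, windowStarts, countPerWindow) and the sample timestamps are hypothetical, and windows are assumed to be aligned to multiples of the slide with no offset.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper, not part of the Flink API: a plain-Java illustration
// of how sliding event-time windows group timestamped events.
final class SlidingWindowSketch {

    /** Start timestamps (ms) of every sliding window [start, start + size) containing the event time. */
    static List<Long> windowStarts(long timestampMs, long sizeMs, long slideMs) {
        List<Long> starts = new ArrayList<>();
        // Latest window start that is <= the timestamp, aligned to a multiple of the slide.
        long lastStart = timestampMs - Math.floorMod(timestampMs, slideMs);
        // Walk backwards one slide at a time while the window still covers the timestamp.
        for (long start = lastStart; start > timestampMs - sizeMs; start -= slideMs) {
            starts.add(start);
        }
        return starts;
    }

    /** Counts events per window, keyed by window start and sorted by start time. */
    static Map<Long, Integer> countPerWindow(long[] eventTimesMs, long sizeMs, long slideMs) {
        Map<Long, Integer> counts = new TreeMap<>();
        for (long t : eventTimesMs) {
            for (long start : windowStarts(t, sizeMs, slideMs)) {
                counts.merge(start, 1, Integer::sum);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        // Made-up event times in milliseconds.
        long[] eventTimes = {1_000L, 4_000L, 7_000L, 12_000L};
        // 10-second windows that slide every 5 seconds.
        System.out.println(countPerWindow(eventTimes, 10_000L, 5_000L));
        // prints {-5000=2, 0=3, 5000=2, 10000=1}
    }
}
```

With a 10-second window and a 5-second slide, every event falls into two windows, which is why the counts add up to twice the number of events. In an actual Flink job the framework assigns the windows and decides when to emit results; this sketch only covers the assignment arithmetic.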

Key Takeaways:

  • Learn key concepts of Apache Flink
  • Gain the knowledge required to start developing advanced streaming applications with Flink, including a working Flink example on your laptop
  • Get access to some of the core committers of Flink to ask questions during the workshop
  • Learn how to implement a simple streaming job with a data generator and a few operations, and how to connect Flink to Apache Kafka

Speaker: Jamie Grier

Director, Applications Engineering @DataArtisans

Jamie Grier is Director of Applications Engineering at data Artisans, where he’s extremely excited to be able to help others realize the potential of Apache Flink® in their own projects. Jamie has been working on stream processing for the last decade at companies such as Twitter, Gnip and Boulder Imaging. This work has spanned everything from ultra-high-performance video stream processing to social media analytics.

Speaker: Robert Metzger

PMC member and committer, Apache Flink project

Robert Metzger is a PMC member of the Apache Flink project and a co-founder and software engineer at data Artisans. He is the author of many Flink components, including the Kafka and YARN connectors. Robert studied Computer Science at TU Berlin and worked at IBM Germany and at the IBM Almaden Research Center in San Jose. He is a frequent speaker at conferences such as Hadoop Summit 2015 in San Jose and ApacheCon Big Data in Budapest, and at meetups in Europe and the US.
