Workshop: Introduction to Stream Processing and Apache Flink




1:00pm - 4:00pm


  • Basic programming experience in Java or Scala is necessary to participate in the exercises
  • Knowledge of stream or batch processing frameworks is helpful, but not necessary

Over the last few years, data stream processing has redefined how many of us build data pipelines. Apache Flink is one of the systems at the forefront of that development: with its versatile APIs (event-time streaming, Stream SQL, events/state) and powerful execution and operations model, Flink has been part of redefining what stream processing can do. By now, Apache Flink powers some of the largest data stream processing pipelines in the world.

This workshop introduces Apache Flink's concepts and architecture, followed by a hands-on tutorial in which you create a streaming application that receives events from Apache Kafka and processes them with Flink's DataStream API and Streaming SQL.

Here is what you will learn:

  • A brief introduction to stateful stream processing
  • The architecture of Apache Flink: abstractions, execution, concepts for managing stateful streaming applications
  • Introduction to Flink's DataStream API
  • Basic concepts of Flink's Stream SQL API
  • How to connect Flink to Kafka

Speaker: Stephan Ewen

Committer @ApacheFlink, CTO @dataArtisans

Stephan Ewen is a PMC member and one of the original creators of Apache Flink, and co-founder and CTO of data Artisans. He holds a Ph.D. from the Berlin University of Technology.


Speaker: Jamie Grier

Streaming Platform @Lyft

Jamie Grier is currently the Streaming Platform tech lead at Lyft. He has worked extensively with Apache Flink over the last 2 years, and with stream processing in general for the last decade, at companies such as Twitter, Gnip and Boulder Imaging.
