Presentation: Demystifying stream processing with Apache Kafka
Abstract
The concept of stream processing has been around for a while, and most software systems continuously transform streams of inputs into streams of outputs. Yet the idea of directly modeling stream processing in infrastructure systems is just coming into its own after a few decades on the periphery.
At its core, stream processing is simple: read data in, process it, and maybe emit some data out. So why are there so many stream processing frameworks that all define their own terminology? And are the components of each even comparable? Why do I need to know about spouts or DStreams just to process a simple sequence of records? Depending on your application's requirements, you may not need a framework.
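The read, process, emit loop described above can be sketched without any framework. What follows is an illustrative sketch in plain Python, not Kafka client code: the record shape (text lines) and the names `read_records`, `process`, and `run` are assumptions made for this example.

```python
from typing import Iterable, Iterator, Optional


def read_records(source: Iterable[str]) -> Iterator[str]:
    """Read data in: yield one record at a time (stands in for a consumer)."""
    for line in source:
        yield line.strip()


def process(record: str) -> Optional[str]:
    """Process one record; return None when there is nothing to emit."""
    if not record:
        return None
    return record.upper()


def run(source: Iterable[str]) -> Iterator[str]:
    """Maybe emit data out: a stream of inputs becomes a stream of outputs."""
    for record in read_records(source):
        result = process(record)
        if result is not None:
            yield result
```

Because `run` is a generator, it handles records one at a time and never needs the whole input in memory, which is the property that lets the same shape work on an unbounded stream.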
This talk will be delivered by the creators of the popular stream data systems Apache Kafka and Apache Samza and will abstract away the details of individual frameworks while describing the key features they provide. These core features include scalability and parallelism through data partitioning, fault tolerance and event processing order guarantees, support for stateful stream processing, and handy stream processing primitives such as windowing. Based on our experience building and scaling Kafka to handle streams that captured hundreds of billions of records per day, this presentation will help you understand how to map practical data problems to stream processing and how to write applications that process streams of data at scale.
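Two of the features named above, data partitioning and windowing over state, can be sketched in a few lines. This is a simplified illustration in plain Python, not how Kafka or Samza implement them: the `(key, timestamp)` event shape, the function names, and the use of CRC32 as the hash are all assumptions made for this example.

```python
import zlib
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

Event = Tuple[str, int]  # (key, timestamp in seconds); hypothetical record shape


def partition_for(key: str, num_partitions: int) -> int:
    """Map a key to a partition so all records for that key land together.

    CRC32 is a stand-in hash chosen for this sketch only.
    """
    return zlib.crc32(key.encode("utf-8")) % num_partitions


def split_by_partition(
    events: Iterable[Event], num_partitions: int
) -> Dict[int, List[Event]]:
    """Group events by partition, preserving arrival order within each one."""
    partitions: Dict[int, List[Event]] = defaultdict(list)
    for key, ts in events:
        partitions[partition_for(key, num_partitions)].append((key, ts))
    return dict(partitions)


def tumbling_window_counts(
    events: Iterable[Event], window_seconds: int
) -> Dict[Tuple[str, int], int]:
    """Count events per key in fixed, non-overlapping time windows.

    The counts dictionary is the local state of this stateful operator.
    """
    counts: Dict[Tuple[str, int], int] = defaultdict(int)
    for key, ts in events:
        window_start = ts - (ts % window_seconds)
        counts[(key, window_start)] += 1
    return dict(counts)
```

Since every record for a given key maps to the same partition, each partition can be counted by a separate worker holding only its own state, and order is preserved per key. Fault tolerance, which this sketch omits, would require that state to be recoverable after a crash.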