Workshop: Hands-on with Apache Spark
Location:
- Seacliff A
Key takeaways
Prerequisites
Apache Spark is a new and exciting open source data processing engine, widely regarded as the next-generation successor to MapReduce. It was designed from the ground up to support streaming data processing, graph processing, and complex iterative data processing. Spark abstracts large data sets as Resilient Distributed Datasets (RDDs) and provides elegant APIs for manipulating them.
This workshop covers the core concepts of Apache Spark and includes hands-on exercises that use the RDD APIs to solve common data processing problems. The exercises use the Apache Spark Scala APIs, so the workshop also covers the parts of Scala that are relevant to the exercises.
Other Workshops:
Tracks
Covering innovative topics
Monday Nov 16
- Architectures You've Always Wondered About
  Silicon Valley to Beijing: Exploring some of the world's most intriguing architectures
- Applied Machine Learning
  How to start using machine learning and data science in your environment today. Latest and greatest best practices.
- Browser as a platform (Realizing HTML5)
  Exciting new standards like Service Workers, Push Notifications, and WebRTC are making the browser a formidable platform.
- Modern Languages in Practice
  The rise of 21st century languages: Go, Rust, Swift
- Org Hacking
  Our most innovative companies reimagining the org structure
- Design Thinking
  Level up your approach to problem solving and leave everything better than you found it.
Tuesday Nov 17
- Containers in Practice
  Build resilient, reactive systems one service at a time.
- Architecting for Failure
  Your system will fail. Take control before it takes you with it.
- Modern CS in the Real World
  Real-world industry adoption of modern CS ideas
- The Amazing Potential of .NET Open Source
  From language design in the open to Rx.NET, there is amazing potential in an Open Source .NET
- Optimizing You
  Keeping life in balance is always a challenge. Learning lifehacks
- Unlearning Performance Myths
  Lessons on the reality of performance, scale, and security
Wednesday Nov 18
- Streaming Data @ Scale
  Real-time insights at Cloud Scale & the technologies that make them happen!
- Taking Java to the Next Level
  Modern, lean Java. Focuses on topics that push Java beyond how you currently think about it.
- The Dark Side of Security
  Lessons from your enemies
- Taming Distributed Architecture
  Reactive architectures, CAP, CRDTs, consensus systems in practice
- JavaScript Everywhere!
  JavaScript is everywhere. Learn why.
- Culture Reimagined
  Lessons on building highly effective organizations



