Workshop: Building Smarter Applications with Spark & H20
Location:
- Marina Room
Data is today’s clay and the newest killer applications are data products.
In this workshop, you will learn about the anatomy of a data product on Spark and H2O. Then we’ll show you how to mold your data in order to build a pipeline and a real-time ML system by building a complete loan interest rate prediction product. Our clay is going to be a public Lending Club dataset, and we will create a real-time Spark Streaming application that is both predictive and deploys library models.
The goal is to produce borrower interest rates that are comparable or better than human-led predictions.
Key Take Aways:
- Clean and transform datasets in Sparkling Water
- Join varying datasets (text and time series) by defining conformed dimensions
- Use MLlib to implement word2vec for NLP and H2O for Gradient Boosting Machine in order to produce a scoring engine
- Integrate the scoring engine from Sparkling Water models into Spark Streaming
- Produce real-time scoring and predictions
- Create a pipeline of ensemble models – retire and promote them based on risk and domain
- Deploy smarter applications on Spark and Cloud
Other Workshops:
Tracks
Covering innovative topics
Monday Nov 16
-   
          Architectures You've Always Wondered About    
  Silicon Valley to Beijing: Exploring some of the world's most intrigiuing architectures 
-   
          Applied Machine Learning     
  How to start using machine learning and data science in your environment today. Latest and greatest best practices. 
-   
          Browser as a platform (Realizing HTML5)    
  Exciting new standards like Service Workers, Push Notifications, and WebRTC are making the browser a formidable platform. 
-   
          Modern Languages in Practice    
  The rise of 21st century languages: Go, Rust, Swift 
-   
          Org Hacking    
  Our most innovative companies reimagining the org structure 
-   
          Design Thinking    
  Level up your approach to problem solving and leave everything better than you found it. 
Tuesday Nov 17
-   
          Containers in Practice    
  Build resilient, reactive systems one service at a time. 
-   
          Architecting for Failure    
  Your system will fail. Take control before it takes you with it. 
-   
          Modern CS in the Real World    
  Real-world Industry adoption of modern CS ideas 
-   
          The Amazing Potential of .NET Open Source    
  From language design in the open to Rx.NET, there is amazing potential in an Open Source .NET 
-   
          Optimizing You     
  Keeping life in balance is always a challenge. Learning lifehacks 
-   
          Unlearning Performance Myths    
  Lessons on the reality of performance, scale, and security 
Wednesday Nov 18
-   
          Streaming Data @ Scale    
  Real-time insights at Cloud Scale & the technologies that make them happen! 
-   
          Taking Java to the Next Level    
  Modern, lean Java. Focuses on topics that push Java beyond how you currently think about it. 
-   
          The Dark Side of Security    
  Lessons from your enemies 
-   
          Taming Distributed Architecture    
  Reactive architectures, CAP, CRDTs, consensus systems in practice 
-   
          JavaScript Everywhere!    
  Javascript is Everywhere. Learn why 
-   
          Culture Reimagined    
  Lessons on building highly effective organizations 




