Workshop: Building Smarter Applications with Spark & H2O
Location:
- Marina Room
Data is today’s clay and the newest killer applications are data products.
In this workshop, you will learn about the anatomy of a data product on Spark and H2O. Then we’ll show you how to mold your data into a pipeline and a real-time ML system by building a complete loan interest rate prediction product. Our clay will be a public Lending Club dataset, and we will create a real-time Spark Streaming application that both makes predictions and deploys library models.
The goal is to produce borrower interest rates that are comparable to or better than human-led predictions.
Key Takeaways:
- Clean and transform datasets in Sparkling Water
- Join varying datasets (text and time series) by defining conformed dimensions
- Use MLlib's word2vec for NLP and H2O's Gradient Boosting Machine to produce a scoring engine
- Integrate the scoring engine from Sparkling Water models into Spark Streaming
- Produce real-time scoring and predictions
- Create a pipeline of ensemble models – retire and promote them based on risk and domain
- Deploy smarter applications on Spark and Cloud
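The scoring and promote/retire takeaways above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the workshop's code: it uses no Spark or H2O APIs, plain functions stand in for Sparkling Water models, a Python list stands in for a Spark Streaming micro-batch, and mean absolute error is an assumed stand-in for the "risk and domain" criteria. The names `Candidate`, `ModelPipeline`, `flat_rate`, `risk_based`, and the `debt_to_income` feature are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical record type: a loan application as a dict of numeric features.
Loan = Dict[str, float]


@dataclass
class Candidate:
    """A scoring model plus the running errors used to rank it."""
    name: str
    predict: Callable[[Loan], float]  # returns a predicted interest rate (%)
    abs_errors: List[float] = field(default_factory=list)

    def mean_abs_error(self) -> float:
        # Models with no feedback yet rank last.
        if not self.abs_errors:
            return float("inf")
        return sum(self.abs_errors) / len(self.abs_errors)


class ModelPipeline:
    """Serves the champion model and re-ranks candidates after each batch."""

    def __init__(self, candidates: List[Candidate]) -> None:
        if not candidates:
            raise ValueError("at least one candidate model is required")
        self.candidates = list(candidates)
        self.champion = self.candidates[0]

    def score_batch(self, batch: List[Loan]) -> List[float]:
        # Real-time path: only the current champion answers requests.
        return [self.champion.predict(loan) for loan in batch]

    def record_outcomes(self, batch: List[Loan], actual_rates: List[float]) -> None:
        # Feedback path: every candidate is evaluated on the same labelled batch.
        for model in self.candidates:
            for loan, actual in zip(batch, actual_rates):
                model.abs_errors.append(abs(model.predict(loan) - actual))
        # Promote the lowest-error candidate; a displaced champion is retired
        # to challenger status but keeps being evaluated.
        self.champion = min(self.candidates, key=Candidate.mean_abs_error)


# Two toy models (assumptions for illustration only).
def flat_rate(loan: Loan) -> float:
    return 12.0


def risk_based(loan: Loan) -> float:
    return 5.0 + 0.25 * loan["debt_to_income"]


pipeline = ModelPipeline([
    Candidate("flat", flat_rate),
    Candidate("risk_based", risk_based),
])
batch = [{"debt_to_income": 10.0}, {"debt_to_income": 30.0}]

first_scores = pipeline.score_batch(batch)  # served by the initial champion
pipeline.record_outcomes(batch, actual_rates=[7.5, 12.5])
print(pipeline.champion.name)  # risk_based
```

In a real deployment the batch would arrive from a stream and the models would be trained elsewhere; the sketch only shows the separation between the serving path and the feedback path that drives promotion.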