Presentation: Beyond Ad-hoc Data Science
Abstract
Today, tooling for ad-hoc data science is fairly well understood. But when you want to create a repeated process such as analytics or prediction systems, things tend to change with time, and how to deal with such change is not always clear. Columns and features are added and removed. New models are developed. Data errors are discovered and corrected. How can we build a data pipeline system to handle these demands? This talk will discuss some of the systems challenges and solutions that arise when building evolving data science products, and we’ll see how they are addressed at Twitter.
Similar Talks


Tracks
Covering innovative topics
Monday Nov 16
-   
          Architectures You've Always Wondered About    
  Silicon Valley to Beijing: Exploring some of the world's most intrigiuing architectures 
-   
          Applied Machine Learning     
  How to start using machine learning and data science in your environment today. Latest and greatest best practices. 
-   
          Browser as a platform (Realizing HTML5)    
  Exciting new standards like Service Workers, Push Notifications, and WebRTC are making the browser a formidable platform. 
-   
          Modern Languages in Practice    
  The rise of 21st century languages: Go, Rust, Swift 
-   
          Org Hacking    
  Our most innovative companies reimagining the org structure 
-   
          Design Thinking    
  Level up your approach to problem solving and leave everything better than you found it. 
Tuesday Nov 17
-   
          Containers in Practice    
  Build resilient, reactive systems one service at a time. 
-   
          Architecting for Failure    
  Your system will fail. Take control before it takes you with it. 
-   
          Modern CS in the Real World    
  Real-world Industry adoption of modern CS ideas 
-   
          The Amazing Potential of .NET Open Source    
  From language design in the open to Rx.NET, there is amazing potential in an Open Source .NET 
-   
          Optimizing You     
  Keeping life in balance is always a challenge. Learning lifehacks 
-   
          Unlearning Performance Myths    
  Lessons on the reality of performance, scale, and security 
Wednesday Nov 18
-   
          Streaming Data @ Scale    
  Real-time insights at Cloud Scale & the technologies that make them happen! 
-   
          Taking Java to the Next Level    
  Modern, lean Java. Focuses on topics that push Java beyond how you currently think about it. 
-   
          The Dark Side of Security    
  Lessons from your enemies 
-   
          Taming Distributed Architecture    
  Reactive architectures, CAP, CRDTs, consensus systems in practice 
-   
          JavaScript Everywhere!    
  Javascript is Everywhere. Learn why 
-   
          Culture Reimagined    
  Lessons on building highly effective organizations 








