From The Lab To The Factory: Building A Production Machine Learning Infrastructure
From The Lab To The Factory: Building A Production Machine Learning Infrastructure
Location:
Bayview A/B
Time:
Tuesday, 5:20pm - 6:10pm
Abstract:
At most companies, advanced analytics expertise is contained in a lab environment: a small team of analysts sitting at their computers and churning out reports and insights to support business decisions. But the real potential for advanced analytics lies in building models that make real-time decisions within productionworkflows.
We will discuss how to use the ecosystem of technologies around Hadoop to support bringing models out of the lab and into the factory, with a focus on strategies for data integration, large-scale machine learning, and experimentation.
Josh Wills is the director of data science at Cloudera. Wills is one of the main contributors to Cloudera’s most recent open source project, Crunch, a Java library that aims to make writing, testing, and running MapReduce pipelines easy, efficient, and even fun. Prior to joining Cloudera, Wills was a software engineer at Google. Josh holds a M.S.E. in operations research from the University of Texas and a BS in mathematics from Duke University.