Big Data Platform as a Service at Netflix
Big Data Platform as a Service at Netflix
Netflix is well known for being a heavily data driven company, leveraging billions of hours of subscriber viewing data to power its recommendation algorithms and refine the customer experience. Hadoop has been instrumental in unlocking the potential to process these vast quantities data. However, without tooling and services to facilitate data discoverability and usability, the potential of big data would be difficult to fully realize at Netflix.
This talk will deep dive into key services of Netflix’s “data platform as a service” architecture, including RESTful services that: provide comprehensive metadata management across data sources (Franklin); enable visualization and caching of results of Hadoop jobs (Sting); and visualize the execution plans produced by languages such as Pig and Hive (Lipstick). The presentation will show how these services can be employed in concert to solve various use cases at Netflix, and will include implementation details, demos, and our open source roadmap for these projects.