说明: 该文档来自Spark Summit 2013峰会上Gavin Li,Jaebong Kim,Andy Feng的演讲。In this talk, we will present our recent effort to migrate AEX pipeline from Hadoop streaming to Spark. We aim to reduce audience model to be refreshed at least 2x faster. We came up an inno <villa123> 在 上传 | 大小:3145728
说明: 该文档来自Spark Summit 2013峰会上来自Cloudera的Sandy Ryza的主题演讲。The talk will discuss the current state of resource management on Hadoop, how Spark fits in currently, the work that needs to be done to share resources fluidly between Spark and other processing f <villa123> 在 上传 | 大小:858112
说明: 该文档来自Spark Summit 2013峰会上来自Ooyala的Evan Chan和Kelvin Chu的主题演讲。We would like to share with you the innovative ways that we use Spark at Ooyala, together with Apache Cassandra, to tackle interactive analytics and OLAP applications. <villa123> 在 上传 | 大小:4194304
说明: 该文档来自Spark Summit 2013峰会上来自CloudPhysics的Xiaojun Liu的主题演讲。CloudPhysics is creating an operations management SaaS product to address such challenges. Our service has hundreds of active users. Each day more than 100 billion data samples are collected f <villa123> 在 上传 | 大小:4194304
说明: 该文档来自Spark Summit 2013峰会上来自Intel公司的Jason Dai的主题演讲。In this talk, we will present our efforts and experience on building real-time analytical processing framework with several large websites in China (e.g., Alibaba, Baidu iQiyi, Youku), leveraging the <villa123> 在 上传 | 大小:787456