This book introduces Apache Spark, the open source cluster computing system that makes data analytics fast to write and fast to run. With Spark, you can tackle big datasets quickly through simple APIs in Python, Java, and Scala. This edition include
One of the authors is Matei, one of the creators of Spark, so the book is strongly recommended. At only a little over 370 pages, it is concise and well suited to beginners who want a thorough grounding in Spark. This PDF is the print version, so the code and text in the book can be copied, which is convenient.