Summary: From https://databricks.gitbooks.io/databricks-spark-knowledge-base/content/performance_optimization/how_many_partitions_does_an_rdd_have.html For tun...
posted @ 2016-02-17 16:22 木石头 Views (423) Comments (0) Recommendations (0)
Summary: fold and reduce both aggregate over a collection by applying an operation you specify; the major difference is the starting point of the aggregatio...
posted @ 2016-02-17 16:19 木石头 Views (571) Comments (0) Recommendations (0)
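
The fold/reduce summary above turns on where the aggregation starts. The full post is not reproduced here, so what follows is only a minimal sketch of the general idea in Python, not the author's own code. `fold` is a hypothetical helper name introduced for illustration; it wraps `functools.reduce` with an explicit initial value, while plain `reduce` starts from the first element of the collection.

```python
from functools import reduce
import operator


def fold(op, zero, xs):
    """Hypothetical helper: aggregate xs with op, starting from an explicit zero value."""
    return reduce(op, xs, zero)


numbers = [1, 2, 3, 4]

# reduce starts from the first element of the collection.
total_reduce = reduce(operator.add, numbers)

# fold starts from the supplied initial value.
total_fold = fold(operator.add, 0, numbers)

# On an empty collection, fold simply returns its starting value...
empty_fold = fold(operator.add, 0, [])

# ...while reduce has no starting point and raises TypeError.
try:
    reduce(operator.add, [])
    empty_reduce_failed = False
except TypeError:
    empty_reduce_failed = True
```

The practical consequence in this sketch is that a fold is safe on empty input as long as the starting value is sensible, whereas a reduce needs at least one element to begin from.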