Summary:
As I understand it, coalesce(numPartitions, shuffle = true) is equivalent to repartition(numPartitions). The difference: with coalesce(numPartitions, shuffle = false), several existing partitions are simply merged into one (note: no shuffle …
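The distinction in this summary can be illustrated with a toy model in plain Java. This is only a sketch under stated assumptions: it does not call Spark, the class name CoalesceSketch and both method names are hypothetical, partitions are modeled as in-memory lists, and the round-robin redistribution merely stands in for a real shuffle. Spark's own partition grouping also takes data locality into account, which this model ignores.

```java
import java.util.ArrayList;
import java.util.List;

public class CoalesceSketch {

    // Toy model of coalesce(n, shuffle = false): whole input partitions are
    // concatenated with their neighbours; no element is moved individually.
    // Without a shuffle the partition count can only shrink, never grow.
    public static <T> List<List<T>> mergeWithoutShuffle(List<List<T>> partitions, int numPartitions) {
        if (numPartitions < 1) {
            throw new IllegalArgumentException("numPartitions must be positive");
        }
        int target = Math.min(numPartitions, partitions.size());
        List<List<T>> result = new ArrayList<>();
        for (int i = 0; i < target; i++) {
            result.add(new ArrayList<>());
        }
        for (int i = 0; i < partitions.size(); i++) {
            // Map input partition i to an output partition, keeping neighbours together.
            int out = (int) ((long) i * target / partitions.size());
            result.get(out).addAll(partitions.get(i));
        }
        return result;
    }

    // Toy model of coalesce(n, shuffle = true), i.e. repartition(n): every
    // element is redistributed on its own (round-robin here, as an assumption),
    // so the partition count may grow as well as shrink.
    public static <T> List<List<T>> redistributeWithShuffle(List<List<T>> partitions, int numPartitions) {
        if (numPartitions < 1) {
            throw new IllegalArgumentException("numPartitions must be positive");
        }
        List<List<T>> result = new ArrayList<>();
        for (int i = 0; i < numPartitions; i++) {
            result.add(new ArrayList<>());
        }
        int next = 0;
        for (List<T> partition : partitions) {
            for (T element : partition) {
                result.get(next).add(element);
                next = (next + 1) % numPartitions;
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<List<Integer>> input =
                List.of(List.of(1, 2), List.of(3), List.of(4, 5), List.of(6));
        System.out.println(mergeWithoutShuffle(input, 2));            // [[1, 2, 3], [4, 5, 6]]
        System.out.println(redistributeWithShuffle(input, 2));        // [[1, 3, 5], [2, 4, 6]]
        System.out.println(mergeWithoutShuffle(input, 8).size());     // 4
        System.out.println(redistributeWithShuffle(input, 8).size()); // 8
    }
}
```

In this model, asking the non-shuffling variant for 8 partitions leaves the count at 4, while the shuffling variant produces 8. That matches the documented Spark behaviour: requesting more partitions with shuffle = false leaves the partition count unchanged.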
Summary:
import org.apache.spark.SparkConf; import org.apache.spark.api.java.JavaRDD; import org.apache.spark.api.java.JavaSparkContext; import org.apache.spar …
Summary:
import org.apache.spark.SparkConf; import org.apache.spark.api.java.JavaRDD; import org.apache.spark.api.java.JavaSparkContext; import org.apache.spar …