Basic transformations on two RDDs in Spark

The following transformations each take two RDDs as input. They are applied here to RDDs containing {1, 2, 3} and {3, 4, 5}:

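1: union() returns every element from both RDDs; duplicates are kept.

  // sc is assumed to be an existing SparkContext (spark-shell provides one)
  val example1 = sc.parallelize(List(1, 2, 3))
  val example2 = sc.parallelize(List(3, 4, 5))
  // Concatenate the two RDDs; the shared element 3 appears twice
  val result = example1.union(example2)
  println(result.collect().mkString(","))  // 1,2,3,3,4,5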

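2: intersection() returns only the elements present in both RDDs, with duplicates removed.

  // sc is assumed to be an existing SparkContext (spark-shell provides one)
  val example1 = sc.parallelize(List(1, 2, 3))
  val example2 = sc.parallelize(List(3, 4, 5))
  // Keep the elements common to both RDDs; this requires a shuffle
  val result = example1.intersection(example2)
  println(result.collect().mkString(","))  // 3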

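3: subtract() returns the elements of the first RDD that do not appear in the second.

  // sc is assumed to be an existing SparkContext (spark-shell provides one)
  val example1 = sc.parallelize(List(1, 2, 3))
  val example2 = sc.parallelize(List(3, 4, 5))
  // Drop from example1 every element that also occurs in example2
  val result = example1.subtract(example2)
  // The result holds 1 and 2; their order after the shuffle is not guaranteed
  println(result.collect().mkString(","))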

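4: cartesian() returns the Cartesian product of the two RDDs, that is, every pair (a, b) with a from the first RDD and b from the second.

  // sc is assumed to be an existing SparkContext (spark-shell provides one)
  val example1 = sc.parallelize(List(1, 2, 3))
  val example2 = sc.parallelize(List(3, 4, 5))
  // Pair every element of example1 with every element of example2
  val result = example1.cartesian(example2)
  // Nine pairs from (1,3) to (3,5); their order can depend on partitioning
  println(result.collect().mkString(","))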
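
To check the semantics of the four operations without a cluster, they can be modelled on plain Scala lists. This is only a rough sketch and not the Spark API: the object name TwoRddModel and its methods are hypothetical, and a List has a fixed order while an RDD after a shuffle does not.

```scala
// Local model of the four two-RDD transformations on plain Scala lists.
// NOT the Spark API: TwoRddModel and its method names are hypothetical.
object TwoRddModel {
  // union: concatenation of both inputs, duplicates kept
  def union[A](a: List[A], b: List[A]): List[A] = a ++ b

  // intersection: elements present in both inputs, deduplicated
  def intersection[A](a: List[A], b: List[A]): List[A] = {
    val inB = b.toSet
    a.distinct.filter(inB)
  }

  // subtract: elements of a that do not occur in b
  def subtract[A](a: List[A], b: List[A]): List[A] = {
    val inB = b.toSet
    a.filterNot(inB)
  }

  // cartesian: every pair (x, y) with x from a and y from b
  def cartesian[A, B](a: List[A], b: List[B]): List[(A, B)] =
    for (x <- a; y <- b) yield (x, y)

  def main(args: Array[String]): Unit = {
    val example1 = List(1, 2, 3)
    val example2 = List(3, 4, 5)
    println(union(example1, example2).mkString(","))        // 1,2,3,3,4,5
    println(intersection(example1, example2).mkString(",")) // 3
    println(subtract(example1, example2).mkString(","))     // 1,2
    println(cartesian(example1, example2).mkString(","))
  }
}
```

The model makes two details easy to see: union keeps the duplicate 3, while intersection returns it only once.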

posted @ 2017-07-04 08:47  reedom1991