March 2019 Archive

Summary: In addition to the Resilient Distributed Dataset (RDD) interface, the second kind of low level API in Spark is two types of “distributed shared variab Read more
posted @ 2019-03-04 10:36 DataNerd Views (343) Comments (0) Recommendations (0)
Summary: This chapter covers the advanced RDD operations and focuses on key–value RDDs, a powerful abstraction for manipulating data. We also touch on some mor Read more
posted @ 2019-03-04 10:03 DataNerd Views (355) Comments (0) Recommendations (0)