摘要:
Introduction 介绍两种共享变量的方式: accumulators:聚集信息 broadcast variables:高效地分布large values 介绍对高setup costs任务的批操作,比如查询数据库时连接数据的消耗。 > working on a per-partiton b 阅读全文
摘要:
Working with key/value Pairs Motivation Pair RDDs are a useful building block in many programs, as they expose operations that allow u to act on each 阅读全文