Spark 2.0 Study Notes
I. Mastering Apache Spark 2
https://www.gitbook.com/book/jaceklaskowski/mastering-apache-spark/details
II. Apache Spark 2 (Scala) Workshops
https://github.com/jaceklaskowski/spark-workshop/blob/gh-pages/slides/README.md#toc
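III. Three Spark deployment modes
1. Local: runs everything on the local machine, typically launched from an IDE; used for debugging.
2. Standalone: uses Spark's own Master/Worker processes to schedule work, independent of YARN resource management.
3. Spark on YARN: uses YARN for resource scheduling.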
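The three modes above are usually selected through the master URL given to Spark (for example via spark-submit --master or SparkConf.setMaster). The following is a minimal sketch of that mapping in plain Python, without Spark itself. The helper name master_url and its parameters are hypothetical and only for illustration; the host name is a placeholder, and 7077 is assumed as the standalone Master port because it is the usual default.

```python
def master_url(mode, host="localhost", port=7077, threads="*"):
    """Return the master URL string for one of the three deployment modes.

    Hypothetical helper for illustration; it is not part of Spark.
    """
    mode = mode.lower()
    if mode == "local":
        # local[*] uses as many worker threads as there are cores;
        # local[2] would use exactly two threads.
        return f"local[{threads}]"
    if mode == "standalone":
        # Points at Spark's own Master process (assumed default port 7077).
        return f"spark://{host}:{port}"
    if mode == "yarn":
        # The cluster location is read from the Hadoop/YARN configuration,
        # so the URL carries no host or port.
        return "yarn"
    raise ValueError(f"unknown deployment mode: {mode}")


if __name__ == "__main__":
    for m in ("local", "standalone", "yarn"):
        print(m, "->", master_url(m))
```

The resulting string is what would be passed as the master setting; in local mode the thread count can be narrowed, e.g. master_url("local", threads=2).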
IV. Spark Streaming documentation download
https://www.gitbook.com/download/pdf/book/jaceklaskowski/spark-streaming
The three StreamingContext states:
1. INITIALIZED: after the context has been instantiated.
2. ACTIVE: after it has been started.
3. STOPPED: after it has been stopped.
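The lifecycle described by these three states can be modelled as a small state machine. This is a sketch in plain Python, not Spark code: the class ContextLifecycle and its methods are hypothetical names. Two behaviors are assumptions rather than something these notes state: that a stopped context cannot be started again, and that starting an already active context is harmless.

```python
from enum import Enum


class State(Enum):
    INITIALIZED = "INITIALIZED"  # after instantiation
    ACTIVE = "ACTIVE"            # after start()
    STOPPED = "STOPPED"          # after stop()


class ContextLifecycle:
    """Toy model of the three-state lifecycle (hypothetical, not Spark's class)."""

    def __init__(self):
        self.state = State.INITIALIZED

    def start(self):
        # Assumption: a stopped context cannot be restarted.
        if self.state is State.STOPPED:
            raise RuntimeError("context has already been stopped")
        # Starting an already ACTIVE context is treated as a no-op.
        self.state = State.ACTIVE

    def stop(self):
        # Stopping is allowed from any state and always ends in STOPPED.
        self.state = State.STOPPED


if __name__ == "__main__":
    ctx = ContextLifecycle()
    print(ctx.state.name)
    ctx.start()
    print(ctx.state.name)
    ctx.stop()
    print(ctx.state.name)
```

In a real application the current state would be read from the streaming context itself rather than tracked by hand; the model only makes the allowed transitions explicit.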