知识文档汇总
python中的lambda表达式:https://blog.csdn.net/zjuxsl/article/details/79437563
python中没有reduce的问题:https://blog.csdn.net/zjuxsl/article/details/79437563
eclipse搭建连接HDFS:37.04高可用搭建理论(Av46726994,P37)
stringbuilder和stringbuffer的区别:https://blog.csdn.net/csxypr/article/details/92378336
数据倾斜:mapreduce中间会形成kvp,p会使数据随机分布到不同的reduce结点上去.
https://baike.baidu.com/item/%E6%95%B0%E6%8D%AE%E5%80%BE%E6%96%9C/4740858?fr=aladdin
reduce数量的设置:1、数据量 2、数据(key种类)
hive实现wordcount:https://blog.csdn.net/levy_cui/article/details/51142816
mysql join方法:https://blog.csdn.net/huatian5/article/details/80854455