优质博文链接
Kafka
Kafka streams : http://aseigneurin.github.io/2016/10/07/kafka-streams-scaling-up-or-down.html Spark Streaming+Kafka踩坑记:http://www.cnblogs.com/xlturing/p/6246538.html
Kafka tools工具使用:https://crazyof.me/blog/archives/1815.html
Spark
Spark Streaming从指定时间戳开始消费:http://www.klion26.com/spark-streaming-consume-kafka-message-from-specify-timestamp.html Spark性能相关参数配置:http://spark-config.readthedocs.io/en/latest/index.html#id2
Spark算子:http://homepage.cs.latrobe.edu.au/zhe/ZhenHeSparkRDDAPIExamples.html Spark Streaming direct approach(No receviers)分析,并且对比了recevier-based approach: http://www.jianshu.com/p/b4af851286e5 Spark Streaming 源码解析系列:
https://github.com/lw-lin/CoolplaySpark/tree/master/Spark%20Streaming%20%E6%BA%90%E7%A0%81%E8%A7%A3%E6%9E%90%E7%B3%BB%E5%88%97
机器学习
用python进行机器学习:http://blog.csdn.net/lsldd/article/category/2709209
lightGBM和xgboost实例对比: https://www.analyticsvidhya.com/blog/2017/06/which-algorithm-takes-the-crown-light-gbm-vs-xgboost/
Scala
yield用法:http://www.teahoo.net/IT/20161026908.html
集合方法大全:https://blog.csdn.net/bdmh/article/details/50070013 Josh Suereth大神写的scala教程:http://docs.scala-lang.org/zh-cn/overviews/core/implicit-classes.html with用法:http://hongjiang.info/scala-type-system-compund-type/
Scala类型系统,类型参数化和变化型注解:http://m.blog.csdn.net/article/details?id=50757095
ElasticSearch
Java
ForkJoinPool类并行处理:http://blog.csdn.net/xuguoli_beyondboy/article/details/44288047
python
python实现排序算法:http://python.jobbole.com/82270/
python数据类型详解(注意时间类型): http://www.cnblogs.com/linjiqin/p/3608541.html