摘要: https://avro.apache.org/docs/current/ Introduction Apache Avro™ is a data serialization system. Avro provides: Rich data structures. A compact, fast, 阅读全文
posted @ 2017-10-31 23:45 papering 阅读(195) 评论(0) 推荐(0) 编辑
摘要: https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/ArchivalStorage.html Introduction Archival Storage is a solution to decouple gr 阅读全文
posted @ 2017-10-31 23:38 papering 阅读(290) 评论(0) 推荐(0) 编辑
摘要: splittability CompressedStorage CompressedStorage CompressedStorage Skip to end of metadata Created by Confluence Administrator, last modified by Left 阅读全文
posted @ 2017-10-31 23:26 papering 阅读(301) 评论(0) 推荐(0) 编辑
摘要: http://cis.stvincent.edu/html/tutorials/swd/btree/btree.html Introduction A B-tree is a specialized multiway tree designed especially for use on disk. 阅读全文
posted @ 2017-10-31 21:51 papering 阅读(161) 评论(0) 推荐(0) 编辑
摘要: https://kafka.apache.org/intro.html 阅读全文
posted @ 2017-10-31 17:03 papering 阅读(153) 评论(0) 推荐(0) 编辑
摘要: https://kafka.apache.org/intro.html Kafka as a Messaging System How does Kafka's notion of streams compare to a traditional enterprise messaging syste 阅读全文
posted @ 2017-10-31 12:03 papering 阅读(189) 评论(0) 推荐(0) 编辑
摘要: limit 阅读全文
posted @ 2017-10-31 11:22 papering 阅读(146) 评论(0) 推荐(0) 编辑
摘要: rmds mapper 阅读全文
posted @ 2017-10-31 11:21 papering 阅读(113) 评论(0) 推荐(0) 编辑
摘要: https://spark.apache.org/sql/ Performance & Scalability Spark SQL includes a cost-based optimizer, columnar storage and code generation to make querie 阅读全文
posted @ 2017-10-31 00:10 papering 阅读(162) 评论(0) 推荐(0) 编辑