摘要:
https://hadoop.apache.org/docs/r2.7.2/hadoop-project-dist/hadoop-hdfs/HdfsDesign.html Introduction 【相对于POSIX,放宽要求】 The Hadoop Distributed File System 阅读全文
摘要:
Our goal is to foster an ecosystem of components and tools that relieve the burden of running applications in public and private clouds. https://kuber 阅读全文
摘要:
RocksDB RocksDB is a high performance[1][2][3][4][5] embedded database for key-value data. It is a fork of LevelDB which was then optimized to exploit 阅读全文
摘要:
https://github.com/google/snappy Introduction 【速度第一,压缩比适宜】 【favors speed over compression ratio】 Snappy is a compression/decompression library. It doe 阅读全文
摘要:
def combination_2_n(l): n, r = len(l), [] for i in range(0, n, 1): s = i + 1 for ii in range(s, n, 1): r.append([l[i], l[ii]]) return r # l1, l2 = [... 阅读全文
摘要:
m*n matrix m*n=1000 f(A)=25 https://www.cs.princeton.edu/courses/archive/spring12/cos598C/svdchapter.pdf finding the best k -dimensional subspace with 阅读全文
摘要:
https://hadoop.apache.org/docs/r1.2.1/hdfs_design.html Introduction The Hadoop Distributed File System (HDFS) is a distributed file system designed to 阅读全文
摘要:
布隆过滤器 http://pages.cs.wisc.edu/~cao/papers/summary-cache/node8.html A Bloom filter is a method for representing a set A of n elements (also called key 阅读全文
摘要:
https://chenli.ics.uci.edu/files/icde2017-AsterixDB-Spatial-Comparison.pdf A Comparative Study of Log-Structured Merge-Tree-Based Spatial Indexes for 阅读全文
摘要:
【粗暴的HIVE-SQL】select xyz from abc where ty='sdk' and ret_code=0 and data_source_type=1 and dt between '20171013' and '20171016' limit 1200000;一个手机几天就在同 阅读全文