Parquet Columnar Storage Structure
Summary: Apache Parquet is a columnar storage format available to any project in the Hadoop ecosystem, regardless of the choice of data processing framework, d
Avro Serialization
Summary: Official site: http://avro.apache.org/docs/current/ Introduction: Apache Avro™ is a data serialization system. Avro provides: Rich data structures. A compact, fa