摘要:
Overview Use HBase when u need random, realtime read/write access to ur Big Data. HBase is an open-source, distributed, versioned, non-relational data 阅读全文
摘要:
Overview 如果你了解过HDFS,至少看过这句话吧: HDFS is a filesystem designed for storing very large files with streaming or sequential data access patterns. That's to 阅读全文
摘要:
Operations upon Impala Create table stored as parquet like parquet '/user/etl/datafile1' stored as parquet Loading data shuffle / no shuffle to choose 阅读全文