2017年5月29日

摘要: Overview Use HBase when u need random, realtime read/write access to ur Big Data. HBase is an open-source, distributed, versioned, non-relational data 阅读全文
posted @ 2017-05-29 21:58 橘子不是唯一的水果 阅读(228) 评论(0) 推荐(0) 编辑
 
摘要: Overview 如果你了解过HDFS,至少看过这句话吧: HDFS is a filesystem designed for storing very large files with streaming or sequential data access patterns. That's to 阅读全文
posted @ 2017-05-29 20:30 橘子不是唯一的水果 阅读(1723) 评论(0) 推荐(0) 编辑
 
摘要: Operations upon Impala Create table stored as parquet like parquet '/user/etl/datafile1' stored as parquet Loading data shuffle / no shuffle to choose 阅读全文
posted @ 2017-05-29 10:38 橘子不是唯一的水果 阅读(593) 评论(0) 推荐(0) 编辑