[Hive_add_3] Simple Data Processing with Hive


 0. Overview

  Use Hive to run some simple processing on the duowan data.

 

 


1. Procedure

  1.1 Create the table

create table duowan(id int, name string, pass string, mail string, nickname string)
row format delimited
fields terminated by '\t' 
lines terminated by '\n' 
stored as textfile;
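
To confirm the table was created with the intended delimiters, the definition can be inspected from the same session. This is a minimal sketch using standard Hive statements; the exact output layout depends on the Hive version.

```sql
-- Show columns, storage format, and the field/line delimiters Hive recorded
describe formatted duowan;

-- Print the full DDL Hive stored for the table
show create table duowan;
```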

 

  1.2 Load the data

load data inpath '/duowan_user.txt' into table duowan;
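
Without the `local` keyword, `load data inpath` reads from HDFS and moves the file into the table's warehouse directory, so `/duowan_user.txt` no longer sits at its original path afterwards. The checks below are a sketch for verifying the load. The commented-out local path is a hypothetical example, not something from the original post.

```sql
-- Alternative when the file is on the local filesystem (hypothetical path):
-- load data local inpath '/tmp/duowan_user.txt' into table duowan;

-- Peek at a few rows to confirm the columns line up
select * from duowan limit 5;

-- Total number of rows loaded
select count(*) from duowan;

-- Fields that cannot be parsed as int come back as NULL, so a large
-- result here suggests the delimiter does not match the file
select count(*) from duowan where id is null;
```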

 

  1.3 Run the query

select pass, count(*) as cnt from duowan group by pass order by cnt desc limit 10;
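
The query above returns the ten most frequent values in the `pass` column. If the data contains blank entries, they may dominate the ranking. The variant below is a sketch that filters them out, assuming empty strings and NULLs should be ignored. The alias `cnt` is chosen here for illustration.

```sql
-- Top 10 most common non-empty passwords
select pass, count(*) as cnt
from duowan
where pass is not null and pass <> ''
group by pass
order by cnt desc
limit 10;
```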

 

  1.4 Set the number of reducers

set mapreduce.job.reduces=2;
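
The setting applies only to statements run after it in the same session, so it has to come before the query it should affect. A sketch of the order, assuming the MapReduce execution engine:

```sql
-- Request 2 reducers for subsequent jobs in this session
set mapreduce.job.reduces=2;

-- Print the current value to confirm it took effect
set mapreduce.job.reduces;

-- Re-run the aggregation; the group-by stage can now use 2 reducers
select pass, count(*) as cnt from duowan group by pass order by cnt desc limit 10;
```

Because `order by` produces a single globally sorted result, the final sort stage is typically still handled by one reducer regardless of this setting.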

 

 

 


 

posted @ 2018-12-25 15:07  山间一棵松