未来就是现在的延续,过去就是完成的现在

Hive

一、Hive安装(略)

二、数据导入

(1)数据的分隔符在创建表的时候指定:

create table if not exists test(name string,age int) row format delimited fields terminated by '\t';

指定之后在导入数据时就会以\t也就是tab分隔字串:

ngsan   30

lisi    50

wangwu  60

peiqi   6

(2)创建Hive外部表

CREATE external TABLE tt (

stime STRING,

assid STRING,

ssid STRING,

other STRING,

mac STRING,

servicecode STRING,

host STRING,

apname STRING,

termtype STRING

) ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' LOCATION '/logstash';

此时会自动创建元数据,无需load

(3)创建Hive内部表

CREATE external TABLE tt (

stime STRING,

assid STRING,

ssid STRING,

other STRING,

mac STRING,

servicecode STRING,

host STRING,

apname STRING,

termtype STRING

) ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n';

LOAD DATA INPATH 'hdfs://cdh01:8020/logstash/test.txt' overwrite INTO TABLE bigdata_odl.mac;

创建内部表时需要load数据才会创建元数据,而且删除了表则数据也删除了;注意: 'hdfs://cdh01:8020/logstash/test.txt' cdh连接,非cdh则为:'hdfs://logstash/test.txt'

 

posted @ 2020-08-11 10:39  lzj123  阅读(131)  评论(0编辑  收藏  举报