CDH5.16安装lzo
1.在CDH管理页面进入parcels,下载GPLEXTRAS
lintong@master:/opt/cloudera/parcel-repo$ ls | grep GPLEXTRAS GPLEXTRAS-5.16.2-1.cdh5.16.2.p0.8-xenial.parcel GPLEXTRAS-5.16.2-1.cdh5.16.2.p0.8-xenial.parcel.sha1
将sha1改成sha
sudo mv GPLEXTRAS-5.16.2-1.cdh5.16.2.p0.8-xenial.parcel.sha1 GPLEXTRAS-5.16.2-1.cdh5.16.2.p0.8-xenial.parcel.sha
如果parcels的哈希文件不存在,可以这样生成
sha1sum ./SPARK2-2.4.0.cloudera2-1.cdh5.13.3.p0.1041012-xenial.parcel | cut -d ' ' -f 1 > SPARK2-2.4..cloudera2-1.cdh5.13.3.p0.1041012-xenial.parcel.sha1
2.在界面上分配并激活
3.在HDFS配置的 io.compression.codecs 参数添加上
com.hadoop.compression.lzo.LzoCodec com.hadoop.compression.lzo.LzopCodec
参考文档
https://www.cloudera.com/documentation/enterprise/latest/topics/cm_mc_gpl_extras.html
4.在节点上安装lzo
sudo apt-get install liblzo2-2
参考文档
https://docs.cloudera.com/documentation/enterprise/latest/topics/cm_ig_install_gpl_extras.html#xd_583c10bfdbd326ba-3ca24a24-13d80143249--7ec6
4.YARN配置,在mapreduce.application.classpath中添加
/opt/cloudera/parcels/GPLEXTRAS/lib/hadoop/lib/*
5.重启,验证
create table test_table(id int,name string) set mapreduce.output.fileoutputformat.compress.codec=com.hadoop.compression.lzo.LzoCodec; set hive.exec.compress.output=true; set mapreduce.output.fileoutputformat.compress=true; set mapreduce.output.fileoutputformat.compress.type=BLOCK; insert overwrite table test_table select * from test_table;
本文只发表于博客园和tonglin0325的博客,作者:tonglin0325,转载请注明原文链接:https://www.cnblogs.com/tonglin0325/p/5321259.html