Summary:
For external tables: because the table is external, Hive doesn't assume it owns the data. Therefore, dropping an external table doesn't delete the data, although the metadata for the table is deleted (that is, an external table's metadata and its actual data are stored separately). Let's take a look: hive> set hive.cli.print.current.db = true; hive (defaul
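The ownership rule described above can be pictured with a small toy model. This is only a sketch in Python, not Hive's implementation: the `catalog` dictionary and the `create_table` and `drop_table` functions are hypothetical names standing in for the metastore and the DROP TABLE statement, and a temporary directory stands in for HDFS.

```python
import os
import shutil
import tempfile

# Hypothetical stand-in for the Hive metastore: table name -> metadata.
catalog = {}


def create_table(name, location, external):
    """Register table metadata; create the data directory if it is missing."""
    os.makedirs(location, exist_ok=True)
    catalog[name] = {"location": location, "external": external}


def drop_table(name):
    """Always drop the metadata; delete the data only for a managed table."""
    meta = catalog.pop(name)
    if not meta["external"]:
        shutil.rmtree(meta["location"])


root = tempfile.mkdtemp()  # stands in for HDFS in this toy model
managed_dir = os.path.join(root, "warehouse", "managed_t")
external_dir = os.path.join(root, "data", "external_t")

create_table("managed_t", managed_dir, external=False)
create_table("external_t", external_dir, external=True)

drop_table("managed_t")
drop_table("external_t")

managed_data_survives = os.path.exists(managed_dir)
external_data_survives = os.path.exists(external_dir)
print(managed_data_survives, external_data_survives)
```

In this model both tables disappear from the catalog, but only the managed table's directory is removed, which mirrors the behavior the summary describes.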
posted @ 2013-03-27 22:03
kalor
Summary:
The following table declaration creates an external table that can read all the data files for this comma-delimited data in /data/stocks: CREATE EXTERNAL TABLE IF NOT EXISTS stocks ( exchange STRING, symbol STRING, ymd STRING, price_open FLOAT, price_high FLOAT, price_low FLOAT, price_close FLO...
posted @ 2013-03-27 19:15
kalor
Summary:
Hadoop port reference (default port, where it is set, description):
8020, namenode: RPC port
8021, JT (JobTracker): RPC port
50030, mapred.job.tracker.http.address: JobTracker administrative web GUI (the JobTracker's HTTP server address and port)
50070, dfs.http.address: NameNode administrative web GUI (the NameNode's HTTP server address and port)
50010, dfs.datanode.address: DataNode control port (each DataNode listens o..
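As a rough illustration of how the port list above might be used, the following Python sketch probes each listed port with a plain TCP connection. The port numbers and descriptions come from the summary; the names `DEFAULT_PORTS`, `port_is_open`, and `check_defaults` are hypothetical, and a successful connection only shows that something is listening, not that it is the Hadoop daemon in question.

```python
import socket

# Default ports as listed in the summary above.
DEFAULT_PORTS = {
    8020: "NameNode RPC",
    8021: "JobTracker RPC",
    50030: "JobTracker web GUI",
    50070: "NameNode web GUI",
    50010: "DataNode control port",
}


def port_is_open(host, port, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def check_defaults(host="localhost"):
    """Map each listed port to a (description, reachable) pair."""
    return {p: (desc, port_is_open(host, p)) for p, desc in DEFAULT_PORTS.items()}
```

Calling `check_defaults("localhost")` on a node returns one entry per port, so a missing daemon shows up as a `False` in the second position.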
posted @ 2013-03-27 18:23
kalor
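Summary:
1. What jps does: jps is similar to the Linux ps command. The difference is that ps shows processes in general, while jps shows only Java processes; more precisely, it shows information about some of the Java processes started by the current user, namely the process ID and a short form of the process command. 2. A Java process is running, but jps does not show its process ID: I have run into this problem twice now, so I am summarizing it here. Symptom: ps -ef|grep java shows the running Java process, but jps does not list that process's ID. As the explanation below will make clear, in this situation jconsole and jvisualvm may be unable to monitor the process, and other tools bundled with Java may not work either. Analysis: after a Java program starts, by default (note: by default) it will, in /tmp/hsperf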
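The symptom above (a process visible to ps but not to jps) can be checked with a small Python sketch. It rests on an assumption that goes beyond the truncated excerpt: that HotSpot JVMs by default publish one file per process, named after the PID, in a directory called `hsperfdata_<user>` under the system temp directory, and that jps discovers processes from those files. The helper names `visible_java_pids` and `default_perf_dir` are hypothetical.

```python
import getpass
import os
import tempfile


def visible_java_pids(perf_dir):
    """Return the PIDs that have a perf-data file in perf_dir, sorted."""
    if not os.path.isdir(perf_dir):
        return []
    return sorted(int(name) for name in os.listdir(perf_dir) if name.isdigit())


def default_perf_dir():
    """Assumed HotSpot default: <tmpdir>/hsperfdata_<current user>."""
    return os.path.join(tempfile.gettempdir(), "hsperfdata_" + getpass.getuser())
```

Under that assumption, `visible_java_pids(default_perf_dir())` lists the processes jps would be able to see; a PID that appears in `ps -ef|grep java` but not in this list matches the symptom described above.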
posted @ 2013-03-27 18:08
kalor
Summary:
Problem 1: The program 'java' can be found in the following packages: * default-jre * gcj-4.6-jre-headless * gcj-4.7-jre-headless * openjdk-7-jre-headless * openjdk-6-jre-headless Try: sudo apt-...
posted @ 2013-03-27 17:02
kalor
Summary:
Alter Table: Most table properties can be altered with ALTER TABLE statements, which change metadata about the table but not the data itself. These statements can be used to fix mistakes in the schema, move partition locations, and so on. 1. Renaming a Table: hive> alter table log_messages rename to logm
posted @ 2013-03-26 22:00
kalor
Summary:
External Tables: However, managed tables are less convenient for sharing with other tools. For example, suppose we have data that is created and used primarily by Pig or other tools, and we want to run some queries against it without giving Hive ownership of the data. So we can define an external table
posted @ 2013-03-26 14:07
kalor
Summary:
Welcome to the world of Hive-0.10.0. Now we will use managed tables: 1. hive> create database if not exists dataprocess > comment 'analyze cluster data' > location '/home/landen/UntarFile/hive-0.10.0/user/hive/warehouse'; Notice: you can optionally specify a location for the t
posted @ 2013-03-25 22:08
kalor
Summary:
Hadoop configuration: http://www.cnblogs.com/viviman/archive/2013/01/14/2860201.html
Hive configuration: http://springsfeng.iteye.com/blog/1734517
posted @ 2013-03-24 21:25
kalor
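Summary:
Installing Hadoop 1.0.4 in pseudo-distributed mode on Ubuntu 12.10. I did the installation in a virtual machine. I will skip the Ubuntu 12.10 installation itself and just go through the simple steps for installing Hadoop 1.0.4 in pseudo-distributed mode. 1. Install the JDK. For the installation method, see my previous post: http://blog.csdn.net/mengxiangyue/article/details/8589308 Related issue: if /etc/profile is a read-only file, you need to add write permission: caipeichao@caipeichao-Lenovo-Q35:~/Software$ sudo chmod +w /etc/profile Then use the vi editor to edit /etc/profil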
posted @ 2013-03-24 12:10
kalor