kalor - 博客园

查看hive表中数据所在路径

摘要：对于外部表而言，Because it's external, Hive doesn't assume it owns the data. Therefore, dropping the external table doesn't delete the data, although the metadata for the table will be deleted(即外部表的metadata与real data是分开存放的).Let's 瞧瞧看：hive> set hive.cli.print.current.db = true;hive (defaul 阅读全文

posted @ 2013-03-27 22:03 kalor 阅读(8256) 评论(0) 推荐(0) 编辑

load数据到hive表中Success

摘要： The following table declaration creates an external table that can read all the data filesfor this comma-delimited data in /data/stocks:CREATE EXTERNAL TABLE IF NOT EXISTS stocks ( exchange STRING, symbol STRING, ymd STRING, price_open FLOAT, price_high FLOAT, price_low FLOAT, price_close FLO... 阅读全文

posted @ 2013-03-27 19:15 kalor 阅读(1049) 评论(0) 推荐(0) 编辑

hadoop相关端口详解

摘要： Hadoop端口说明默认端口设置位置描述信息8020 namenode RPC交互端口8021 JT RPC交互端口50030 mapred.job.tracker.http.address JobTracker administrative web GUIJOBTRACKER的HTTP服务器和端口50070 dfs.http.address NameNode administrative web GUINAMENODE的HTTP服务器和端口50010 dfs.datanode.address DataNode control port (each DataNode listens o.. 阅读全文

posted @ 2013-03-27 18:23 kalor 阅读(418) 评论(0) 推荐(0) 编辑

了解jps以及无法启动相关进程问题

摘要： 1、jps的作用jps类似linux的ps命令，不同的是ps是用来显示进程，而jps只显示java进程，准确的说是当前用户已启动的部分java进程信息，信息包括进程号和简短的进程command。2、某个java进程已经启动，用jps却显示不了该进程进程号这个问题已经碰到过两次了，所以在这里总结下。现象：用ps -ef|grep java能看到启动的java进程，但是用jps查看却不存在该进程的id。待会儿解释过之后就能知道在该情况下，jconsole、jvisualvm可能无法监控该进程，其他java自带工具也可能无法使用分析：java程序启动后，默认（请注意是默认）会在/tmp/hsperf 阅读全文

posted @ 2013-03-27 18:08 kalor 阅读(493) 评论(0) 推荐(0) 编辑

hadoop相关Exception

摘要： Problem 1: 程序 'java' 已包含在下列软件包中：* default-jre* gcj-4.6-jre-headless* gcj-4.7-jre-headless* openjdk-7-jre-headless* openjdk-6-jre-headless请尝试：sudo apt-... 阅读全文

posted @ 2013-03-27 17:02 kalor 阅读(1302) 评论(0) 推荐(1) 编辑

hive之alter table

摘要： Alter TableMost table properties can be altered with Alter Table statements, which change metadata about the table but not the data itself. These statements can be used to fix mistakes in schema, move partition locations and so on.1. Renamign a Table: hive> alter table log_messages rename to logm 阅读全文

posted @ 2013-03-26 22:00 kalor 阅读(1553) 评论(0) 推荐(0) 编辑

hive之external table创建

摘要： External Tables However, managed tables are less convinent for sharing with other tools. For example, suppose we have data that is created and used primarily by Pig or other tools, but we want to run some quries against it, but not give Hive ownership of the data. So we can define an external table 阅读全文

posted @ 2013-03-26 14:07 kalor 阅读(2130) 评论(0) 推荐(0) 编辑

hive之managed table创建

摘要： Welcome to the world of Hive-0.10.0, now we will use Managed Tables:1. hive> create database if not exists dataprocess > comment 'analyze cluster data' > location '/home/landen/UntarFile/hive-0.10.0/user/hive/warehouse';Notice: you can optionally specify a location for the t 阅读全文

posted @ 2013-03-25 22:08 kalor 阅读(810) 评论(0) 推荐(0) 编辑

hadoop配置一览表

摘要： Hadoop配置：http://www.cnblogs.com/viviman/archive/2013/01/14/2860201.htmlHive配置：http://springsfeng.iteye.com/blog/1734517 阅读全文

posted @ 2013-03-24 21:25 kalor 阅读(156) 评论(0) 推荐(0) 编辑

Ubuntu下hadoop-1.0.4安装过程

摘要： ubuntu12.10 安装hadoop1.0.4伪分布式我是使用的虚拟机安装的，安装ubuntu12.10那就不说了，说一下简单的安装hadoop1.0.4伪分布式的步骤。1、安装jdk安装方式参照我的上一篇文章http://blog.csdn.net/mengxiangyue/article/details/8589308相关问题：如果/etc/profile为only-read文件，此时需要加写权限：caipeichao@caipeichao-Lenovo-Q35:~/Software$ sudo chmod +w /etc/profile 然后用vi编辑器编辑/etc/profil 阅读全文

posted @ 2013-03-24 12:10 kalor 阅读(199) 评论(0) 推荐(0) 编辑

导航

2013年3月27日

2013年3月26日

2013年3月25日

2013年3月24日