代码改变世界

随笔分类 -  Hadoop

Hadoop Hive基础sql语法

2017-07-07 22:42 by GarfieldEr007, 1585 阅读, 收藏, 编辑
摘要: Hive 是基于Hadoop 构建的一套数据仓库分析系统,它提供了丰富的SQL查询方式来分析存储在Hadoop 分布式文件系统中的数据,可以将结构化的数据文件映射为一张数据库表,并提供完整的SQL查询功能,可以将SQL语句转换为MapReduce任务进行运行,通过自己的SQL 去查询分析需要的内容, 阅读全文

Hadoop Hive sql 语法详解

2017-07-07 22:41 by GarfieldEr007, 37301 阅读, 收藏, 编辑
摘要: Hive 是基于Hadoop 构建的一套数据仓库分析系统,它提供了丰富的SQL查询方式来分析存储在Hadoop 分布式文件系统中的数据,可以将结构化的数据文件映射为一张数据库表,并提供完整的SQL查询功能,可以将SQL语句转换为MapReduce任务进行运行,通过自己的SQL 去查询分析需要的内容, 阅读全文

hive参数——深入浅出学Hive

2017-07-07 22:38 by GarfieldEr007, 340 阅读, 收藏, 编辑
摘要: 第一部分:Hive 参数 hive.exec.max.created.files •说明:所有hive运行的map与reduce任务可以产生的文件的和 •默认值:100000 hive.exec.max.created.files •说明:所有hive运行的map与reduce任务可以产生的文件的和 阅读全文

Hive安装与配置——深入浅出学Hive

2017-07-07 22:35 by GarfieldEr007, 278 阅读, 收藏, 编辑
摘要: 第一部分:软件准备与环境规划 Hadoop环境介绍 •Hadoop安装路径 •/home/test/Desktop/hadoop-1.0.0/ •Hadoop 元数据存放目录 •/home/test/data/core/namenode •Hadoop 数据存放路径 •/home/test/data 阅读全文

Hive QL——深入浅出学Hive

2017-07-07 22:33 by GarfieldEr007, 395 阅读, 收藏, 编辑
摘要: 第一部分:DDL DDL DDL •建表 •删除表 •修改表结构 •创建/删除视图 •创建数据库 •显示命令 建表 CREATE [EXTERNAL] TABLE [IF NOT EXISTS] table_name [(col_name data_type [COMMENT col_comment 阅读全文

用Python语言写Hadoop MapReduce程序Writing an Hadoop MapReduce Program in Python

2016-03-15 21:36 by GarfieldEr007, 288 阅读, 收藏, 编辑
摘要: In this tutorial I will describe how to write a simple MapReduce program for Hadoop in the Python programming language.   Even though the Hadoop frame 阅读全文

运行Hadoop的示例程序WordCount-Running Hadoop Example

2016-03-15 21:26 by GarfieldEr007, 324 阅读, 收藏, 编辑
摘要:     In the last post we've installed Hadoop 2.2.0 on Ubuntu. Now we'll see how to launch an example mapreduce task on Hadoop. In the Hadoop directory 阅读全文

结合MapReduce和数据集Combining datasets with MapReduce

2016-03-15 21:24 by GarfieldEr007, 187 阅读, 收藏, 编辑
摘要: While in the SQL-world is very easy combining two or more datasets - we just need to use the JOIN keyword - with MapReduce things becomes a little har 阅读全文

实现Hadoop的Writable接口Implementing Writable interface of Hadoop

2016-03-15 21:23 by GarfieldEr007, 459 阅读, 收藏, 编辑
摘要: As we saw in the previous posts, Hadoop makes an heavy use of network transmissions for executing its jobs. As Doug Cutting (the creator of Hadoop) ex 阅读全文

更为详细的介绍Hadoop combiners-More about Hadoop combiners

2016-03-15 21:21 by GarfieldEr007, 294 阅读, 收藏, 编辑
摘要: Hadoop combiners are a very powerful tool to speed up our computations. We already saw what a combiner is in a previous post and we also have seen ano 阅读全文

计算均值mean的MapReduce程序Computing mean with MapReduce

2016-03-15 21:20 by GarfieldEr007, 271 阅读, 收藏, 编辑
摘要: In this post we'll see how to compute the mean of the max temperatures of every month for the city of Milan. The temperature data is taken from http:/ 阅读全文

Hadoop Combiners

2016-03-15 21:19 by GarfieldEr007, 166 阅读, 收藏, 编辑
摘要: In the last post and in the preceding one we saw how to write a MapReduce program for finding the top-n items of a data set. The difference between th 阅读全文

Top N之MapReduce程序加强版Enhanced MapReduce for Top N items

2016-03-15 21:18 by GarfieldEr007, 373 阅读, 收藏, 编辑
摘要: In the last post we saw how to write a MapReduce program for finding the top-n items of a dataset. The code in the mapper emits a pair key-value for e 阅读全文

Top N的MapReduce程序MapReduce for Top N items

2016-03-15 21:16 by GarfieldEr007, 554 阅读, 收藏, 编辑
摘要: In this post we'll see how to count the top-n items of a dataset; we'll again use the flatland book we used in a previous post: in that example we use 阅读全文

MapReduce模式MapReduce patterns

2016-03-15 21:15 by GarfieldEr007, 282 阅读, 收藏, 编辑
摘要: After having modified and run a job in the last post, we can now examine which are the most frequent patterns we encounter in MapReduce programming. A 阅读全文

为Hadoop创建JAR包文件Creating a JAR for Hadoop

2016-03-15 21:14 by GarfieldEr007, 517 阅读, 收藏, 编辑
摘要: We've seen the internals of MapReduce in the last post. Now we can make a little change to the WordCount and create a JAR for being executed by Hadoop 阅读全文

MapReduce任务分析与讨论MapReduce job explained

2016-03-15 21:11 by GarfieldEr007, 272 阅读, 收藏, 编辑
摘要: In the last post we saw how to run a MapReduce job on Hadoop. Now we're going to analyze how a MapReduce program works. And, if you don't know what Ma 阅读全文
点击右上角即可分享
微信分享提示