Hadoop Installation

1. Download and extract: hadoop-2.7.7.tar.gz
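The extraction step might look like the sketch below. The helper name extract_hadoop and the target directory are hypothetical, and the archive is assumed to be the gzip-compressed release tarball already downloaded to disk.

```shell
# extract_hadoop is a hypothetical helper: unpack a Hadoop tarball into a target directory.
extract_hadoop() {
  local tarball="$1" dest="${2:-.}"
  if [ ! -f "$tarball" ]; then
    echo "tarball not found: $tarball" >&2
    return 1
  fi
  mkdir -p "$dest"
  # -x extract, -z gunzip, -f archive file, -C switch to the target directory first
  tar -xzf "$tarball" -C "$dest"
}

# Example (assumed file name and location):
# extract_hadoop hadoop-2.7.7.tar.gz "$HOME/opt" && cd "$HOME/opt/hadoop-2.7.7"
```

The later steps use relative paths such as bin/hadoop and etc/hadoop, so they are meant to be run from inside the extracted directory.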

2. Edit the configuration file etc/hadoop/hadoop-env.sh

export JAVA_HOME=/usr/java/latest
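Before going further it can help to confirm that the JAVA_HOME value really points at a JDK. The following is a minimal sketch; check_java_home is a hypothetical helper, and /usr/java/latest is simply the path used in this post.

```shell
# check_java_home is a hypothetical helper: succeed only if DIR/bin/java is executable.
check_java_home() {
  local dir="$1"
  if [ -z "$dir" ]; then
    echo "JAVA_HOME is empty" >&2
    return 1
  fi
  if [ ! -x "$dir/bin/java" ]; then
    echo "no executable java under $dir/bin" >&2
    return 1
  fi
  echo "JAVA_HOME looks usable: $dir"
}

# Example: check the path configured in hadoop-env.sh.
# check_java_home /usr/java/latest
```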

3. Try the command

$ bin/hadoop
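Run without arguments, bin/hadoop prints its usage text. To confirm which release is installed, bin/hadoop version can be used instead; its first line has the form "Hadoop X.Y.Z". A small sketch that extracts just the number (hadoop_version_of is a hypothetical helper name):

```shell
# hadoop_version_of is a hypothetical helper: read `hadoop version` output on stdin
# and print only the version number from the leading "Hadoop X.Y.Z" line.
hadoop_version_of() {
  awk '$1 == "Hadoop" && NF >= 2 { print $2; exit }'
}

# Example: run from the Hadoop install directory.
# bin/hadoop version | hadoop_version_of
```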

4. Pseudo-distributed configuration

etc/hadoop/core-site.xml:

<configuration>
    <!-- namenode -->
    <property>
        <name>fs.defaultFS</name>
        <value>hdfs://localhost:7000</value>
    </property>
</configuration>

 

etc/hadoop/hdfs-site.xml:

<configuration>
    <property>
        <name>dfs.replication</name>
        <value>1</value>
    </property>
</configuration>
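For repeatable setups, the two files can also be generated from a script. This is only a sketch: write_pseudo_conf is a hypothetical helper, it overwrites any existing core-site.xml and hdfs-site.xml in the target directory, and the port defaults to the 7000 used in this post.

```shell
# write_pseudo_conf is a hypothetical helper: write core-site.xml and hdfs-site.xml
# for a pseudo-distributed setup into CONF_DIR, using PORT for fs.defaultFS.
write_pseudo_conf() {
  local conf_dir="$1" port="${2:-7000}"
  mkdir -p "$conf_dir"
  cat > "$conf_dir/core-site.xml" <<EOF
<configuration>
    <!-- namenode -->
    <property>
        <name>fs.defaultFS</name>
        <value>hdfs://localhost:${port}</value>
    </property>
</configuration>
EOF
  cat > "$conf_dir/hdfs-site.xml" <<EOF
<configuration>
    <property>
        <name>dfs.replication</name>
        <value>1</value>
    </property>
</configuration>
EOF
}

# Example: write into the install's config directory.
# write_pseudo_conf etc/hadoop 7000
```

Afterwards, bin/hdfs getconf -confKey fs.defaultFS prints the value Hadoop actually resolves, which is a quick way to confirm that the file was picked up.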

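 HDFS seems to default to 3 replicas, and the value 1 set here did not take effect. A likely reason: dfs.replication is read by the client that creates a file, so files written by a client that does not load this hdfs-site.xml still get the default of 3.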

 

5. Set up ssh

ssh localhost
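ssh localhost should log in without asking for a passphrase. If it prompts, a key can be set up roughly as follows. This sketch wraps the usual ssh-keygen and authorized_keys steps in a hypothetical helper, setup_passphraseless_ssh, and assumes an RSA key with an empty passphrase is acceptable on this machine.

```shell
# setup_passphraseless_ssh is a hypothetical helper: create an RSA key with an empty
# passphrase (if none exists) and authorize it for logins to this account.
setup_passphraseless_ssh() {
  local ssh_dir="${1:-$HOME/.ssh}"
  mkdir -p "$ssh_dir" && chmod 700 "$ssh_dir"
  # Generate a key only when none exists, so an existing key is never overwritten.
  if [ ! -f "$ssh_dir/id_rsa" ]; then
    ssh-keygen -q -t rsa -P '' -f "$ssh_dir/id_rsa" || return 1
  fi
  # Append the public key once; skip it if the exact line is already present.
  touch "$ssh_dir/authorized_keys"
  if ! grep -qxF -e "$(cat "$ssh_dir/id_rsa.pub")" "$ssh_dir/authorized_keys"; then
    cat "$ssh_dir/id_rsa.pub" >> "$ssh_dir/authorized_keys"
  fi
  chmod 600 "$ssh_dir/authorized_keys"
}

# Example: set up the current user's ~/.ssh, then test the login.
# setup_passphraseless_ssh && ssh localhost
```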

 

6. Format the filesystem

bin/hdfs namenode -format
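Formatting is meant to be done once; formatting again discards the existing NameNode metadata. A guard such as the sketch below can prevent an accidental second format. The helper names are hypothetical, and the default path assumes hadoop.tmp.dir and dfs.namenode.name.dir were left at their defaults (/tmp/hadoop-<user>/dfs/name).

```shell
# namenode_is_formatted is a hypothetical helper: a formatted NameNode directory
# contains current/VERSION.
namenode_is_formatted() {
  [ -f "$1/current/VERSION" ]
}

# format_namenode_once is a hypothetical helper: format only when the name
# directory has not been formatted yet.
format_namenode_once() {
  # Assumed default location; pass the real directory if it was configured elsewhere.
  local name_dir="${1:-/tmp/hadoop-$(id -un)/dfs/name}"
  if namenode_is_formatted "$name_dir"; then
    echo "already formatted: $name_dir"
    return 0
  fi
  bin/hdfs namenode -format
}

# Example: run from the Hadoop install directory.
# format_namenode_once
```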

 

7.Start NameNode daemon and DataNode daemon:

sbin/start-dfs.sh
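After start-dfs.sh returns, the JDK tool jps lists the running Java processes; in a pseudo-distributed setup NameNode, DataNode and SecondaryNameNode are expected to appear. A small sketch that reports which of them is absent (missing_daemons is a hypothetical helper name):

```shell
# missing_daemons is a hypothetical helper: read `jps` output ("PID ClassName" per line)
# on stdin and print each expected HDFS daemon that is not listed.
missing_daemons() {
  awk '
    { seen[$2] = 1 }   # exact class name, so SecondaryNameNode does not count as NameNode
    END {
      n = split("NameNode DataNode SecondaryNameNode", want, " ")
      for (i = 1; i <= n; i++)
        if (!(want[i] in seen)) print want[i]
    }
  '
}

# Example: no output means all three daemons are running.
# jps | missing_daemons
```

If a daemon is missing, its log file under the logs directory of the install is usually the first place to look.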

 

8. Hadoop default ports

(9000)   fs.defaultFS, a user-chosen port, e.g. hdfs://172.25.40.171:9000

9001    dfs.namenode.rpc-address, the port DataNodes connect to (a configured value; when unset, it falls back to the fs.defaultFS address)

50070   dfs.namenode.http-address

50470   dfs.namenode.https-address

50090   dfs.namenode.secondary.http-address, e.g. 172.25.39.166:50090

50091   dfs.namenode.secondary.https-address, e.g. 172.25.39.166:50091

50075   dfs.datanode.http.address (needed for data download)

50010   dfs.datanode.address, the DataNode data transfer port
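When scripting firewall rules or health checks, the list can be kept as a small lookup. The sketch below covers only the dfs.* keys that have fixed defaults in Hadoop 2.x; default_port_for is a hypothetical helper name, and the two user-chosen entries (fs.defaultFS and dfs.namenode.rpc-address) are left out on purpose.

```shell
# default_port_for is a hypothetical helper: map a dfs.* address key to its
# Hadoop 2.x default port.
default_port_for() {
  case "$1" in
    dfs.namenode.http-address)            echo 50070 ;;
    dfs.namenode.https-address)           echo 50470 ;;
    dfs.namenode.secondary.http-address)  echo 50090 ;;
    dfs.namenode.secondary.https-address) echo 50091 ;;
    dfs.datanode.http.address)            echo 50075 ;;
    dfs.datanode.address)                 echo 50010 ;;
    *) echo "no default port known for: $1" >&2; return 1 ;;
  esac
}

# Example:
# default_port_for dfs.namenode.http-address
```

To see the address a running install actually uses rather than the default, bin/hdfs getconf -confKey dfs.namenode.http-address prints the resolved value.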

 

 

 

9. View in a browser: localhost:50070

10. View in a terminal:

List a directory
bin/hadoop fs -ls /flumeResume/data/2021-11-23
Found 2 items
-rw-r--r--   3 nuctech supergroup       1995 2021-11-23 16:43 /flumeResume/data/2021-11-23/FlumeData.1637656971481
-rw-r--r--   3 nuctech supergroup       1308 2021-11-23 16:55 /flumeResume/data/2021-11-23/FlumeData.1637657653666

List a single file
bin/hadoop fs -ls /flumeResume/data/2021-11-23/FlumeData.1637657653666

View file contents

bin/hadoop fs -cat /flumeResume/data/2021-11-23/FlumeData.1637656971481
bin/hadoop fs -cat /flumeResume/data/2021-11-23/FlumeData.1637656971481 | head -n 10   # first 10 lines
bin/hadoop fs -cat /flumeResume/data/2021-11-23/FlumeData.1637656971481 | tail -n 5    # last 5 lines
bin/hadoop fs -cat /flumeResume/data/2021-11-23/FlumeData.1637656971481 | wc -l        # number of lines
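Since the same long path is repeated in every command, small wrappers can cut down on typing. This is a sketch only: hdfs_head, hdfs_line_count and the HADOOP_BIN variable are hypothetical names, not part of Hadoop.

```shell
# Path to the hadoop launcher; override HADOOP_BIN when not running from the install directory.
HADOOP_BIN="${HADOOP_BIN:-bin/hadoop}"

# hdfs_head is a hypothetical wrapper: print the first N lines (default 10) of an HDFS file.
hdfs_head() {
  "$HADOOP_BIN" fs -cat "$1" | head -n "${2:-10}"
}

# hdfs_line_count is a hypothetical wrapper: print the number of lines in an HDFS file,
# with the padding that some wc implementations add stripped off.
hdfs_line_count() {
  "$HADOOP_BIN" fs -cat "$1" | wc -l | tr -d ' '
}

# Example:
# f=/flumeResume/data/2021-11-23/FlumeData.1637656971481
# hdfs_head "$f" 10
# hdfs_line_count "$f"
```

Both wrappers stream the whole file through a pipe, so on very large files they take as long as a full read.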

Delete a file

bin/hadoop fs -rm /flumeResume/data/FlumeData.1638255829995

 

posted @ 2021-11-23 16:33  jason47