HA集群中namenode连接不上journalnode,导致namenode启动不了

查看日志发现一下的错误:

2018-10-08 15:29:26,373 FATAL org.apache.hadoop.hdfs.server.namenode.FSEditLog: Error: recoverUnfinalizedSegments failed for required journal (JournalAndStream(mgr=QJM to [192.168.135.71:8485, 192.168.135.72:8485, 192.168.135.73:8485], stream=null)) org.apache.hadoop.hdfs.qjournal.client.QuorumException: Got too many exceptions to achieve quorum size 2/3. 3 exceptions thrown: 192.168.135.72:8485: Call From mini2/192.168.135.72 to mini2:8485 failed on connection exception: java.net.ConnectException: 拒绝连接; For more details see: http://wi ki.apache.org/hadoop/ConnectionRefused 192.168.135.71:8485: Call From mini2/192.168.135.72 to mini1:8485 failed on connection exception: java.net.ConnectException: 拒绝连接; For more details see: http://wi ki.apache.org/hadoop/ConnectionRefused 192.168.135.73:8485: Call From mini2/192.168.135.72 to mini3:8485 failed on connection exception: java.net.ConnectException: 拒绝连接; For more details see: http://wi ki.apache.org/hadoop/ConnectionRefused

解决方法:

方法一:首先手动启动journalnode,再手动启动namenode

方法二:修改core-site.xml中的ipc参数

<property>
<name>ipc.client.connect.max.retries</name>
<value>100</value>
<description>
Indicates the number of retries a client will make to establisha server connection.
</description>
</property>
<property>
<name>ipc.client.connect.retry.interval</name>
<value>10000</value>
<description>Indicates the number of milliseconds a client will wait for
before retrying to establish a server connection.
</description>
</property>
posted @ 2018-11-22 20:04  Z、某某  阅读(4511)  评论(0编辑  收藏  举报