解决HBase 出现client.RpcRetryingCaller: Call exception, tries=11, retries=35的超时问题
22/03/01 19:20:31 INFO client.RpcRetryingCaller: Call exception, tries=11, retries=35, started=48502 ms ago, cancelled=false, msg=row 'nezha:nvwa_pfnl_prod,1970324918435862,99999999999999' on table 'hbase:meta' at region=hbase:meta,,1.1588230740, hostname=qingyun37,16020,1635079184806, seqNum=0
22/03/01 19:20:31 INFO client.RpcRetryingCaller: Call exception, tries=11, retries=35, started=48504 ms ago, cancelled=false, msg=row 'nezha:nvwa_pfnl_prod,1970324908406311,99999999999999' on table 'hbase:meta' at region=hbase:meta,,1.1588230740, hostname=qingyun37,16020,1635079184806, seqNum=0
22/03/01 19:20:31 INFO client.RpcRetryingCaller: Call exception, tries=11, retries=35, started=48508 ms ago, cancelled=false, msg=row 'nezha:nvwa_pfnl_prod,1970324917125296,99999999999999' on table 'hbase:meta' at region=hbase:meta,,1.1588230740, hostname=qingyun37,16020,1635079184806, seqNum=0
22/03/01 19:20:31 INFO client.RpcRetryingCaller: Call exception, tries=11, retries=35, started=48535 ms ago, cancelled=false, msg=row 'nezha:nvwa_pfnl_prod,1970324895377738,99999999999999' on table 'hbase:meta' at region=hbase:meta,,1.1588230740, hostname=qingyun37,16020,1635079184806, seqNum=0
22/03/01 19:20:31 INFO client.RpcRetryingCaller: Call exception, tries=11, retries=35, started=48577 ms ago, cancelled=false, msg=row 'nezha:nvwa_pfnl_prod,1970324938928593,99999999999999' on table 'hbase:meta' at region=hbase:meta,,1.1588230740, hostname=qingyun37,16020,1635079184806, seqNum=0
22/03/01 19:21:09 INFO client.RpcRetryingCaller: Call exception, tries=10, retries=35, started=38316 ms ago, cancelled=false, msg=row 'nezha:nvwa_pfnl_prod,1970324922617643,99999999999999' on table 'hbase:meta' at region=hbase:meta,,1.1588230740, hostname=qingyun37,16020,1635079184806, seqNum=0
22/03/01 19:21:09 INFO client.RpcRetryingCaller: Call exception, tries=10, retries=35, started=38255 ms ago, cancelled=false, msg=row 'nezha:nvwa_pfnl_prod,1970324936355843,99999999999999' on table 'hbase:meta' at region=hbase:meta,,1.1588230740, hostname=qingyun37,16020,1635079184806, seqNum=0
22/03/01 19:21:09 INFO client.RpcRetryingCaller: Call exception, tries=10, retries=35, started=38315 ms ago, cancelled=false, msg=row 'nezha:nvwa_pfnl_prod,1970325145704779,99999999999999' on table 'hbase:meta' at region=hbase:meta,,1.1588230740, hostname=qingyun37,16020,1635079184806, seqNum=0
22/03/01 19:21:09 INFO client.RpcRetryingCaller: Call exception, tries=10, retries=35, started=38334 ms ago, cancelled=false, msg=row 'nezha:nvwa_pfnl_prod,1970324939432971,99999999999999' on table 'hbase:meta' at region=hbase:meta,,1.1588230740, hostname=qingyun37,16020,1635079184806, seqNum=0
这种超时问题多半是hosts文件配置ip问题,但是仔细检查确定配置没有问题,最后发现是metrics-core的jar包问题,spark集群中的这个jar包是3.1.2版本,实际代码应该使用的是2.2.0版本,把代码里的2.2.0版本的这个jar包放入到SPARK_HOME/jars目录内问题解决,注意这里目前是既有3.1.2版本,又有2.2.0版本的,目前代码运行正常