redis集群故障无法自动提升slave
问题描述
生产redis集群(3master/3slave)部署在3台虚机上,每个虚机部署2个redis节点,挂了一台虚机导致redis集群异常,分析发现是挂了机器上是2master redis
redis日志
* MASTER <-> REPLICA sync started
# Error condition on socket for SYNC: Connection refused
* Connecting to MASTER x.12.73.126:4379
解决问题
m1、人工提供slave到master恢复集群
redis-cli //login
cluster nodes
//登陆slave节点执行故障转移,slave->master
cluster failover takeover
m2、备份master rdb,重新初始化redis集群然后导入rdb文件