恢复被驱逐的pod

因:该集群使用udd
果:io性能不足导致oom,引发etcd leader多次选举,使集群处于不稳定状态
解:解决方法是更换为ssd
这里我们先恢复被驱逐的pod
1.查询

kubectl get po -A -o wide | egrep -i Evicted

2.恢复

kubectl get po -A -o wide | egrep -i Evicted | awk '{print $1 " " $2}' | while read a b;do kubectl delete po -n $a $b;done

3.检查

kubectl get po -A -o wide | egrep -i -v running

参考:https://kubernetes.io/docs/concepts/scheduling-eviction/node-pressure-eviction/

posted @ 2022-04-18 11:58  Sureing  阅读(494)  评论(0编辑  收藏  举报