恢复被驱逐的pod
因:该集群使用udd
果:io性能不足导致oom,引发etcd leader多次选举,使集群处于不稳定状态
解:解决方法是更换为ssd
这里我们先恢复被驱逐的pod
1.查询
kubectl get po -A -o wide | egrep -i Evicted
2.恢复
kubectl get po -A -o wide | egrep -i Evicted | awk '{print $1 " " $2}' | while read a b;do kubectl delete po -n $a $b;done
3.检查
kubectl get po -A -o wide | egrep -i -v running
参考:https://kubernetes.io/docs/concepts/scheduling-eviction/node-pressure-eviction/