prometheus-operator监控traefik-Ingress组件状态
系统环境:
Prometheus Operator版本: 0.29 Kubernetes 版本: 1.14.0
一、Traefik 配置文件设置 Prometheus
要监控 Traefik 控制器,首先要控制 Traeik 将 Metrics 数据暴露出来,这需要在配置文件中加入下面配置:
[metrics]
[metrics.prometheus]
entryPoint = "traefik"
buckets = [0.1,0.3,1.2,5.0]
安装 Traefik 时候已经将配置文件外挂到 Kubernetes ConfigMap 中,详情可以参考 Kubernetes 部署 Traefik Ingress 一文。
例如,集群中将 Traefik 配置文件挂载到 Kubernetes ConfigMap 中,可以用 “kubectl etid” 命令编辑 Traefik 配置文件,加上 Prometheus 配置,这里提供本人完整配置如下:
$ kubectl edit ConfigMap traefik-config -n kube-system
apiVersion: v1
data:
traefik.toml: |
# traefik.toml
debug = true
InsecureSkipVerify = true
defaultEntryPoints = ["http","https"]
[entryPoints]
[entryPoints.http]
address = ":80"
compress = true
[entryPoints.https]
address = ":443"
compress = true
[entryPoints.https.tls]
[[entryPoints.https.tls.certificates]]
CertFile = "/ssl/tls.crt"
KeyFile = "/ssl/tls.key"
[entryPoints.traefik]
address = ":8080"
[kubernetes]
[traefikLog]
format = "json"
#filePath = "/data/traefik.log"
[accessLog]
#filePath = "/data/access.log"
format = "json"
[accessLog.filters]
retryAttempts = true
minDuration = "10ms"
[accessLog.fields]
defaultMode = "keep"
[accessLog.fields.names]
"ClientUsername" = "drop"
[accessLog.fields.headers]
defaultMode = "keep"
[accessLog.fields.headers.names]
"User-Agent" = "redact"
"Authorization" = "drop"
"Content-Type" = "keep"
[api]
entryPoint = "traefik"
dashboard = true
[metrics]
[metrics.prometheus]
entryPoint = "traefik"
buckets = [0.1,0.3,1.2,5.0]
二、Traefik Service 设置标签
Prometheus Operator 是通过 Label 匹配的,需要提前设置 Service 贴上“k8s-app: traefik-ingress”标签
1、查看 Traefik Service
$ kubectl get service -n kube-system
kube-dns ClusterIP 10.10.0.10 <none> 53/UDP,53/TCP,9153/TCP 79d
kubelet ClusterIP None <none> 10250/TCP 35d
traefik-ingress-service ClusterIP 10.10.114.105 <none> 80/TCP,443/TCP,8080/TCP 56d
2、编辑该 Service 设置 Label
编辑 Traefik Service
$ kubectl edit service traefik-ingress-service -n kube-system
设置 Label “k8s-app: traefik-ingress”
apiVersion: v1
kind: Service
metadata:
creationTimestamp: "2019-04-15T05:06:41Z"
name: traefik-ingress-service
namespace: kube-system
resourceVersion: "85575"
selfLink: /api/v1/namespaces/kube-system/services/traefik-ingress-service
uid: 4172b4df-5f3c-11e9-9287-000c29d98697
labels:
k8s-app: traefik-ingress #---增加标签 “k8s-app: traefik-ingress”
spec:
clusterIP: 10.10.114.105
ports:
- name: http
port: 80
protocol: TCP
targetPort: 80
- name: https
port: 443
protocol: TCP
targetPort: 443
- name: admin #---Prometheus metrics 数据是通过8080端口暴露的
port: 8080
protocol: TCP
targetPort: 8080
selector:
k8s-app: traefik-ingress-lb
sessionAffinity: None
type: ClusterIP
status:
loadBalancer: {}
三、Prometheus Operator 配置监控规则
配置服务监控资源,用于监控 Traefik 控制器:
traefik-monitor.yaml
apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
name: traefik-ingress
namespace: monitoring
labels:
k8s-app: traefik-ingress
spec:
jobLabel: k8s-app
endpoints:
- port: admin #---设置为traefik 8080端口名称 admin
interval: 30s
selector:
matchLabels:
k8s-app: traefik-ingress
namespaceSelector:
matchNames:
- kube-system
创建该Service Monitor
$ kubectl apply -f traefik-monitor.yaml
四、查看 Prometheus 规则
打开 Prometheus UI,查看 Prometheus 规则,可以看到 traefik 数据已经存在。
五、Grafana 引入仪表盘
打开 Grafana,在其中引入编号“4475”的仪表盘
然后就可以看到仪表盘
如果没有数据,请提前通过 Traefik Ingress 访问其配置的域名,刷新出一些数据,然后调整小时间范围。