prometheus告警配置注意事项
global:
scrape_interval: 15s # Set the scrape interval to every 15 seconds. Default is every 1 minute.
evaluation_interval: 15s # Evaluate rules every 15 seconds. The default is every 1 minute.
# scrape_timeout is set to the global default (10s).
# Alertmanager configuration
alerting:
alertmanagers:
- static_configs:
- targets: ['100.101.201.131:9093']
rule_files:
- 'rules.yml'
# - "second_rules.yml"
# Here it's Prometheus itself.
scrape_configs:
- job_name: 'beta' scrape_timeout: 14s #超时时间必须少于即间隔scrape_interval,这里是14s,小于15s relabel_configs: regex: "(.*),(.+),(.*)" #正则匹配 replacement: $2 #第二个 action: replace #动作为替换 target_label: "nodename" #目标的key为这个 consul_sd_configs: - server: '127.0.0.1:8501'
global: resolve_timeout: 2h route: group_by: ['alertname'] group_wait: 5s group_interval: 10s repeat_interval: 1h #告警发送到webhook的间隔时间 receiver: 'webhook' receivers: - name: 'webhook' webhook_configs: - url: 'http://127.0.0.1:5001/send' send_resolved: true
运维虐我千万遍,我对运维如初恋。
【推荐】还在用 ECharts 开发大屏?试试这款永久免费的开源 BI 工具!
【推荐】国内首个AI IDE,深度理解中文开发场景,立即下载体验Trae
【推荐】编程新体验,更懂你的AI,立即体验豆包MarsCode编程助手
【推荐】抖音旗下AI助手豆包,你的智能百科全书,全免费不限次数
【推荐】轻量又高性能的 SSH 工具 IShell:AI 加持,快人一步