Prometheus Alert Manager -- Difference between group_wait, group_interval, and repeat_interval
Definition
group_interval:group_interval dictates how long to wait
before sending notifications about new
alerts that are added to a group of alerts that have been alerted on
before。
repeat_interval:If there is nothing change in the Alert Group -- no new alert added or new old alert resolved, when will the next alert be sent out. 在Group没有发生更新的情况下重新发送通知的时间间隔
Test Setting
group_wait 1m
group_interval 10m
repeat_interval
60m
keep_firing_for 1d
Data
Retention Period 2d
Testing Plan and Result
Firing Time | 1st Send Time | 2nd Send Time | Last Send Time | |
---|---|---|---|---|
Incident A | T0 | |||
Incident B | T0 + 30s | T1 = T0 + 1m / T0 + 10m? | T1 + 10m / T1 + 60m ? |
Firing Time | Send Time | 2nd Send Time | Last Send Time | |
---|---|---|---|---|
Incident A | T0 | T0 + 1m | ||
Incident B | T0 + 5m | T1 = T0 + 6m or T0 + 10m? |
Firing Time | Send Time | 2nd Send Time | Last Send Time | |
---|---|---|---|---|
Incident A | T0 | T1 = T0 + 1m | ||
Incident B | T0 + 5m | T2 = T1 + 5m or T1 + 10m? | ||
Incident C | T0 + 30m | T3 = T0 + |
Reference:
Receiving alert more often then repeat interval · Issue #3326 · prometheus/alertmanager · GitHub