alerting: account for vmauth component for alerts ServiceDown and TooManyRestarts

Signed-off-by: hagen1778 <roman@victoriametrics.com>
hagen1778 2023-10-03 16:45:33 +02:00
parent 7b35eaa853
commit de651165bd
GPG key ID: 3BF75F3741CA9640
2 changed files with 4 additions and 2 deletions

@@ -7,7 +7,7 @@ groups:
     # note the `job` filter and update accordingly to your setup
     rules:
       - alert: TooManyRestarts
-        expr: changes(process_start_time_seconds{job=~"victoriametrics.*|vmselect.*|vminsert.*|vmstorage.*|vmagent.*|vmalert.*|vmsingle.*|vmalertmanager.*"}[15m]) > 2
+        expr: changes(process_start_time_seconds{job=~".*(victoriametrics|vmselect|vminsert|vmstorage|vmagent|vmalert|vmsingle|vmalertmanager|vmauth).*"}[15m]) > 2
         labels:
           severity: critical
         annotations:
@@ -16,7 +16,7 @@ groups:
             It might be crashlooping."
       - alert: ServiceDown
-        expr: up{job=~"victoriametrics.*|vmselect.*|vminsert.*|vmstorage.*|vmagent.*|vmalert.*|vmsingle.*|vmalertmanager.*"} == 0
+        expr: up{job=~".*(victoriametrics|vmselect|vminsert|vmstorage|vmagent|vmalert|vmsingle|vmalertmanager|vmauth).*"} == 0
         for: 2m
         labels:
           severity: critical
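
Note on the `job` filter change: label regex matchers in PromQL are fully anchored, so the old pattern only matched job names that start with one of the listed component names, while the new pattern matches any job name that contains one of them, and adds `vmauth` to the list. The sketch below is not part of the commit; it emulates the anchored matching with Python's `re.fullmatch` to compare the two patterns. The job names in `JOBS` are hypothetical examples, and the helpers `job_matches` and `compare` are illustrative names only.

```python
import re

# Old and new `job` filters, copied from the expressions in the diff.
OLD = (r"victoriametrics.*|vmselect.*|vminsert.*|vmstorage.*|"
       r"vmagent.*|vmalert.*|vmsingle.*|vmalertmanager.*")
NEW = (r".*(victoriametrics|vmselect|vminsert|vmstorage|vmagent|"
       r"vmalert|vmsingle|vmalertmanager|vmauth).*")

# Hypothetical job names, chosen only to illustrate the difference.
JOBS = ["vmauth", "vmagent", "prod-vmselect", "node-exporter"]


def job_matches(pattern: str, job: str) -> bool:
    """Emulate a fully anchored `=~` label matcher via re.fullmatch."""
    return re.fullmatch(pattern, job) is not None


def compare(jobs):
    """Map each job name to (matched by old filter, matched by new filter)."""
    return {job: (job_matches(OLD, job), job_matches(NEW, job)) for job in jobs}


if __name__ == "__main__":
    for job, (old, new) in compare(JOBS).items():
        print(f"{job:15} old={old} new={new}")
```

Under these assumptions, `vmauth` and the prefixed `prod-vmselect` are picked up only by the new filter, `vmagent` by both, and an unrelated job such as `node-exporter` by neither. As the comment in the rules file says, the `job` filter should still be adjusted to the job names used in a given setup.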

@@ -28,6 +28,8 @@ The sandbox cluster installation is running under the constant load generated by
 ## tip
+* FEATURE: [Alerting rules for VictoriaMetrics](https://github.com/VictoriaMetrics/VictoriaMetrics/tree/master/deployment/docker#alerts): account for `vmauth` component for alerts `ServiceDown` and `TooManyRestarts`.
 ## [v1.94.0](https://github.com/VictoriaMetrics/VictoriaMetrics/releases/tag/v1.94.0)
 Released at 2023-10-02