Commit 363c1c8

Redfish exporter: Decrease sensitivity of alert (#1358)
The redfish exporter is prone to failed scrapes. Let's wait for multiple failed scrapes before triggering an alert. This should still catch the case where it is completely dead, but reduce the false positives from failed scrapes.
1 parent 6f7051d commit 363c1c8

2 files changed: +16 -1 lines changed

etc/kayobe/kolla/config/prometheus/prometheus.rules

Lines changed: 10 additions & 1 deletion
@@ -7,14 +7,23 @@ groups:
   rules:
 
   - alert: PrometheusTargetMissing
-    expr: up == 0
+    expr: up{job!="redfish-exporter-seed"} == 0
     for: 5m
     labels:
       severity: critical
     annotations:
       summary: "Prometheus target missing (instance {{ $labels.instance }})"
       description: "A Prometheus target has disappeared. An exporter might have crashed."
 
+  - alert: PrometheusTargetMissing
+    expr: up{job="redfish-exporter-seed"} == 0
+    for: 15m
+    labels:
+      severity: critical
+    annotations:
+      summary: "Prometheus target missing (instance {{ $labels.instance }})"
+      description: "A Prometheus target has disappeared. An exporter might have crashed."
+
   - alert: PrometheusAllTargetsMissing
     expr: count by (job) (up) == 0
     for: 1m
Lines changed: 6 additions & 0 deletions
@@ -0,0 +1,6 @@
+---
+fixes:
+  - |
+    Changes the duration for which redfish exporter must continually fail
+    scrapes before triggering an alert to 15 minutes. This should hopefully
+    reduce some alert spam.
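
One way to check the new timing is a promtool rule unit test. The sketch below is not part of this commit. It assumes the rules file is available as plain YAML at the relative path `prometheus.rules` (if the deployed file is templated, point the test at the rendered output instead), and the `node` job and both instance names are hypothetical placeholders. It feeds in two targets that are down from time zero and checks that only the non-redfish target fires after 5 minutes, while the redfish target fires only once 15 minutes have passed.

```yaml
# Hypothetical test file, e.g. prometheus.rules.test.yml (name is an assumption).
rule_files:
  - prometheus.rules          # assumed path to the rendered rules file

evaluation_interval: 1m

tests:
  - interval: 1m
    input_series:
      # Both targets report up == 0 for 21 consecutive samples (0m to 20m).
      - series: 'up{job="redfish-exporter-seed", instance="seed.example:9610"}'
        values: '0+0x20'
      - series: 'up{job="node", instance="host1.example:9100"}'
        values: '0+0x20'

    alert_rule_test:
      # After 6 minutes only the generic rule (for: 5m) should be firing;
      # the redfish rule (for: 15m) is still pending.
      - eval_time: 6m
        alertname: PrometheusTargetMissing
        exp_alerts:
          - exp_labels:
              severity: critical
              job: node
              instance: host1.example:9100
            exp_annotations:
              summary: "Prometheus target missing (instance host1.example:9100)"
              description: "A Prometheus target has disappeared. An exporter might have crashed."

      # After 16 minutes both rules should be firing.
      - eval_time: 16m
        alertname: PrometheusTargetMissing
        exp_alerts:
          - exp_labels:
              severity: critical
              job: node
              instance: host1.example:9100
            exp_annotations:
              summary: "Prometheus target missing (instance host1.example:9100)"
              description: "A Prometheus target has disappeared. An exporter might have crashed."
          - exp_labels:
              severity: critical
              job: redfish-exporter-seed
              instance: seed.example:9610
            exp_annotations:
              summary: "Prometheus target missing (instance seed.example:9610)"
              description: "A Prometheus target has disappeared. An exporter might have crashed."
```

Under those assumptions the test would be run with `promtool test rules prometheus.rules.test.yml`. Because both rules share the alert name `PrometheusTargetMissing`, the `job` label is what distinguishes the 5-minute and 15-minute cases in the expected alerts.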
