Skip to content

Commit 03bf3f6

Browse files
authored
Merge pull request #1033 from stackhpc/bond-degraded-alert
Raise alert on degraded network bonds
2 parents a8fb2c8 + c2edb38 commit 03bf3f6

File tree

2 files changed

+14
-0
lines changed

2 files changed

+14
-0
lines changed

etc/kayobe/kolla/config/prometheus/system.rules

Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -96,6 +96,15 @@ groups:
9696
summary: Host clock not synchronising (instance {{ $labels.instance }})
9797
description: "Clock not synchronising. Ensure NTP is configured on this host."
9898

99+
- alert: HostNetworkBondDegraded
100+
expr: (node_bonding_active - node_bonding_slaves) != 0
101+
for: 2m
102+
labels:
103+
severity: warning
104+
annotations:
105+
summary: Host network bond degraded (instance {{ $labels.instance }})
106+
description: "Bond {{ $labels.master }} degraded on {{ $labels.instance }}"
107+
99108
- alert: HostConntrackLimit
100109
expr: node_nf_conntrack_entries / node_nf_conntrack_entries_limit > 0.8
101110
for: 5m
Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,5 @@
1+
---
2+
features:
3+
- |
4+
Adds a new Prometheus alert ``HostNetworkBondDegraded`` which will be
5+
raised when at least one bond member is down.

0 commit comments

Comments
 (0)