Commit e183052

fix: use rabbitmq length for RabbitMQNodeDown
The `RabbitMQNodeDown` alert assumed that every deployment has exactly three RabbitMQ nodes. This is not always the case, as we also support deployments with a single node or with more than three. Previously, the hard-coded threshold caused false alerts in deployments with fewer than three RabbitMQ nodes, such as a single-node deployment, and concealed node failures in deployments with more than three nodes.
1 parent d1f423f commit e183052

2 files changed, +7 -1 lines changed

etc/kayobe/kolla/config/prometheus/rabbitmq.rules

Lines changed: 1 addition & 1 deletion
@@ -6,7 +6,7 @@ groups:
 - name: rabbitmq.rules
   rules:
   - alert: RabbitMQNodeDown
-    expr: sum(rabbitmq_build_info{instance!=""}) < 3
+    expr: sum(rabbitmq_build_info{instance!=""}) < {% endraw %}{{ groups['rabbitmq'] | length }}{% raw %}
     for: 30m
     labels:
       severity: critical
Lines changed: 6 additions & 0 deletions
@@ -0,0 +1,6 @@
+---
+features:
+  - |
+    Use the length of the ``rabbitmq`` group to determine if any RabbitMQ
+    nodes are down. This is beneficial for deployments that do not use a
+    standard three node setup.
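
The effect of the threshold change described in the commit message can be sketched in plain Python. This is a simplified model, not the rule itself: it only mirrors the `<` comparison in `expr`, ignores the `for: 30m` hold period, and does not model the case where Prometheus returns no series at all. The function names and the inventories (`ctrl0` and so on) are hypothetical and exist only for illustration.

```python
def threshold_from_inventory(groups):
    """Python equivalent of the Jinja2 expression groups['rabbitmq'] | length."""
    return len(groups["rabbitmq"])


def node_down_alert_fires(reporting_nodes, threshold):
    """Mirror the rule's comparison: sum(rabbitmq_build_info{instance!=""}) < threshold."""
    return reporting_nodes < threshold


# Hypothetical inventories, for illustration only.
single_node = {"rabbitmq": ["ctrl0"]}
five_nodes = {"rabbitmq": ["ctrl%d" % i for i in range(5)]}

OLD_THRESHOLD = 3  # the value hard-coded before this commit

# Healthy single-node deployment: the old rule alerts, the new one does not.
print(node_down_alert_fires(1, OLD_THRESHOLD))                          # True
print(node_down_alert_fires(1, threshold_from_inventory(single_node)))  # False

# Five-node deployment with one node down (four still reporting):
# the old rule stays silent, the new one alerts.
print(node_down_alert_fires(4, OLD_THRESHOLD))                          # False
print(node_down_alert_fires(4, threshold_from_inventory(five_nodes)))   # True
```

Because the threshold is computed when the rules file is templated, it follows the inventory: adding or removing hosts in the `rabbitmq` group changes the rendered value the next time the configuration is generated.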
