Hi,
I have alertmanager cluster setup with 12 nodes set up in a cluster mode
with consistent 12 cluster members.
I am getting multiple alerts firing for some alerts. Looking at counts of
changes(ALERTS_FOR_STATE)
on these alerts is way less than actual events firing out of alertmanager.
I am storing events fired in a database with timestamps.
The fingerprint counts on these alerts is way more than state change as
shared in the dashboard.
I am using following expression in the alerting rule:
(sum(last_over_time(..{}[2h]) / last_over_time(...) * 100) > 90
Any pointers to debug this issue is highly appreciated!
--
You received this message because you are subscribed to the Google Groups
"Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
To view this discussion on the web visit
https://groups.google.com/d/msgid/prometheus-users/77a75681-9252-41b2-bea6-26c56e6fcf59n%40googlegroups.com.