sentinel: Add Prometheus metrics #656

benwh · 2019-05-31T16:16:44Z

These metrics provide the ability to build alerts that tell us whether the sentinels are operating as expected:

The last time that the sentinel successfully processed the clusterdata.
Whether the sentinel is a leader.
The number of times that the sentinel has been elected leader.

This follows on from the keeper metrics in #639.

We've found these sentinel metrics to be extremely useful, upon discovering that our sentinels could occasionally all become stuck - and no decisions would be made across the cluster - when there are issues communicating with etcd (we're planning to upstream a patch for that issue in the near future!)

benwh · 2019-05-31T16:17:30Z

This is what the metrics look like in action, with 3 sentinels running:

sgotti

@benwh Thanks for your PR! It LGTM, just a small nit in the comments.

sgotti · 2019-06-03T08:40:14Z

cmd/sentinel/cmd/metrics.go

@@ -0,0 +1,73 @@
+// Copyright 2017 Sorint.lab


s/2017/2019

Good spot - fixed!

sgotti · 2019-06-03T15:06:43Z

@benwh Can you please squash in a single commit?

These metrics provide the ability to build alerts that tell us whether the sentinels are operating as expected: - The last time that the sentinel successfully processed the clusterdata. - Whether the sentinel is a leader. - The number of times that the sentinel has been elected leader.

benwh · 2019-06-03T15:11:50Z

@sgotti I'd originally kept it separate as it was also changing a file outside of the scope of this PR. But happy to do so, squashed now.

sgotti · 2019-06-03T16:06:25Z

@benwh Oh I haven't noticed that you also changed the keeper metrics file. Anyway it's not a big styling issue. I'm going to merge it. Thanks again!

benwh · 2019-06-03T16:08:40Z

Excellent, thanks very much for the speedy review!

sgotti requested changes Jun 3, 2019

View reviewed changes

benwh mentioned this pull request Jun 3, 2019

Add sentinel metrics to playground gocardless/stolon-pgbouncer#54

Merged

benwh force-pushed the sentinel-metrics branch from ca66559 to eabdd33 Compare June 3, 2019 15:08

sgotti merged commit 259ef10 into sorintlab:master Jun 3, 2019

benwh deleted the sentinel-metrics branch June 3, 2019 16:08

sgotti added this to the v0.14.0 milestone Jun 6, 2019

sgotti mentioned this pull request Jun 7, 2019

Expose metrics related to stolon cluster for prometheus #603

Closed

sgotti mentioned this pull request Mar 19, 2020

metrics exposes only golang metrics #768

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

sentinel: Add Prometheus metrics #656

sentinel: Add Prometheus metrics #656

Uh oh!

benwh commented May 31, 2019

Uh oh!

benwh commented May 31, 2019

Uh oh!

sgotti left a comment

Uh oh!

sgotti Jun 3, 2019

Uh oh!

benwh Jun 3, 2019

Uh oh!

sgotti commented Jun 3, 2019

Uh oh!

benwh commented Jun 3, 2019

Uh oh!

sgotti commented Jun 3, 2019

Uh oh!

benwh commented Jun 3, 2019

Uh oh!

Uh oh!

sentinel: Add Prometheus metrics #656

sentinel: Add Prometheus metrics #656

Uh oh!

Conversation

benwh commented May 31, 2019

Uh oh!

benwh commented May 31, 2019

Uh oh!

sgotti left a comment

Choose a reason for hiding this comment

Uh oh!

sgotti Jun 3, 2019

Choose a reason for hiding this comment

Uh oh!

benwh Jun 3, 2019

Choose a reason for hiding this comment

Uh oh!

sgotti commented Jun 3, 2019

Uh oh!

benwh commented Jun 3, 2019

Uh oh!

sgotti commented Jun 3, 2019

Uh oh!

benwh commented Jun 3, 2019

Uh oh!

Uh oh!