Problem with promtheus, cadvisor and node exporter in docker swarm with no Leader role node #16596
Unanswered
ImanoLCarbonell
asked this question in
Q&A
Replies: 1 comment
-
This seems like a support request, converting it into a discussion. Also, you maybe need to reach out to the swarm community, as |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
What did you do?
Hello everyone !! I recently encountered the following problem with promtheus, cadvisor and node exporter on the docker swarm clusters we have.
I have Prometheus, Cadvisor and Node Exporter deployed in the clusters as services to collect information about the containers and hosts. This Prometheus is federated with another one that we have on another host to centralize all the metrics in one and in this way, have a single graph to be able to see them.
This architecture only works well for us if we deploy the prometheus, cadvisor and node exporter stack on the node that has the Leader role.
When, for any reason, this node loses the Leader role and therefore, the containers are deployed on another node, we stop receiving metrics from the cadvisor and node exporter of the new node that has the Leader role and lines appear in the Prometheus logs indicating that it cannot connect to the cadvisor and node exporter.
We do not understand the reason for this problem.
Has anyone ever had this happen to them? Do you know the cause of this problem?
In this scheme it works well.
Not in this one
What did you expect to see?
These are the metrics of Prometheus when it works well.
What did you see instead? Under which circumstances?
These are the metrics of Prometheus when it does not work well when deployed with this scheme.
System information
Linux 3.10.0-1160.118.1.el7.x86_64 x86_64
Prometheus version
Prometheus configuration file
Alertmanager version
No response
Alertmanager configuration file
No response
Logs
Beta Was this translation helpful? Give feedback.
All reactions