-
Notifications
You must be signed in to change notification settings - Fork 3.4k
health: add ICMP probe retries #36023
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
julianwiedmann
merged 2 commits into
cilium:main
from
pippolo84:pr/pippolo84/icmp-probe-retries
Dec 2, 2024
Merged
health: add ICMP probe retries #36023
julianwiedmann
merged 2 commits into
cilium:main
from
pippolo84:pr/pippolo84/icmp-probe-retries
Dec 2, 2024
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
c1509f0
to
2e0ba1a
Compare
2e0ba1a
to
4aaf676
Compare
/test |
ttarczynski
reviewed
Nov 20, 2024
squeed
approved these changes
Nov 20, 2024
squeed
reviewed
Nov 20, 2024
4aaf676
to
8fea346
Compare
@ttarczynski updated helm value name as well as Cilium option name (I prefer to keep it consistent), PTAL |
ttarczynski
approved these changes
Nov 21, 2024
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
thorn3r
approved these changes
Nov 21, 2024
marseel
reviewed
Nov 22, 2024
8fea346
to
ed90302
Compare
marseel
approved these changes
Nov 25, 2024
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks!
ttarczynski
reviewed
Nov 25, 2024
ed90302
to
3f780bf
Compare
/test |
To check the connectivity status of other Cilium nodes, we rely on both HTTP and ICMP probes. In both cases, a single request is sent to assess the health status of the peer. In certain environments, marking a node or an endpoint as unreachable after a single ICMP response is lost may be too aggressive, ultimately leading to the reported status flapping between "OK" and "Unreachable". The commit adds an option to configure the number of ICMP requests sent before marking a node or endpoint as unreachable. By default, the value is set to 3. Signed-off-by: Fabio Falzoi <fabio.falzoi@isovalent.com>
Add an option to set the number of ICMP requests to send when checking the health of a node or and endpoint, before marking them as unreachable. Signed-off-by: Fabio Falzoi <fabio.falzoi@isovalent.com>
3f780bf
to
5d0a509
Compare
/test |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
affects/v1.14
This issue affects v1.14 branch
affects/v1.15
This issue affects v1.15 branch
affects/v1.16
This issue affects v1.16 branch
area/health
Relates to the cilium-health component
area/helm
Impacts helm charts and user deployment experience
backport/author
The backport will be carried out by the author of the PR.
ready-to-merge
This PR has passed all tests and received consensus from code owners to merge.
release-note/minor
This PR changes functionality that users may find relevant to operating Cilium.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
To check the connectivity status of other Cilium nodes, we rely on both HTTP and ICMP probes. In both cases, a single request is sent to assess the health status of the peer. In certain environments, marking a node or an endpoint as unreachable after a single ICMP response is lost may be too aggressive, ultimately leading to the reported status constantly flapping between "OK" and "Unreachable".
The PR adds an option (
--health-check-icmp-failure-threshold
) to configure the number of ICMP requests sent before marking a node or endpoint as unreachable. By default, this value is kept to 1, as it was before.Fixes #35996