-
Notifications
You must be signed in to change notification settings - Fork 3.4k
cilium-dbg: fix status commands for cluster connectivity health #33972
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
3a6dcf9
to
1892fe1
Compare
1892fe1
to
af3dd44
Compare
Converted back to draft as I work on adding tests for the |
5163aa8
to
c6fb670
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@darox Nice work!
@derailed I pushed the changes you've requested. I will rebase once you're OK with the changes. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@darox Thank you for the updates!
/test |
@darox seems like some tests are failing, would you mind rebasing PR, hopefully it fixes it 😃 ? |
I need a bit more time, because there's now a conflict:
|
1f5eadc
to
d170d99
Compare
@marseel I rebased and fixed the conflicts, let's see if it goes through. |
d170d99
to
42379c1
Compare
6f10193
to
a3369bd
Compare
I don't see how this PR would affect the failing CI task. |
I've rerun test, let's see if it was just a flake (probably) |
passed |
@ldelossa can we get it merged? |
/test |
1 similar comment
/test |
This commit fixes several issues that were reported in the status commands of cilium-dbg. - Fix the issue where not up to 10 unreachable nodes were shown in the status command. - Fix the issue where `--all-nodes` didn't return all nodes regardless of their status. - `--all-health` now returns the health of all modules regardless of their status. This can be paired with `--all-nodes` to get the health of all nodes. - `--verbose` now shows the latency to nodes and endpoints. Signed-off-by: darox <maderdario@gmail.com>
a3369bd
to
4efb41f
Compare
/test |
@ldelossa would you mind taking a look or should we reassign? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Backport to v1.16 in #35468 lead to non-trivial merge conflicts, this will need a manual backport. Marking as |
What are the steps I have to take, to get this backported to v1.16? |
Hey @darox 👋 Are you still willing to take care of this author backport? If so, the easiest approach is probably to:
Feel free to reach out to me if you have questions. |
@giorio94 If I will find time, I will take a look. |
I had some time to check the code and I think we first need to apply fee6107 before I can apply my commit. My whole commit is based on the former commit. Now, the question is, if we can apply fee6107 because it deprecates some metrics. This commit was first applied on 1.17. I pushed it to #37463 which fixes it. But someone will have to decide, whether that's fine. |
That commit introduces a new feature, which is typically not eligible for being backported, and IMO it would come with a significant risk for stable releases. If that commit is a strict requirement to backport this PR, my feeling is that it would be better skipping this backport, considering that it fixes a CLI command that most likely always suffered from this bug. |
Yes, I would agree with that. |
Ack, labels dropped. |
This commit fixes serveral issues that were
reported in the status commands of cilium-dbg.
--all-nodes
didn't return all nodes regardless of their status.--all-health
now returns the health of all modules regardless of their status. This can be paired with--all-nodes
to get the health of all nodes.--verbose
now shows the latency to nodes and endpoints.Fixes: #33697