-
Notifications
You must be signed in to change notification settings - Fork 3.4k
Add counter to track all datapath timeouts due to FQDN IP updates #19809
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add counter to track all datapath timeouts due to FQDN IP updates #19809
Conversation
/test |
/test-gke |
/ci-nat46x64 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
One small comment on metric metrics to conform with promlint, the rest looks good to me.
GHA conformance test with metric validation might not catch such issue, as unlikely we have all possible metrics, the lack of unit test on this will be tracked in another issue.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Marked as request to change so that maintainer bot will not add ready to merge till small change in metric name is done.
Thanks.
/test @sayboras -- should be ready for another pass. Job 'Cilium-PR-K8s-1.23-kernel-net-next' failed: Click to show.Test Name
Failure Output
If it is a flake and a GitHub issue doesn't already exist to track it, comment |
/test-1.23-net-next Job 'Cilium-PR-K8s-1.23-kernel-net-next' failed: Click to show.Test Name
Failure Output
If it is a flake and a GitHub issue doesn't already exist to track it, comment |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks and LGTM 💯
Marking this ready to merge, as the tests were successfully ran before minor change with metric name. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could you squash the commits? No need for them to be separate. Thanks!
Signed-off-by: Vlad Ungureanu <ungureanuvladvictor@gmil.com>
30df9a6
to
d771086
Compare
/test @christarazi -- squashed + triggered the tests. |
Can we consider this PR for backporting? Would be very nice to have, and doesn't seem to depend on anything new. |
I think it's a good idea for debugging / observability, so why not. |
The code is logging when this is happening but it would be useful to have them exposed as metrics to put them on a dashboard and to correlate with other events in the system.
Tagging @christarazi for SA.