christarazi
Member

An endpoint can be deleted while it is still being restored. If the
restoration process is waiting for global identities to be synced when
the deletion occurs, the wait for the sync is canceled. This triggers an
unnecessary warning log. To fix this, detect that the endpoint was
deleted, which we can tell by checking whether the context was canceled.

See #40924 for more details.

Fixes: #40924
Signed-off-by: Joe Stringer <joe@cilium.io>
Signed-off-by: Chris Tarazi <chris@isovalent.com>

@christarazi christarazi added release-note/ci This PR makes changes to the CI. area/agent Cilium agent related. labels Aug 4, 2025
@christarazi christarazi marked this pull request as ready for review August 4, 2025 20:01
@christarazi christarazi requested a review from a team as a code owner August 4, 2025 20:01
@christarazi christarazi requested a review from fristonio August 4, 2025 20:01
@christarazi christarazi added kind/enhancement This would improve or streamline existing functionality. area/CI-improvement Topic or proposal to improve the Continuous Integration workflow labels Aug 4, 2025
@christarazi
Member Author

/test

Member

@joestringer joestringer left a comment

LGTM, thanks!

@joestringer
Member

joestringer commented Aug 4, 2025

I'm not sure whether v1.18 is affected by this, and therefore whether to mark it for v1.18 backport. I saw this failure condition for the first time today. I don't know whether we are seeing it now because we changed something about startup ordering, or whether it has always been possible and is just rare enough that we don't often see it.

@joestringer joestringer added the needs-backport/1.18 This PR / issue needs backporting to the v1.18 branch label Aug 4, 2025
@joestringer
Member

I'm OK with backporting this as it's just changing the logging for this condition.

@christarazi christarazi force-pushed the pr/christarazi/fix-40924 branch from 7536c06 to 6d4893c Compare August 5, 2025 16:57
@christarazi christarazi force-pushed the pr/christarazi/fix-40924 branch from 6d4893c to d0f8767 Compare August 5, 2025 16:58
@christarazi
Member Author

christarazi commented Aug 5, 2025

/test

Edit: #39370

@maintainer-s-little-helper maintainer-s-little-helper bot added the ready-to-merge This PR has passed all tests and received consensus from code owners to merge. label Aug 5, 2025
@christarazi christarazi added this pull request to the merge queue Aug 5, 2025
Merged via the queue into cilium:main with commit 534a71f Aug 5, 2025
68 checks passed
@christarazi christarazi deleted the pr/christarazi/fix-40924 branch August 5, 2025 20:53
@rastislavs rastislavs mentioned this pull request Aug 6, 2025
17 tasks
@rastislavs rastislavs added backport-pending/1.18 The backport for Cilium 1.18.x for this PR is in progress. and removed needs-backport/1.18 This PR / issue needs backporting to the v1.18 branch labels Aug 6, 2025
@github-actions github-actions bot added backport-done/1.18 The backport for Cilium 1.18.x for this PR is done. and removed backport-pending/1.18 The backport for Cilium 1.18.x for this PR is in progress. labels Aug 8, 2025
Labels
area/agent Cilium agent related. area/CI-improvement Topic or proposal to improve the Continuous Integration workflow backport-done/1.18 The backport for Cilium 1.18.x for this PR is done. kind/enhancement This would improve or streamline existing functionality. ready-to-merge This PR has passed all tests and received consensus from code owners to merge. release-note/ci This PR makes changes to the CI.
Development

Successfully merging this pull request may close these issues.

CI: ci-clustermesh: check-log-errors: Failed while waiting for initial global identities
4 participants