Skip to content

Conversation

squeed
Copy link
Contributor

@squeed squeed commented Nov 11, 2024

This trigger may be, well, triggered when reacting to a DNS request -- even when the agent is starting up. This could lead to a deadlock, as the datapath is not able to write the endpoint header file until the agent is started, but the agent cannot finish starting as the endpoint is locked.

The fix for this is to remove the unnecessary trigger initialization on endpoint parsing; we will always start it on first regeneration.

This catches a case missed in #34059, which only fixed the new-endpoint case. That change moved trigger initialization in to endpoint regeneration and out of endpoint initialization. But it missed this line.

Fixes a potential deadlock when restarting cilium agent with pods with DNS interception configured

This trigger may be, well, triggered when reacting to a DNS request --
even when the agent is starting up. This could lead to a deadlock, as
the datapath is not able to write the endpoint header file until the
agent is started, but the agent cannot finish starting as the endpoint
is locked.

The fix for this is to remove the unnecessary trigger initialization on
endpoint parsing; we will always start it on first regeneration.

This catches a case missed in cilium#34059, which only fixed the new-endpoint
case.

Signed-off-by: Casey Callendrello <cdc@isovalent.com>
@squeed squeed added kind/bug This is a bug in the Cilium logic. release-note/misc This PR makes changes that have no direct user impact. needs-backport/1.16 This PR / issue needs backporting to the v1.16 branch labels Nov 11, 2024
@squeed squeed requested a review from a team as a code owner November 11, 2024 13:54
@squeed squeed requested a review from tommyp1ckles November 11, 2024 13:54
@squeed
Copy link
Contributor Author

squeed commented Nov 11, 2024

/test

@squeed
Copy link
Contributor Author

squeed commented Nov 11, 2024

I suspect this will fix #35080

@squeed squeed added release-note/bug This PR fixes an issue in a previous release of Cilium. and removed release-note/misc This PR makes changes that have no direct user impact. labels Nov 11, 2024
@aanm aanm enabled auto-merge November 11, 2024 18:42
@aanm aanm disabled auto-merge November 12, 2024 09:13
@aanm aanm merged commit 0bb676d into cilium:main Nov 12, 2024
69 of 70 checks passed
@bimmlerd bimmlerd added backport-pending/1.16 The backport for Cilium 1.16.x for this PR is in progress. and removed needs-backport/1.16 This PR / issue needs backporting to the v1.16 branch labels Nov 12, 2024
@bimmlerd
Copy link
Member

bimmlerd commented Nov 12, 2024

@aanm test-foo-bar (André is testing his GitHub mail notifications 😁)

@bimmlerd
Copy link
Member

@aanm test-foo-bar-1

@bimmlerd
Copy link
Member

@aanm test-foo-bar-2

@github-actions github-actions bot added backport-done/1.16 The backport for Cilium 1.16.x for this PR is done. and removed backport-pending/1.16 The backport for Cilium 1.16.x for this PR is in progress. labels Nov 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backport-done/1.16 The backport for Cilium 1.16.x for this PR is done. kind/bug This is a bug in the Cilium logic. release-note/bug This PR fixes an issue in a previous release of Cilium.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants