Skip to content

Conversation

jleeh
Copy link
Contributor

@jleeh jleeh commented May 27, 2024

Please ensure your pull request adheres to the following guidelines:

  • For first time contributors, read Submitting a pull request
  • All code is covered by unit and/or runtime tests where feasible.
  • All commits contain a well written commit description including a title,
    description and a Fixes: #XXX line if the commit addresses a particular
    GitHub issue.
  • If your commit description contains a Fixes: <commit-id> tag, then
    please add the commit author[s] as reviewer[s] to this issue.
  • All commits are signed off. See the section Developer’s Certificate of Origin
  • Provide a title or release-note blurb suitable for the release notes.
  • Are you a user of Cilium? Please add yourself to the Users doc
  • Thanks for contributing!

Previously, when a cluster ran with native-networking and had multiple zones, it wasn't possible to enable auto direct routes. This caused a bottleneck for same-zone traffic as it always had to be routed through the gw.

With this new flag, any direct routes for nodes on different L2 networks will be skipped. Cilium will add routes for nodes on the same L2 and not exit.

Fixes: #31124

Regarding tests, I couldn't get the node_linux_test.go file to run locally when in privileged mode. I'm not sure how applicable it is to add a test specifically for this with how the current tests are written and with the new flag.

I've ran this Cilium build on our staging cluster and works as intended.

@jleeh jleeh requested review from a team as code owners May 27, 2024 13:12
@jleeh jleeh requested review from ldelossa and asauber May 27, 2024 13:12
@maintainer-s-little-helper maintainer-s-little-helper bot added the dont-merge/needs-release-note-label The author needs to describe the release impact of these changes. label May 27, 2024
@jleeh jleeh requested review from nebril and a user May 27, 2024 13:12
@github-actions github-actions bot added the kind/community-contribution This was a contribution made by a community member. label May 27, 2024
Copy link

@ghost ghost left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you @jleeh.
I left two comments. Could you also document the new agent flag and Helm value in the upgrade notes (Documentation/operations/upgrade.rst)?

@ghost ghost added the release-note/minor This PR changes functionality that users may find relevant to operating Cilium. label May 27, 2024
@maintainer-s-little-helper maintainer-s-little-helper bot removed the dont-merge/needs-release-note-label The author needs to describe the release impact of these changes. label May 27, 2024
@jleeh jleeh force-pushed the enable-direct-route-skipping branch from 966a654 to 135b59c Compare May 28, 2024 09:26
@jleeh jleeh requested a review from a user May 28, 2024 09:26
@jleeh
Copy link
Contributor Author

jleeh commented May 28, 2024

Thanks for the quick review @lambdanis, actioned comments.

@jleeh jleeh force-pushed the enable-direct-route-skipping branch 2 times, most recently from 06aa612 to fa21f25 Compare May 29, 2024 10:48
Copy link

@ghost ghost left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

docs good

Copy link
Member

@nebril nebril left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

k8s LGTM

@aanm
Copy link
Member

aanm commented May 31, 2024

/test

@ldelossa
Copy link
Contributor

ldelossa commented Jun 4, 2024

Let's pause this PR just for a bit while we discuss this point: #31124 (comment)

Overall, the PR looks generally harmless and I'm not opposed, but I'd like to square up across contributors on whether this is the right answer, or if this should become a BGP responsibility.

@ldelossa ldelossa added dont-merge/discussion A discussion is ongoing and should be resolved before merging, regardless of reviews & tests status. and removed dont-merge/discussion A discussion is ongoing and should be resolved before merging, regardless of reviews & tests status. labels Jun 4, 2024
Copy link
Contributor

@ldelossa ldelossa left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

After a bit of discussion, this change looks OK to me. The alternative BGP mesh approach is currently not planned 👍

@ldelossa
Copy link
Contributor

ldelossa commented Jun 4, 2024

@jleeh please run 'make -C Documentation update-cmdref' and push changes back up. That is why the Documentation Updates / Check generated documentation CI test is failing.

Copy link
Member

@asauber asauber left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for the changes 👍

@dylandreimerink
Copy link
Member

/test

1 similar comment
@aanm
Copy link
Member

aanm commented Jun 12, 2024

/test

@aanm aanm closed this Jun 15, 2024
@aanm aanm reopened this Jun 15, 2024
@aanm aanm closed this Jun 15, 2024
@aanm aanm reopened this Jun 15, 2024
@aanm
Copy link
Member

aanm commented Jun 15, 2024

@jleeh the PR needs a rebase as the CI is failing due to some flakes.

@aanm aanm added the dont-merge/needs-rebase This PR needs to be rebased because it has merge conflicts. label Jun 15, 2024
@jleeh jleeh force-pushed the enable-direct-route-skipping branch from 44cf6cd to 5aa1fb8 Compare June 15, 2024 12:20
@jleeh
Copy link
Contributor Author

jleeh commented Jun 15, 2024

Rebased @aanm

…orks

Previously, when a cluster ran with native-networking and had multiple
zones, it wasn't possible to enable auto direct routes. This caused a
bottleneck for same-zone traffic as it always had to be routed through
the gw.

With this new flag, any direct routes for nodes on different L2 networks
will be skipped. Cilium will add routes for nodes on the same L2 and not
exit.

Fixes: cilium#31124
Signed-off-by: Jonny <jonny@linkpool.io>
@aanm aanm removed the dont-merge/needs-rebase This PR needs to be rebased because it has merge conflicts. label Jun 16, 2024
@aanm
Copy link
Member

aanm commented Jun 16, 2024

/test

@aanm aanm enabled auto-merge June 16, 2024 07:59
@aanm aanm added this pull request to the merge queue Jun 17, 2024
@maintainer-s-little-helper maintainer-s-little-helper bot added the ready-to-merge This PR has passed all tests and received consensus from code owners to merge. label Jun 17, 2024
Merged via the queue into cilium:main with commit c2b8f48 Jun 17, 2024
baurmatt added a commit to syseleven/cilium that referenced this pull request Oct 14, 2024
Seems like the config option has been renamed in the inital PR while it
was reviewed but this instance has been missed.

See cilium#32733
baurmatt added a commit to syseleven/cilium that referenced this pull request Oct 14, 2024
Seems like the config option has been renamed in the inital PR while it
was reviewed but this instance has been missed.

See cilium#32733

Signed-off-by: Matthias Baur <m.baur@syseleven.de>
baurmatt added a commit to syseleven/cilium that referenced this pull request Oct 22, 2024
Seems like the config option has been renamed in the inital PR while it
was reviewed but this instance has been missed.

See cilium#32733

Signed-off-by: Matthias Baur <m.baur@syseleven.de>
github-merge-queue bot pushed a commit that referenced this pull request Oct 22, 2024
Seems like the config option has been renamed in the inital PR while it
was reviewed but this instance has been missed.

See #32733

Signed-off-by: Matthias Baur <m.baur@syseleven.de>
tklauser pushed a commit that referenced this pull request Oct 25, 2024
[ upstream commit b418310 ]

Seems like the config option has been renamed in the inital PR while it
was reviewed but this instance has been missed.

See #32733

Signed-off-by: Matthias Baur <m.baur@syseleven.de>
Signed-off-by: Tobias Klauser <tobias@cilium.io>
tklauser pushed a commit that referenced this pull request Oct 25, 2024
[ upstream commit b418310 ]

Seems like the config option has been renamed in the inital PR while it
was reviewed but this instance has been missed.

See #32733

Signed-off-by: Matthias Baur <m.baur@syseleven.de>
Signed-off-by: Tobias Klauser <tobias@cilium.io>
github-merge-queue bot pushed a commit that referenced this pull request Oct 25, 2024
[ upstream commit b418310 ]

Seems like the config option has been renamed in the inital PR while it
was reviewed but this instance has been missed.

See #32733

Signed-off-by: Matthias Baur <m.baur@syseleven.de>
Signed-off-by: Tobias Klauser <tobias@cilium.io>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/community-contribution This was a contribution made by a community member. ready-to-merge This PR has passed all tests and received consensus from code owners to merge. release-note/minor This PR changes functionality that users may find relevant to operating Cilium.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

CFP: Improve auto-direct-node-routes to not exit when node is outside of subnet
8 participants