-
Notifications
You must be signed in to change notification settings - Fork 3.4k
datapath: Add support for skipping direct routes on different L2 networks #32733
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you @jleeh.
I left two comments. Could you also document the new agent flag and Helm value in the upgrade notes (Documentation/operations/upgrade.rst
)?
966a654
to
135b59c
Compare
Thanks for the quick review @lambdanis, actioned comments. |
06aa612
to
fa21f25
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
docs good
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
k8s LGTM
/test |
Let's pause this PR just for a bit while we discuss this point: #31124 (comment) Overall, the PR looks generally harmless and I'm not opposed, but I'd like to square up across contributors on whether this is the right answer, or if this should become a BGP responsibility. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
After a bit of discussion, this change looks OK to me. The alternative BGP mesh approach is currently not planned 👍
@jleeh please run 'make -C Documentation update-cmdref' and push changes back up. That is why the |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you for the changes 👍
/test |
1 similar comment
/test |
@jleeh the PR needs a rebase as the CI is failing due to some flakes. |
44cf6cd
to
5aa1fb8
Compare
Rebased @aanm |
…orks Previously, when a cluster ran with native-networking and had multiple zones, it wasn't possible to enable auto direct routes. This caused a bottleneck for same-zone traffic as it always had to be routed through the gw. With this new flag, any direct routes for nodes on different L2 networks will be skipped. Cilium will add routes for nodes on the same L2 and not exit. Fixes: cilium#31124 Signed-off-by: Jonny <jonny@linkpool.io>
/test |
Seems like the config option has been renamed in the inital PR while it was reviewed but this instance has been missed. See cilium#32733
Seems like the config option has been renamed in the inital PR while it was reviewed but this instance has been missed. See cilium#32733 Signed-off-by: Matthias Baur <m.baur@syseleven.de>
Seems like the config option has been renamed in the inital PR while it was reviewed but this instance has been missed. See cilium#32733 Signed-off-by: Matthias Baur <m.baur@syseleven.de>
Seems like the config option has been renamed in the inital PR while it was reviewed but this instance has been missed. See #32733 Signed-off-by: Matthias Baur <m.baur@syseleven.de>
Please ensure your pull request adheres to the following guidelines:
description and a
Fixes: #XXX
line if the commit addresses a particularGitHub issue.
Fixes: <commit-id>
tag, thenplease add the commit author[s] as reviewer[s] to this issue.
Previously, when a cluster ran with native-networking and had multiple zones, it wasn't possible to enable auto direct routes. This caused a bottleneck for same-zone traffic as it always had to be routed through the gw.
With this new flag, any direct routes for nodes on different L2 networks will be skipped. Cilium will add routes for nodes on the same L2 and not exit.
Fixes: #31124
Regarding tests, I couldn't get the
node_linux_test.go
file to run locally when in privileged mode. I'm not sure how applicable it is to add a test specifically for this with how the current tests are written and with the new flag.I've ran this Cilium build on our staging cluster and works as intended.