v1.11 backports 2022-11-09 #22073

pchaigno · 2022-11-09T22:17:29Z

docs: Remove autoDirectNodeRoutes where not needed #21831 -- docs: Remove autoDirectNodeRoutes where not needed (@pchaigno)
Add a section with distro-specific considerations #21064 -- Add a section with distro-specific considerations (@bmcustodio)
- Minor conflict on Documentation/operations/system_requirements.rst.
ipam: Fix overlapping/duplicate PodCIDR allocation when nodes are added while operator is down #21526 -- ipam: Fix overlapping/duplicate PodCIDR allocation when nodes are added while operator is down (@dylandreimerink)
- Non-trivial conflicts. Please review carefully ⚠️
docs: Reword note in Azure CNI chaining documentation #21897 -- docs: Reword note in Azure CNI chaining documentation (@wedaly)

Once this PR is merged, you can update the PR labels via:

$ for pr in 21831 21064 21526 21897; do contrib/backporting/set-labels.py $pr done 1.11; done

[ upstream commit 34127e6 ] The KPR guide contains the autoDirectNodeRoutes option in most Helm commands, but that option isn't a requirement for KPR subfeatures and may even fail if Kubernetes nodes are not L2-connected. Signed-off-by: Paul Chaignon <paul@cilium.io>

[ upstream commit e121b5d ] Over time we've been accumulating some knowledge about particular Linux distributions and groups of distributions that has gone largely unnoted in our documentation. A good understanding and implementation of these considerations are extremely important to ensure that Cilium runs properly, so this commit attempts at adding a subsection containing this information. Signed-off-by: Bruno M. Custódio <brunomcustodio@gmail.com> Signed-off-by: Paul Chaignon <paul@cilium.io>

dylandreimerink · 2022-11-15T09:52:45Z

The unit test included in #21526 somehow triggers a nil pointer deference in operator.startSynchronizingCiliumNodes. I will need to do some investigation on why(original stack trace is gone, panic is caught and re-thrown by error handling in the test). So I will have to make a dedicated backport PR with a bugfix for v1.11.

[ upstream commit 4c9c1d3 ] This commit fixes an edge case in the `NodesPodCIDRManager`. If there were any nodes on operator startup which have no PodCIDRs, the operator would sometimes assign PodCIDRs to these nodes which have already been allocated to other nodes. The operator assumed that when `k8sCiliumNodesCacheSynced` closes, all node events have been processed. And it proceeds to call `Resync` on the `nodeManager`. The `NodesPodCIDRManager` will queue any nodes without PodCIDRs to be allocated once the `canAllocatePodCIDRs` variable is set. This variable is set by the `Resync`. So, the assumption/expected behavior is that the `NodesPodCIDRManager.Update` function has been called for all nodes in the cache before `Resync` is called. However, this wasn't the case. The `startSynchronizingCiliumNodes` function starts the informer and connects the nodeManager to it. But instead of handling the events at once, the callbacks enqueue the events, to be handled by a separate go routine. This means that `k8sCiliumNodesCacheSynced` is closed once all of the node events are enqueued, not when they have been processed by the `nodeManager`. This commit fixes this behavior by processing all events at once in the informer callbacks until the full sync is complete, at which point we will switch over to using the workqueue. Fixes: cilium#21482 Signed-off-by: Dylan Reimerink <dylan.reimerink@isovalent.com> Signed-off-by: Paul Chaignon <paul@cilium.io>

[ upstream commit b3cd077 ] Clarify that Azure CNI chaining is different than Azure CNI Powered by Cilium. Signed-off-by: Will Daly <widaly@microsoft.com> Signed-off-by: Paul Chaignon <paul@cilium.io>

pchaigno · 2022-11-15T11:33:53Z

/test-backport-1.11

Job 'Cilium-PR-K8s-GKE' failed:

Click to show.

Test Name

K8sDatapathConfig MonitorAggregation Checks that monitor aggregation flags send notifications

Failure Output

FAIL: Pods are not ready in time: timed out waiting for pods with filter  to be ready: 4m0s timeout expired

If it is a flake and a GitHub issue doesn't already exist to track it, comment /mlh new-flake Cilium-PR-K8s-GKE so I can create one.

michi-covalent · 2022-11-15T18:42:40Z

test-gke: failed with a known flake: CI: K8sDatapathConfig MonitorAggregation Checks that monitor aggregation restricts notifications #21175
ci-aks-1.11: cluster creation failure: https://github.com/cilium/cilium/actions/runs/3470063305/jobs/5804332913
ci-external-workloads-v1.11: known issue: CI: GKE-based: network "default" does not have available private IP space in 10.0.0.0/8 to reserve a /14 block cilium-cli#940
ci-multicluster-1.11: known issue: CI: GKE-based: network "default" does not have available private IP space in 10.0.0.0/8 to reserve a /14 block cilium-cli#940

pchaigno and others added 2 commits November 9, 2022 23:02

pchaigno requested a review from a team as a code owner November 9, 2022 22:17

pchaigno added backport/1.11 kind/backports This PR provides functionality previously merged into master. labels Nov 9, 2022

pchaigno force-pushed the pr/v1.11-backport-2022-11-09 branch from 34a8a05 to a791ad5 Compare November 9, 2022 22:19

pchaigno requested review from bmcustodio and dylandreimerink November 9, 2022 22:20

michi-covalent approved these changes Nov 15, 2022

View reviewed changes

michi-covalent mentioned this pull request Nov 15, 2022

v1.12 backports 2022-11-07 #22028

Merged

michi-covalent self-requested a review November 15, 2022 04:30

dylandreimerink and others added 2 commits November 15, 2022 11:59

docs: Reword note in Azure CNI chaining documentation

a136494

[ upstream commit b3cd077 ] Clarify that Azure CNI chaining is different than Azure CNI Powered by Cilium. Signed-off-by: Will Daly <widaly@microsoft.com> Signed-off-by: Paul Chaignon <paul@cilium.io>

dylandreimerink force-pushed the pr/v1.11-backport-2022-11-09 branch from a791ad5 to a136494 Compare November 15, 2022 10:59

michi-covalent approved these changes Nov 15, 2022

View reviewed changes

michi-covalent merged commit be42a2a into cilium:v1.11 Nov 15, 2022

pchaigno deleted the pr/v1.11-backport-2022-11-09 branch November 15, 2022 21:46

This was referenced Nov 15, 2022

Prepare for release v1.11.11 #22192

Closed

Prepare for release v1.11.11 #22202

Merged

pchaigno mentioned this pull request Jan 24, 2023

v1.11 backports 2023-01-24 #23310

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

v1.11 backports 2022-11-09 #22073

v1.11 backports 2022-11-09 #22073

Uh oh!

pchaigno commented Nov 9, 2022

Uh oh!

dylandreimerink commented Nov 15, 2022

Uh oh!

pchaigno commented Nov 15, 2022 •

edited by maintainer-s-little-helper bot

Loading

Test Name

Failure Output

Uh oh!

michi-covalent commented Nov 15, 2022 •

edited

Loading

Uh oh!

Uh oh!

v1.11 backports 2022-11-09 #22073

v1.11 backports 2022-11-09 #22073

Uh oh!

Conversation

pchaigno commented Nov 9, 2022

Uh oh!

dylandreimerink commented Nov 15, 2022

Uh oh!

pchaigno commented Nov 15, 2022 • edited by maintainer-s-little-helper bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Test Name

Failure Output

Uh oh!

michi-covalent commented Nov 15, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

pchaigno commented Nov 15, 2022 •

edited by maintainer-s-little-helper bot

Loading

michi-covalent commented Nov 15, 2022 •

edited

Loading