
jenkins: switch to ad-hoc GKE cluster creation/deletion #19918


Merged: 2 commits merged into cilium:master on May 25, 2022

Conversation

nbusseneau
Member

The general idea is to remove the need for our permanent pool of GKE clusters + management cluster (that manages the pool via Config Connector).

Instead, we switch to ad-hoc clusters as we do on CI 3.0. This should:

  • Remove the upper limit on the number of concurrent Jenkins GKE jobs.
  • Remove the need for permanent clusters (reduce CI costs).
  • Have no effect on the setup time required before the tests actually start running on GKE clusters.
  • Improve control over GKE features (e.g. DenyServiceExternalIPs admission controller) that cannot be controlled via CNRM / Config Connector.
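As an illustration of the ad-hoc approach, and not the actual scripts changed in this PR, each job can bring a cluster up and tear it down again with a pair of `gcloud` calls. The cluster name, zone, and node settings below are hypothetical:

```sh
#!/usr/bin/env bash
# Hypothetical sketch of a per-job ad-hoc GKE cluster lifecycle.
# Names and flags are illustrative, not the actual CI scripts.
set -e

CLUSTER_NAME="cilium-ci-${BUILD_NUMBER:-local}"
ZONE="us-west1-a"

# Create a small, short-lived cluster for this job only.
gcloud container clusters create "${CLUSTER_NAME}" \
  --zone "${ZONE}" \
  --num-nodes 2 \
  --machine-type n1-standard-4

# Fetch kubeconfig credentials so the tests can talk to the cluster.
gcloud container clusters get-credentials "${CLUSTER_NAME}" --zone "${ZONE}"

# ... run the GKE test suite against the cluster ...

# Delete the cluster once the job is done, whether it passed or failed.
gcloud container clusters delete "${CLUSTER_NAME}" --zone "${ZONE}" --quiet
```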

@nbusseneau nbusseneau added the area/CI (Continuous Integration testing issue or flake) and release-note/ci (This PR makes changes to the CI.) labels on May 23, 2022
New GKE clusters have the automatic labelling feature gate enabled by
default, so the labels used in the `Identity CLI testing` `K8sCLI` test
need to be updated with the additional
`k8s:io.cilium.k8s.namespace.labels.kubernetes.io/metadata.name`
automatic label.

Signed-off-by: Nicolas Busseneau <nicolas@isovalent.com>
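For context (a sketch, not part of the PR): on recent Kubernetes versions (1.21+), which new GKE clusters run, the apiserver automatically sets the `kubernetes.io/metadata.name` label on every namespace, and Cilium imports namespace labels into identities under the `k8s:io.cilium.k8s.namespace.labels.` prefix shown above. The check below is a hypothetical way to confirm the automatic label on a fresh cluster:

```sh
# Confirm the automatic namespace label on a new cluster (illustrative command).
kubectl get namespace default \
  -o jsonpath='{.metadata.labels.kubernetes\.io/metadata\.name}'
# Expected output: default
#
# Cilium picks this namespace label up as
# k8s:io.cilium.k8s.namespace.labels.kubernetes.io/metadata.name=<namespace>,
# which is why the test's expected identity labels need updating.
```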
@nbusseneau nbusseneau marked this pull request as ready for review May 24, 2022 15:28
@nbusseneau nbusseneau requested review from a team as code owners May 24, 2022 15:28
@nbusseneau nbusseneau requested review from ldelossa and nebril May 24, 2022 15:28
@nbusseneau
Member Author

/test

@nbusseneau
Member Author

nbusseneau commented May 24, 2022

Notes to reviewers:

Member

@gandro gandro left a comment


Not part of the CI team, but looks good to me nonetheless.

Member

@sayboras sayboras left a comment


🚢

@nbusseneau nbusseneau added the ready-to-merge (This PR has passed all tests and received consensus from code owners to merge.) label on May 25, 2022
Member

@joestringer joestringer left a comment


I glanced through the bash: `set -e` is present in the new script, and it seems that if it fails, the outer code will still call into `release-cluster.sh`. 👍
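A minimal sketch of the failure-handling pattern described above, assuming hypothetical wrapper scripts and arguments (only `release-cluster.sh` is named in this PR; the other script names and variables are illustrative):

```sh
#!/usr/bin/env bash
# The inner steps use "set -e", so any failure aborts the script early;
# a trap on EXIT guarantees the cluster is still released afterwards.
set -e

cleanup() {
  # Runs on any exit (success, failure, or abort) and tears down the cluster.
  ./release-cluster.sh "${CLUSTER_NAME}" || true
}
trap cleanup EXIT

./create-cluster.sh "${CLUSTER_NAME}"   # hypothetical creation step
./run-tests.sh                          # hypothetical test step
```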

@joestringer joestringer merged commit b42e5a0 into cilium:master May 25, 2022
@pchaigno
Member

> Have no effect on the setup time required before the tests actually start running on GKE clusters.

How is that possible if we are going from a pool of pre-created clusters to creating the clusters as part of the CI job?

@nbusseneau
Member Author

> How is that possible if we are going from a pool of pre-created clusters to creating the clusters as part of the CI job?

The clusters in the pool were pre-created, but they were scaled down to 0 nodes when not in use and scaled back up to 2 nodes when a job claimed them. In practice, scaling back up takes about the same amount of time as creating a new cluster.
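A hypothetical illustration of the old pool behaviour described above, with placeholder cluster name and zone:

```sh
# When a job claimed a pool cluster, it was scaled back up to 2 nodes...
gcloud container clusters resize cilium-ci-pool-1 \
  --zone us-west1-a --num-nodes 2 --quiet

# ...and scaled back down to 0 nodes once released.
gcloud container clusters resize cilium-ci-pool-1 \
  --zone us-west1-a --num-nodes 0 --quiet

# Scaling from 0 back up to 2 nodes takes roughly as long as creating a
# fresh cluster, which is why ad-hoc creation does not slow down test setup.
```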

@tklauser tklauser added the backport-done/1.11 (The backport for Cilium 1.11.x for this PR is done.) label and removed the backport-pending/1.11 label on Jun 2, 2022
@nbusseneau nbusseneau deleted the pr/fix-gke branch July 11, 2024 16:17