Host firewall tests #12621

pchaigno · 2020-07-22T20:59:14Z

This PR disables the host firewall by default in CI. It will only be enabled for all tests when the label ci/host-firewall is set. Tests that specifically enable the host firewall (see below) will still run regardless of the label.

The two last commits then add two new tests for the host firewall:

A test of a fromCIDR+toPorts host policy (based on the existing fromCIDR+toPorts test) from the third node.
A NodePort test with an ingress+egress host policy (initially written to catch a potential regression on the path client->node1->backend@node2).

I verified ci/host-firewall works by running the tests with that label in #12672.

test/k8sT/Services.go

aanm

From experience, having a label to run a particular set of tests will make the code to regress as soon this PR is merge. Even we are not enabling the host firewall by default, what's wrong on having more coverage for the host firewall?

pchaigno · 2020-07-25T15:25:49Z

From experience, having a label to run a particular set of tests will make the code to regress as soon this PR is merge. Even we are not enabling the host firewall by default, what's wrong on having more coverage for the host firewall?

My original post was a bit unclear so not sure we're on the same page. The new host firewall tests I added will run regardless of the ci/host-firewall label. I've clarified that ☝️

On the general risk of regressions once we stop testing something, I agree. That is partly why I'm only making this change now that we also have some first host firewall tests.

Then, why don't I want the host firewall enabled by default in all tests? Because (1) with #12345, enabling the host firewall changes the path of packets in some cases (pod1@node1>node1 and pod1@node1->node2) and (2) it's not the default setup most users will run. I intend to use the new label to run additional tests on PRs that may impact the host firewall.

aanm · 2020-07-25T18:28:22Z

From experience, having a label to run a particular set of tests will make the code to regress as soon this PR is merge. Even we are not enabling the host firewall by default, what's wrong on having more coverage for the host firewall?

My original post was a bit unclear so not sure we're on the same page. The new host firewall tests I added will run regardless of the ci/host-firewall label. I've clarified that

👍

On the general risk of regressions once we stop testing something, I agree. That is partly why I'm only making this change now that we also have some first host firewall tests.

Then, why don't I want the host firewall enabled by default in all tests? Because (1) with #12345, enabling the host firewall changes the path of packets in some cases (pod1@node1>node1 and pod1@node1->node2) and (2) it's not the default setup most users will run. I intend to use the new label to run additional tests on PRs that may impact the host firewall.

Why can't we deploy Cilium to run with host firewall and run these tests (besides the setups that we already have)?

pchaigno · 2020-07-27T07:14:03Z

test-me-please

jenkinsfiles/ginkgo-kernel.Jenkinsfile

test/k8sT/manifests/ccnp-host-ingress-from-cidr-to-ports.yaml

test/k8sT/Services.go

pchaigno · 2020-08-10T19:24:30Z

I pushed an update but this is currently blocked by #12834 so switching to draft.

pchaigno · 2020-08-14T12:50:34Z

Runtime tests failed with #12862: https://jenkins.cilium.io/job/Cilium-PR-Runtime-4.9/1553/testReport/junit/(root)/Suite-runtime/RuntimePrivilegedUnitTests_Run_Tests/
retest-runtime

brb

Thanks, LGTM! Just two nits.

test/helpers/kubectl.go

The host firewall is only enabled in CI if label ci/host-firewall is set. The goal is to have default CI options closer to common user environments and host firewall is not enabled by default in those. Signed-off-by: Paul Chaignon <paul@cilium.io>

This commit extends the existing fromCIDR+toPorts policy test to test the same kind of policy for the host firewall. To that end, it: 1. Enables the host firewall. The issue in comment is not relevant anymore since masquerading is disabled. 2. Introduce a helper to get the ID of the host endpoint. This helper will likely be needed for other host firewall tests as well. 3. Load a new DaemonSet to instanciate a host-networking pod on each k8s node. This pod serves as the target for host firewall connectivity tests. 4. Extend the existing test cases with CCNP tests. Signed-off-by: Paul Chaignon <paul@cilium.io>

This commit adds new tests, identical to NodePort tests under vxlan tunneling and direct routing, but with an ingress+egress host policy applied. The host policy only allow communications between nodes and to specific endpoints for readiness probes. Signed-off-by: Paul Chaignon <paul@cilium.io>

pchaigno · 2020-08-25T16:01:26Z

test-me-please

We currently have a couple of host firewall tests, but they don't cover all possible packet paths. [1] added two tests for the path node <-> world (one with fromCIDR+toPorts and one combined with NodePort handling). Other firewall tests [2] are only validating correct loading without enforcing policies. This commit fills this gap by adding VXLAN and direct routing tests for the host firewall, with L3+L4 policies enforced on the paths node <-> local pod, node <-> remote pod, and node <-> remote node. The test design draws inspiration from early host firewall bugs and regressions: - Test ingress and egress at the same time with restrictions on allowed ports. This is meant to ensure we detect a regression where only one direction bypasses policy enforcement. If such a case arises, we will fail because the source port won't be allowed and the connection will be dropped. - Allow connections to/from world and pods not used in tests. This is meant to reduce the risk of bricking the nodes. Node to node communications are still strongly restricted, but the ports defined there have been stable for a while. - Test connections to local and remote pods separately. They follow very different paths through our datapath. 1 - #12621 2 - #14255 Signed-off-by: Paul Chaignon <paul@cilium.io>

We currently have a couple of host firewall tests, but they don't cover all possible packet paths. [1] added two tests for the path node <-> world (one with fromCIDR+toPorts and one combined with NodePort handling). Other firewall tests [2] are only validating correct loading without enforcing policies. This commit fills this gap by adding VXLAN and direct routing tests for the host firewall, with L3+L4 policies enforced on the paths node <-> local pod, node <-> remote pod, and node <-> remote node. The test design draws inspiration from early host firewall bugs and regressions: - Test ingress and egress at the same time with restrictions on allowed ports. This is meant to ensure we detect a regression where only one direction bypasses policy enforcement. If such a case arises, we will fail because the source port won't be allowed and the connection will be dropped. - Allow connections to/from world and pods not used in tests. This is meant to reduce the risk of bricking the nodes. Node to node communications are still strongly restricted, but the ports defined there have been stable for a while. - Test connections to local and remote pods separately. They follow very different paths through our datapath. 1 - cilium#12621 2 - cilium#14255 Signed-off-by: Paul Chaignon <paul@cilium.io>

[ upstream commit 6f59f4f ] We currently have a couple of host firewall tests, but they don't cover all possible packet paths. [1] added two tests for the path node <-> world (one with fromCIDR+toPorts and one combined with NodePort handling). Other firewall tests [2] are only validating correct loading without enforcing policies. This commit fills this gap by adding VXLAN and direct routing tests for the host firewall, with L3+L4 policies enforced on the paths node <-> local pod, node <-> remote pod, and node <-> remote node. The test design draws inspiration from early host firewall bugs and regressions: - Test ingress and egress at the same time with restrictions on allowed ports. This is meant to ensure we detect a regression where only one direction bypasses policy enforcement. If such a case arises, we will fail because the source port won't be allowed and the connection will be dropped. - Allow connections to/from world and pods not used in tests. This is meant to reduce the risk of bricking the nodes. Node to node communications are still strongly restricted, but the ports defined there have been stable for a while. - Test connections to local and remote pods separately. They follow very different paths through our datapath. 1 - cilium#12621 2 - cilium#14255 Signed-off-by: Paul Chaignon <paul@cilium.io> Signed-off-by: Michal Rostecki <mrostecki@opensuse.org>

[ upstream commit 6f59f4f ] We currently have a couple of host firewall tests, but they don't cover all possible packet paths. [1] added two tests for the path node <-> world (one with fromCIDR+toPorts and one combined with NodePort handling). Other firewall tests [2] are only validating correct loading without enforcing policies. This commit fills this gap by adding VXLAN and direct routing tests for the host firewall, with L3+L4 policies enforced on the paths node <-> local pod, node <-> remote pod, and node <-> remote node. The test design draws inspiration from early host firewall bugs and regressions: - Test ingress and egress at the same time with restrictions on allowed ports. This is meant to ensure we detect a regression where only one direction bypasses policy enforcement. If such a case arises, we will fail because the source port won't be allowed and the connection will be dropped. - Allow connections to/from world and pods not used in tests. This is meant to reduce the risk of bricking the nodes. Node to node communications are still strongly restricted, but the ports defined there have been stable for a while. - Test connections to local and remote pods separately. They follow very different paths through our datapath. 1 - #12621 2 - #14255 Signed-off-by: Paul Chaignon <paul@cilium.io> Signed-off-by: Michal Rostecki <mrostecki@opensuse.org>

pchaigno added area/CI-improvement Topic or proposal to improve the Continuous Integration workflow release-note/ci This PR makes changes to the CI. needs-backport/1.8 area/host-firewall Impacts the host firewall or the host endpoint. labels Jul 22, 2020

pchaigno force-pushed the pr/pchaigno/host-firewall-ci branch 2 times, most recently from 2987c8e to 9bb4503 Compare July 22, 2020 21:17

This comment has been minimized.

Sign in to view

pchaigno mentioned this pull request Jul 23, 2020

Followups for host endpoint and firewall #11799

Closed

34 tasks

pchaigno force-pushed the pr/pchaigno/host-firewall-ci branch 2 times, most recently from 16b2055 to 2572a2a Compare July 24, 2020 13:19

pchaigno marked this pull request as ready for review July 24, 2020 19:03

pchaigno requested a review from a team as a code owner July 24, 2020 19:03

pchaigno requested a review from christarazi July 24, 2020 19:07

pchaigno commented Jul 24, 2020

View reviewed changes

test/k8sT/Services.go Show resolved Hide resolved

pchaigno requested a review from brb July 24, 2020 19:09

aanm reviewed Jul 25, 2020

View reviewed changes

brb reviewed Jul 27, 2020

View reviewed changes

jenkinsfiles/ginkgo-kernel.Jenkinsfile Show resolved Hide resolved

test/k8sT/manifests/ccnp-host-ingress-from-cidr-to-ports.yaml Show resolved Hide resolved

test/k8sT/Services.go Outdated Show resolved Hide resolved

test/k8sT/Services.go Show resolved Hide resolved

pchaigno mentioned this pull request Aug 10, 2020

Host CCNP policies rejected on master #12834

Closed

pchaigno force-pushed the pr/pchaigno/host-firewall-ci branch from 2572a2a to 8ae8524 Compare August 10, 2020 19:24

pchaigno marked this pull request as draft August 10, 2020 19:24

pchaigno force-pushed the pr/pchaigno/host-firewall-ci branch 3 times, most recently from 6d785d9 to 722aeaf Compare August 14, 2020 06:58

pchaigno marked this pull request as ready for review August 14, 2020 12:51

pchaigno requested a review from aanm August 14, 2020 12:52

christarazi approved these changes Aug 17, 2020

View reviewed changes

brb reviewed Aug 25, 2020

View reviewed changes

test/helpers/kubectl.go Outdated Show resolved Hide resolved

test/helpers/kubectl.go Outdated Show resolved Hide resolved

pchaigno added 3 commits August 25, 2020 15:28

pchaigno force-pushed the pr/pchaigno/host-firewall-ci branch from 722aeaf to 3c86d17 Compare August 25, 2020 15:23

pchaigno requested a review from brb August 25, 2020 15:24

brb approved these changes Aug 25, 2020

View reviewed changes

christarazi merged commit eecd5b9 into master Aug 25, 2020

christarazi deleted the pr/pchaigno/host-firewall-ci branch August 25, 2020 22:40

kaworu mentioned this pull request Aug 27, 2020

v1.8 backports 2020-08-27 #12990

Merged

kaworu added backport-pending/1.8 and removed needs-backport/1.8 labels Aug 27, 2020

joestringer added backport-done/1.8 and removed backport-pending/1.8 labels Sep 4, 2020

pchaigno mentioned this pull request Dec 4, 2020

test: Fix flake on policy verdict count check #14286

Merged

pchaigno mentioned this pull request Feb 1, 2021

test: Extend coverage for host policies enforcement #14822

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Host firewall tests #12621

Host firewall tests #12621

Uh oh!

pchaigno commented Jul 22, 2020 •

edited

Loading

Uh oh!

This comment has been minimized.

Uh oh!

aanm left a comment

Uh oh!

pchaigno commented Jul 25, 2020

Uh oh!

aanm commented Jul 25, 2020

Uh oh!

pchaigno commented Jul 27, 2020

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pchaigno commented Aug 10, 2020

Uh oh!

pchaigno commented Aug 14, 2020 •

edited

Loading

Uh oh!

brb left a comment

Uh oh!

Uh oh!

Uh oh!

pchaigno commented Aug 25, 2020

Uh oh!

Uh oh!

Host firewall tests #12621

Host firewall tests #12621

Uh oh!

Conversation

pchaigno commented Jul 22, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment has been minimized.

Uh oh!

aanm left a comment

Choose a reason for hiding this comment

Uh oh!

pchaigno commented Jul 25, 2020

Uh oh!

aanm commented Jul 25, 2020

Uh oh!

pchaigno commented Jul 27, 2020

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pchaigno commented Aug 10, 2020

Uh oh!

pchaigno commented Aug 14, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

brb left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

pchaigno commented Aug 25, 2020

Uh oh!

Uh oh!

pchaigno commented Jul 22, 2020 •

edited

Loading

pchaigno commented Aug 14, 2020 •

edited

Loading