Extend DestinationRule with tunneling settings #2283

jewertow · 2022-03-15T20:59:58Z

Signed-off-by: Jacek Ewertowski jewertow@redhat.com

Background context

This change aims to enable tunneling outbound traffic. I described the idea in detail in this RFC.

The API in this pull request differs slightly from the one described in the document, because I decided to make it as close to the Envoy API as possible. What's more, I didn't mention the destination_port field in the RFC, because I forgot that port matching is optional in a VirtualService configuration, so without an explicitly defined destination port it might not be possible to configure TcpProxy.tunneling_config.hostname in some cases.

This API change is related to this pull request: istio/istio#37968.

ericvn · 2022-03-16T14:28:06Z

/ok-to-test

hzxuzhonghu · 2022-03-17T02:05:50Z

networking/v1beta1/destination_rule.proto

+  }
+
+  // Configuration for tunneling TCP over HTTP.
+  TunnelingSettings tunneling = 6;


same question :

IIUC, hosts will be used as tunneling_config.hostname

Then what if this is wildcard hosts?

What's the expected behavior if dr has subsets

That's one of the limitations that I mentioned in the RFC. Tunneling cannot be applied when a hostname contains a wildcard, because it's not possible to determine what hostname should be set in the tunneling_config.hostname. I started to work on setting tunneling_config.hostname automatically based on SNI, but it might work only for TLS connections. It's also important to note that the same limitation applies to virtual services with multiple hostnames. In such a case, it's also not possible to determine which hostname to choose, so the destination rule must be ignored.
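The limitation described above can be sketched as a small check. This is only an illustration, not the validation that exists in Istio: the helper name tunnelHostname and its signature are hypothetical, and it assumes the tunnel hostname is simply the single concrete host joined with the destination port.

```go
package main

import (
	"errors"
	"fmt"
	"net"
	"strconv"
	"strings"
)

// tunnelHostname is a hypothetical helper (not part of Istio). It returns the
// value that could be used for tunneling_config.hostname, or an error when the
// hostname cannot be determined unambiguously.
func tunnelHostname(hosts []string, port uint32) (string, error) {
	// With several hosts there is no way to pick one for the tunnel.
	if len(hosts) != 1 {
		return "", errors.New("tunneling requires exactly one host")
	}
	// A wildcard host does not name a concrete tunnel target.
	if strings.Contains(hosts[0], "*") {
		return "", fmt.Errorf("tunneling cannot target wildcard host %q", hosts[0])
	}
	return net.JoinHostPort(hosts[0], strconv.FormatUint(uint64(port), 10)), nil
}

func main() {
	candidates := [][]string{
		{"external.example.com"},
		{"*.example.com"},
		{"a.example.com", "b.example.com"},
	}
	for _, hosts := range candidates {
		name, err := tunnelHostname(hosts, 443)
		fmt.Printf("hosts=%v hostname=%q err=%v\n", hosts, name, err)
	}
}
```

Only the first candidate yields a hostname; the wildcard and multi-host cases are rejected, which mirrors the "destination rule must be ignored" behavior described in the comment.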

Subsets with tunneling settings are handled like any other traffic policy for a subset.

@jewertow should there be some validation defined in the DR analyzer to enforce all these limitations? It seems to me this can get people into trouble quite easily, so it would be super nice to reflect the logic/limitations in a specific analyzer.

I included relevant validations in Istio. Should it be done somehow in the API repository? Could you share an example of such a validation?

linsun · 2022-03-21T15:15:19Z

networking/v1alpha3/destination_rule.proto

+    // Specifies whether to use CONNECT or POST http method for the upstream tunnel request.
+    // If set to true, then POST is used, otherwise CONNECT.
+    bool use_post = 2;


Not a big fan of use_post as a boolean. What is the default? CONNECT?

I think i prefer a simple method string with CONNECT as the default.

Thank you for your feedback.

At the beginning I assumed that it would be method = CONNECT/POST, but then I thought that CONNECT is probably the most common case, so if it were the default setting, the tunneling configuration could be less verbose.
What's more, I wanted to make this API as similar to the Envoy API as possible, but it seems that it was not the right idea.

What is the default? CONNECT?

Yes, the CONNECT method is used by default.

Maybe not expose to users

Maybe not expose to users

Since CONNECT is probably the most common use case, it sounds reasonable. We might not expose this setting for now and consider enabling it in the future if necessary.

@howardjohn @costinm could you share your opinions?

string "protocol" ( but we already have that in the Service, so not clear why would we need it in DR as well ).

And we should define some protocol names - I think "hbone" can be the normal CONNECT and the default, and we can use hbone-post and other variants.

I think it is pretty clear that tunneling will not be a boolean, but we may need to support multiple protocols and modes. MASQUE and related standards, etc.

but we already have that in the Service, so not clear why would we need it in DR as well

In my opinion it would be unclear how to route traffic to a service which has specified a tunnel protocol. Let's say you define a service entry with protocol hbone. Which route type do you use in a virtual service? TCP or TLS? How to determine which one should be matched. I think it's tricky and we can't find an intuitive solution.

I think "hbone" can be the normal CONNECT and the default, and we can use hbone-post and other variants.

In my opinion "hbone" might be confusing, and users might have problems understanding what it is and how it works, because I found only one section in the Istio docs with this term, and it's not explained there. I also searched Google for the terms "hbone", "hbone connect" and "hbone protocol", and I didn't find any related article.
On the other hand, even if we document in detail what hbone is, we still can't assume that users will find what they need, because it's a new term, and I guess that users will search Google for terms like "Istio connect support" or "Istio tunneling" rather than "Istio hbone".

I see another important problem. Relying only on the service protocol would make it possible to tunnel traffic only directly from a sidecar proxy to the forward proxy. Tunneling traffic through a gateway would be impossible, because that requires routing traffic to a gateway and then to the forward proxy, so there are two routing steps. With the destination rule we are able to choose at which stage to apply the tunneling config. On the other hand, when you only have a service protocol, it's not possible to determine when exactly to enable tunneling.

I was investigating how it might be implemented with only the service protocol, but there are more problems to identify and discuss. In my opinion it's much more difficult to implement and much less flexible.

How about 'h2-connect', 'h2-post' ? BTW, 'hbone' and 'better transport' proposals are inspired from https://datatracker.ietf.org/wg/masque/about/ - we could use 'h2-masque'.

Why would it be impossible to tunnel through a gateway? The same would work: we would set the protocol on the gateway port that supports CONNECT. Same for traffic to another app with a sidecar.

We already rely a lot on Service port identifying the protocol - grpc, tls, h2, http, https, etc.

I was thinking about this a bit more - extending DR with a 'protocol' setting that would override the Service is not
a bad thing, there are cases where the port name/protocol in Service can't be modified ( existing apps installed from helm), or needs to be overridden.

Why would it be impossible to tunnel through a gateway?

I assumed that intermediate tunneling between sidecar and egress gateway is undesired, so I didn't consider it. Maybe API would be simpler, but on the other hand the underlying implementation would be unnecessarily complicated.

linsun · 2022-03-21T15:18:05Z

networking/v1alpha3/destination_rule.proto

+    bool use_post = 2;
+  }
+
+  // Configuration for tunneling TCP over HTTP.


can we add this is for the host configured in the DR?

Yes, of course.

howardjohn · 2022-03-29T15:10:01Z

networking/v1alpha3/destination_rule.proto

@@ -340,6 +340,18 @@ message TrafficPolicy {
  // overridden by port-level settings, i.e. default values will be applied
  // to fields omitted in port-level traffic policies.
  repeated PortTrafficPolicy port_level_settings = 5;
+
+  message TunnelingSettings {


I definitely agree with the high level goal of improving/enabling tunneling in Istio. However, I want to make sure this aligns with our intentions to use HTTP/2 CONNECT ('bts' or 'hbone') as base line transport protocol in Istio, as well as general plans to improve Egress traffic in Istio. Can you sync up with @lambdai (the first topic) and @costinm (second topic) to make sure these are aligned?

Additionally, it could be useful to compare initiating the tunnel in Envoy vs in the app; most users that are going to use an egress proxy probably already have it when they adopt Istio, so their apps already have HTTP(s)_PROXY configured. AFAIK this is broken with Istio today.

Is there any RFC about "bts" or "hbone"? I need to know more context and plans for the future to figure out how tunneling fits these plans. I found only this package and this repository, but still don't know how Istio will use these things.

Yes, there is a doc describing "Better Transport Security" - we have been slowly working toward it, the support for CONNECT in Envoy and other changes are in part based on that design.

Support for POST as a fallback is IMO a strong requirement, but I don't think it should be exposed in the API, but be implemented at discovery level, like auto MTLS.

costinm

I commented on the istio PR as well - I personally don't think we should add this to DestinationRule at all, in particular in context of the BTS/Hbone proposal to have all in-mesh TCP streams use CONNECT.

We already have Auto-MTLS as an example - and we know the UX is far easier and better than
before, user not having to manually configure this each time.

The Service already has protocol - and we use that across Istio to determine how to connect
(h2, grpc, https, etc), it is IMO much better UX to just use discovery information and auto-configure the protocol used to tunnel.

costinm · 2022-03-29T22:16:37Z

I am also not completely opposed to have something in DestinationRule - if we find some strong use cases where using discovery info is not possible. But I can't think of a use case where we need DR and can't use the Service protocol name.

costinm · 2022-04-01T14:19:09Z

networking/v1alpha3/destination_rule.proto

@@ -340,6 +340,19 @@ message TrafficPolicy {
  // overridden by port-level settings, i.e. default values will be applied
  // to fields omitted in port-level traffic policies.
  repeated PortTrafficPolicy port_level_settings = 5;
+
+  message TunnelSettings {


Can we flatten it out ? I don't know if this is exclusive to 'tunnels' - what we would do is use the specified protocol and destination_port to override what the service normally define. It is not specific to egress gateways.

Also not sure if 'which HTTP method' is right - we may want in future to extend this to MASQUE ( over QUIC ), or support HAProxy prefix, or tunnel over WS.

Also 'protocol' is typically lowercase, and we may need to be more precise - h2-connect, h2-post, http-connect (http-post is not possible since http/1.1 POST is not bi-directional in most implementations).

Used HTTP version depends on the protocol defined in the ServiceEntry configured for a given proxy, so "h2-" and "http-" prefixes can't be used to choose the protocol.

Can we flatten it out ? I don't know if this is exclusive to 'tunnels'

If you see other potential use cases for these fields, I'm okay with that.

I tried to document these fields without the TunnelingSettings wrapper, but then it becomes difficult to specify requirements and semantics clearly. The configuration lacks context and validation becomes tricky. So after consideration I think it's not worth flattening these properties.

Also not sure if 'which HTTP method' is right - we may want in future to extend this to MASQUE ( over QUIC ), or support HAProxy prefix, or tunnel over WS.

But why document something that is not implemented? It can be documented once it's implemented.

hzxuzhonghu · 2022-04-08T02:13:32Z

Yes, envoy only supports tunnel config under tcp proxy now.

hzxuzhonghu · 2022-04-08T02:14:39Z

cc @lambdai is an expert on this aspect.

costinm · 2022-04-08T02:20:37Z

Right - and CONNECT is only meant for TCP, plain text HTTP is not supposed to use CONNECT, the proxies will get confused and send it as plain text.

We need to document ( and ensure ) that tunnel DR is only used for TCP ( including HTTPS/TLS), at least until envoy has support for proper HTTP_PROXY.

jewertow · 2022-04-08T11:11:08Z

I think we need to figure out what to do about HTTP - because that's clearly going to be broken.

Why? What is going to be broken?

We can't treat HTTP as TCP.

Why? HTTP is built on top of TCP, so what's wrong with it? If you suggest to not support HTTP, because it's not encrypted, we should not support any other non-TLS traffic, but it doesn't make sense, because for plain traffic users can apply TLS origination.

Right - and CONNECT is only meant for TCP, plain text HTTP is not supposed to use CONNECT, the proxies will get confused and send it as plain text.

HTTP is TCP as well, so a CONNECT proxy doesn't care what this traffic is. It accepts HTTP CONNECT, establishes the connection and sends bytes back and forth. Proxies will not get confused, because their responsibility is only to forward traffic to a target socket. The whole point of such proxies is to not understand the traffic. So there is no difference between tunneling HTTP and HTTPS.

I really don't understand why we should complicate such a simple feature. I want to keep it as simple as possible and don't see any benefit in restricting which application protocols may be tunneled. As long as a protocol runs over TCP, it should be allowed to be tunneled. Restrictions will complicate semantics, implementation and debugging.

costinm · 2022-04-08T14:16:11Z

I think we need to figure out what to do about HTTP - because that's clearly going to be broken.

Why? What is going to be broken?

The protocol? The RFC is pretty clear about how an HTTP proxy is supposed to work, and it isn't CONNECT. Plus telemetry, any RBAC rules that expect HTTP attributes, routing, etc.

We can't treat HTTP as TCP.

Why? HTTP is built on top of TCP, so what's wrong with it? If you suggest to not support HTTP, because it's not encrypted, we should not support any other non-TLS traffic, but it doesn't make sense, because for plain traffic users can apply TLS origination.

I meant: our typical example is upgrading HTTP to HTTPS via egress. The user originates HTTP, and the egress gateway upgrades it to HTTPS after applying various RBAC rules or routing. If HTTP is treated as TCP, all this is gone. Besides, if the goal is to support 'standard proxies' like Squid, you must follow the RFC, and that is pretty clear about how an HTTP proxy is handled. I also doubt many properly configured proxies will allow CONNECT on port 80; usually they are used for some policy enforcement.

Right - and CONNECT is only meant for TCP, plain text HTTP is not supposed to use CONNECT, the proxies will get confused and send it as plain text.

HTTP is TCP as well, so a CONNECT proxy doesn't care what this traffic is. It accepts HTTP CONNECT, establishes the connection and sends bytes back and forth. Proxies will not get confused, because their responsibility is only to forward traffic to a target socket. The whole point of such proxies is to not understand the traffic. So there is no difference between tunneling HTTP and HTTPS.

There is a big difference - HTTPS follows the RFC semantics.

jewertow · 2022-04-08T14:36:24Z

It seems that I didn't understand what you mean by treating HTTP as TCP, and we are talking about different things.
I didn't mean to say that an HTTP proxy should be treated as TCP. I thought you meant that outbound HTTP traffic can't be treated as TCP. An HTTP proxy obviously shouldn't be treated as a TCP service. To avoid misunderstanding, please take a look at this pull request: istio/istio#37968. There are examples in the directory tests/integration/pilot/tunneling.

costinm · 2022-04-08T14:55:35Z

The examples seem to configure port 8080 as TCP - so a tcp proxy is created. That's wrong IMO, it means http telemetry and any RBAC in the egress proxy that uses http attributes will be gone. It may pass the test, since it's not including this. Neither in-cluster nor outbound HTTP should be treated as TCP, and I've seen many proxies that reject CONNECT traffic that doesn't look encrypted or even if dest is not 443 (or some small set of ports), in some cases doing MITM to apply further policies.

howardjohn · 2022-04-08T15:00:05Z

networking/v1alpha3/destination_rule.proto

@@ -340,6 +340,26 @@ message TrafficPolicy {
  // overridden by port-level settings, i.e. default values will be applied
  // to fields omitted in port-level traffic policies.
  repeated PortTrafficPolicy port_level_settings = 5;
+


I mentioned this in the WG meeting but will reiterate it here. I think it's critical we have a strategy planned out for how this relates to user initiated CONNECT; not suggesting implementing that is a blocker but i would like to see some discussion of the long term plans around that so we can guide users, or docs, and our future plans.

We (at Red Hat) have no other plans related to tunneling or initiating CONNECT requests. I proposed the simplest API I can see. I know that you had other ideas similar to auto-mTLS or the service protocol, but both have flaws, which I explained above. As long as Envoy is not going to provide or extend its tunneling capabilities, I can't see alternative solutions.

It's important to note that this API does not prevent you from providing another approach to this problem in the future, as is the case with mTLS, which can be configured with both DestinationRule and PeerAuthentication.

jewertow · 2022-04-08T15:09:34Z

The examples seem to configure port 8080 as TCP - so a tcp proxy is
created. That's wrong IMO, it means http telemetry and any RBAC in the
egress proxy
that uses http attributes will be gone. It may pass the test, since it's
not including this.

Yes, but for TLS traffic you also can't apply HTTP telemetry, so it's not a matter of whether the traffic is encrypted or not.

On the other hand, why do we have to assume that CONNECT proxies must reject unencrypted traffic? It's not Istio's problem. I don't care how users configure external components.

so a tcp proxy is created. That's wrong IMO

It's not wrong. It's absolutely intentional configuration, because only TcpProxy enables tunneling, so HttpConnectionManager can't be used.

costinm · 2022-04-08T15:38:06Z

Yes, but for TLS traffic you also can't apply HTTP telemetry, so it's not a matter of whether the traffic is encrypted or not.

Yes, for TLS and HTTPS traffic we can't apply HTTP telemetry or RBAC (without MITM), and according to the RFC, CONNECT is the proper proxy method. But for HTTP traffic we can, and there is a clear standard on how an HTTP proxy should behave. Wasn't the purpose of this change to allow users to integrate with proxies?

On the other hand, why do we have to assume that CONNECT proxies must reject unencrypted traffic? It's not Istio's problem. I don't care how users configure external components.

One of the main purposes of HTTP proxies is to enforce enterprise policies. You would reject plain text traffic on CONNECT because it's a clear violation of the RFC that the proxy implements, and a sign of likely abuse (circumventing the HTTP checks, like which hosts you are allowed to connect to). Advanced proxies do MITM, so typically CONNECT is decrypted (using roots that get distributed to all machines) and policy is enforced. BTW - AFAIK almost all proxies support 'transparent proxy' mode, so just forwarding HTTP to the proxy with the original Host: header will work fine (proxies are also sometimes deployed via interception).

so a tcp proxy is created. That's wrong IMO

It's not wrong. It's absolutely intentional configuration, because only TcpProxy enables tunneling, so HttpConnectionManager can't be used.

Well - it is wrong because it violates the RFC implemented by proxies, and the user expectations on how telemetry and RBAC will work for L7 traffic. Users having to configure HTTP as L4 as a workaround for Envoy implementation limitations doesn't seem right. I understand you can intentionally misconfigure to work around Envoy, and it may work in some cases (if egress doesn't do any AuthZ or L7 routing, for example).

ramaraochavali · 2022-04-11T06:16:53Z

For HTTP, wouldn't something similar to https://github.com/envoyproxy/envoy/blob/8cb6862fe6099cd8583a64ff037ecdeaf0e939fa/configs/proxy_connect.yaml#L36 help?

nrjpoddar

Adding this to DR has a few confusing semantics, i.e. can you specify all other settings in the TrafficPolicy if you want to use the tunnel protocol? Settings like TLS, connection-level settings and load balancer settings?

Since the tunneling happens over HTTP the VS associated with this should be an HTTP block and the service port prefix of the upstream should be http*, is that correct?

nrjpoddar · 2022-04-11T09:47:42Z

networking/v1alpha3/destination_rule.proto

+    //   connect - uses HTTP CONNECT;
+    //   post - uses HTTP POST.
+    // HTTP version for upstream requests is determined by the service protocol defined for the proxy.
+    string protocol = 1 [(google.api.field_behavior) = REQUIRED];


Tunneling protocols are much more bounded than application protocols. It made sense to me to keep the service protocol as a string so that we don't change our APIs often. If we expect new tunneling protocols to be supported by Envoy very frequently, then I can understand keeping this as a string; otherwise I would prefer an enum here.
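To make the string-versus-enum trade-off concrete, here is a minimal sketch of the check a string-typed protocol field would need. It assumes only the two values listed in the proto comment above (connect and post); the function name tunnelMethod is hypothetical and this is not code from the Istio repository.

```go
package main

import (
	"fmt"
	"net/http"
)

// tunnelMethod is a hypothetical helper that maps the proposed
// TunnelSettings.protocol string to the HTTP method of the tunnel request.
// With a string field, unknown values can only be rejected at validation time;
// an enum would make them unrepresentable in the API itself.
func tunnelMethod(protocol string) (string, error) {
	switch protocol {
	case "connect":
		return http.MethodConnect, nil
	case "post":
		return http.MethodPost, nil
	default:
		return "", fmt.Errorf("unsupported tunnel protocol %q, want connect or post", protocol)
	}
}

func main() {
	for _, p := range []string{"connect", "post", "masque"} {
		method, err := tunnelMethod(p)
		fmt.Printf("protocol=%q method=%q err=%v\n", p, method, err)
	}
}
```

The third value is rejected only because the switch does not know it, which is the extra validation burden a string field carries.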

costinm · 2022-04-11T17:28:21Z

There is no such thing as "CONNECT tunneling for HTTP" - the standard that defines CONNECT also defines how to proxy HTTP, using the absolute URL - you can't change the verb from POST/GET/etc to CONNECT or use CONNECT and then send the HTTP request as the body. We need to stop inventing weird protocols and follow the standards.

jewertow · 2022-04-11T20:27:44Z

Adding this to DR has a few confusing semantics, i.e. can you specify all other settings in the TrafficPolicy if you want to use the tunnel protocol? Settings like TLS, connection-level settings and load balancer settings?

@nrjpoddar I don't know what's confusing in your opinion. tunnel is just another field like the others.

Since the tunneling happens over HTTP the VS associated with this should be an HTTP block and the service port prefix of the upstream should be http*, is that correct?

There is tcp block, because it's tunneling TCP over HTTP, so TcpProxy is needed.

jewertow · 2022-04-11T20:53:15Z

We need to stop inventing weird protocols and follow the standards.

I don't know what's weird for you. It's as simple as possible. If you don't agree to support plain TCP traffic, let's support at least TLS traffic.

The main purpose of this PR is to provide an API that enables users to stop configuring HTTP proxies in their apps by setting envs like HTTP_PROXY or properties like java -Dhttps.proxyHost=host -Dhttps.proxyPort=port.
If the "tunneling API" is too broad a topic and you have other plans that conflict with this use case and the submitted API, what do you think about changing it to HttpsProxy, keeping it simple and supporting only TLS?

costinm · 2022-04-11T23:56:56Z

I do agree with supporting TLS and TCP traffic, via TcpProxy. App-originated HTTPS too. I am also ok with supporting HTTP proxies - equivalent with setting HTTP_PROXY and HTTPS_PROXY. But setting HTTP_PROXY is NOT using CONNECT, and all HTTP proxies implement the RFC, which is to use the absolute URL. The configs you have - treating HTTP as a TCP connection - is not the same thing with setting HTTP_PROXY and is not supported/used by http proxies, and is changing the semantics of Istio, that's what I don't like.

jewertow · 2022-04-12T11:39:15Z

But setting HTTP_PROXY is NOT using CONNECT, and all HTTP proxies implement the RFC

I tested multiple commonly used HTTP tools and libraries (Java, NodeJS, curl...) and all of them use HTTP CONNECT when an HTTP proxy is configured, so I don't understand why you still say that it's a weird protocol and why you suggest that proxies do not support it. I also inspected the traffic with Wireshark and it works as expected.

Could you share the RFC which you are referring to?
I read the following RFCs:

HTTP CONNECT: https://datatracker.ietf.org/doc/html/rfc7231#section-4.3.6
Tunneling TCP based protocols through Web proxy servers: https://datatracker.ietf.org/doc/html/draft-luotonen-web-proxy-tunneling-01

and I can't see any information there about the absolute URL that you are talking about. Both of these RFCs say that the tunnel should be established with the following request:

     CONNECT server.example.com:80 HTTP/1.1
     Host: server.example.com:80

and it makes sense, because the absolute URL that you want to connect to will be sent once the connection is established.
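For illustration, the tunnel request quoted above can be produced by a few lines of Go. This is a minimal sketch that only formats the bytes a client would send before the tunnel is established; the helper name connectPreamble is hypothetical, and proxy authentication and response handling are left out.

```go
package main

import (
	"fmt"
	"net"
	"strconv"
)

// connectPreamble is a hypothetical helper that formats an HTTP/1.1 CONNECT
// request for the given target. The request target and the Host header both
// carry the authority (host:port) of the destination.
func connectPreamble(host string, port int) string {
	authority := net.JoinHostPort(host, strconv.Itoa(port))
	return fmt.Sprintf("CONNECT %s HTTP/1.1\r\nHost: %s\r\n\r\n", authority, authority)
}

func main() {
	// %q makes the CRLF line endings visible in the printed output.
	fmt.Printf("%q\n", connectPreamble("server.example.com", 80))
}
```

Everything the client writes after the proxy answers with a 2xx status is relayed as opaque bytes, which is why the request carries only the authority and not a full URL.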

costinm · 2022-04-12T21:44:49Z

Quick search: https://stackoverflow.com/questions/7577917/how-does-a-http-proxy-utilize-the-http-protocol-a-proxy-rfc

Can you share a Wireshark trace for any of those languages, with HTTP_PROXY set (for an http:// request, not https://)?

The Go implementation in net/http/request.go is

     if usingProxy && r.URL.Scheme != "" && r.URL.Opaque == "" {
         ruri = r.URL.Scheme + "://" + host + ruri
     ....

and in transport.go

     case cm.targetScheme == "http":
         pconn.isProxy = true
         if pa := cm.proxyAuth(); pa != "" {
             pconn.mutateHeaderFunc = func(h Header) {
                 h.Set("Proxy-Authorization", pa)
             }
         }
     case cm.targetScheme == "https":
         conn := pconn.conn
         var hdr Header
         if t.GetProxyConnectHeader != nil {
             var err error
             hdr, err = t.GetProxyConnectHeader(ctx, cm.proxyURL, cm.targetAddr)
             if err != nil {
                 conn.Close()
                 return nil, err
             }

I can probably find the Java implementation as well.

jewertow · 2022-04-13T10:20:31Z

OK, now I understand what you mean, and you're right: with an HTTP or HTTPS proxy configured, most clients don't use HTTP CONNECT for plain HTTP connections, but they do use HTTP CONNECT for TLS connections.
I misunderstood your comment; I thought you meant that configuring HTTP/HTTPS proxies in clients does not use CONNECT at all.

I think we look at this feature from completely different perspectives.
You are focused on that this is supposed to be a typical web/HTTP proxy (in the RFC called "proxy"). In that case your thoughts about forwarding HTTP would be absolutely correct and reasonable.
But this PR aims to enable integration with tunnel proxies (in the RFC called "tunnel").
Look at section "2.3. Intermediaries":

   A "tunnel" acts as a blind relay between two connections without
   changing the messages.  Once active, a tunnel is not considered a
   party to the HTTP communication, though the tunnel might have been
   initiated by an HTTP request.

and then at section "2.6. Protocol Versioning":

   Intermediaries that process HTTP messages (i.e., all intermediaries
   other than those acting as tunnels) MUST send their own HTTP-version
   in forwarded messages.

As you can see, the RFC says nothing about how tunnel proxies have to treat HTTP requests, because their purpose is to be blind: they do not process received messages and just establish a TCP connection between a client and an origin server.
So I think that this PR is compatible with the RFC you are referring to.

As long as this API is named TunnelSettings, there should be no confusion about why the connection is not initiated the way typical HTTP clients with HTTP_PROXY (or similar) settings initiate it.
I would like to withdraw my earlier idea of naming it HttpProxy or HttpsProxy; it was not well thought out.
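
For illustration only (not code from this PR), a minimal Go sketch of the two request shapes discussed above: the CONNECT preamble that opens a blind tunnel, and the absolute-form request line a client sends to a classic forward proxy for a plain http:// URL. The helper names `connectPreamble` and `forwardRequestLine` are made up for this example.

```go
package main

import (
	"fmt"
	"net"
)

// connectPreamble returns the request that asks a proxy to open a tunnel
// to host:port. The request target is in authority-form (host:port).
// Hypothetical helper for illustration only.
func connectPreamble(host, port string) string {
	authority := net.JoinHostPort(host, port)
	return fmt.Sprintf("CONNECT %s HTTP/1.1\r\nHost: %s\r\n\r\n", authority, authority)
}

// forwardRequestLine returns the request line sent to a classic forward
// proxy for a plain http:// URL. The request target is the absolute URL
// and no tunnel is created. Hypothetical helper for illustration only.
func forwardRequestLine(method, host, path string) string {
	return fmt.Sprintf("%s http://%s%s HTTP/1.1\r\n", method, host, path)
}

func main() {
	fmt.Printf("%q\n", connectPreamble("server.example.com", "80"))
	fmt.Printf("%q\n", forwardRequestLine("GET", "server.example.com", "/index.html"))
}
```

After the CONNECT exchange succeeds, the tunnel only relays bytes, so whatever the client sends next (a TLS handshake, for example) is opaque to it.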

costinm · 2022-04-13T18:03:08Z

I am not actually opposed to supporting the CONNECT tunnel for HTTP connections (in particular for HTTP/2 plain text), if:

- we find some non-Istio proxies that support this and don't get confused
- we implement it in a way that doesn't break Istio's own APIs.

Maybe I was not clear in my comments: the latter point is very important and is what started the discussion. In your example the HTTP port is labeled as TCP to force the use of the TCP proxy, which supports CONNECT. That breaks telemetry and policies operating on HTTP attributes.

We could either make changes in Envoy, or generate a config that still treats the request as HTTP and applies all policies/telemetry, including mTLS, but does an internal forward to a local TCP cluster that does the upgrade to CONNECT. This is IMO the 'right' use of CONNECT: tunneling encrypted traffic and preserving mTLS.

On Wed, Apr 13, 2022 at 8:52 AM Jacek Ewertowski ***@***.***> wrote:

In networking/v1alpha3/destination_rule.proto:

@@ -340,6 +340,26 @@ message TrafficPolicy {
   // overridden by port-level settings, i.e. default values will be applied
   // to fields omitted in port-level traffic policies.
   repeated PortTrafficPolicy port_level_settings = 5;
+

We (at RedHat) have no other plans related to tunneling or initiating CONNECT requests. I proposed the simplest API I can see. I know that you had other ideas similar to auto-mtls or service protocol, but both have flaws which I explained above. As far as Envoy is not going to provide nor extend its tunneling capabilities, I can't see alternative solutions. It's important to note that this API does not block you to provide other approach for this problem in the future - as is in case of mTLS which can be applied both with DestinationRule and PeerAuthentication.

jewertow · 2022-04-15T14:54:02Z

we find some non-Istio proxies that support this and don't get confused

Do you mean that we should verify which proxies it works with? I tested it mostly with Envoy, and there are no restrictions. I also used Squid and nginx with the proxy_connect module. I could also test it with Apache HTTP Server.

It's definitely possible to support an HTTP port without changing anything in Envoy, but it's a much more complex solution. I tested redirecting traffic from HttpConnectionManager to a TcpProxy listening on a Unix domain socket, and it worked fine.

So what do you think about starting with support for TLS traffic only and then investigating support for plain TCP and HTTP in another pull request? Tunneling TLS is the most important case, and I don't want to block it with discussions about plain HTTP.

Edit:
I want to note that the current implementation supports HTTP telemetry for HTTP routes with TLS origination, so I think that there is no reason to block support for plain HTTP. The only case that is not covered yet is tunneling plain HTTP traffic to an HTTP port, but that might be done in the future. I don't want to block this feature because of a single case which might be implemented later.

I also want to remind everyone once again why both target_host and target_port are necessary:

without target_host we couldn't apply the destination rule to a virtual service which has more than one host or a host including a wildcard, because then we couldn't determine the target host;
without target_port we couldn't apply the destination rule to a virtual service which has no matching rule for a port.

So at first glance it might seem that specifying these properties is redundant, but thanks to them there are no limitations on the related virtual service.
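
To make the argument above concrete, here is a small Go sketch of what inferring the tunnel target from a virtual service alone would look like. `inferTunnelTarget` is a hypothetical helper written for this comment, not code from Istio; it only shows the three cases in which inference fails and explicit target_host/target_port are needed.

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

// inferTunnelTarget tries to derive "host:port" from a virtual service
// alone. hosts are the virtual service hosts; matchedPort is 0 when the
// route has no port match. Hypothetical helper for illustration only.
func inferTunnelTarget(hosts []string, matchedPort uint32) (string, error) {
	if len(hosts) != 1 {
		return "", errors.New("cannot pick a single target host")
	}
	if strings.Contains(hosts[0], "*") {
		return "", errors.New("wildcard host does not name a concrete target")
	}
	if matchedPort == 0 {
		return "", errors.New("route has no port match, so the target port is unknown")
	}
	return fmt.Sprintf("%s:%d", hosts[0], matchedPort), nil
}

func main() {
	fmt.Println(inferTunnelTarget([]string{"external.example.com"}, 443))
	fmt.Println(inferTunnelTarget([]string{"a.example.com", "b.example.com"}, 443))
	fmt.Println(inferTunnelTarget([]string{"*.example.com"}, 443))
	fmt.Println(inferTunnelTarget([]string{"external.example.com"}, 0))
}
```

Only the first call yields a usable target; the other three return an error, which is why the API asks for both values explicitly.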

costinm · 2022-04-18T02:17:54Z

On Fri, Apr 15, 2022 at 7:54 AM Jacek Ewertowski ***@***.***> wrote:

we find some non-Istio proxies that support this and don't get confused

Do you mean to ensure which proxies would it work with? I was testing it mostly with Envoy and there are no restrictions. I was also using squid and nginx with proxy_connect module. I could also test it with apache http.

I'm sure all will "work", treating it as a TCP stream. What I mean is that a proxy may have HTTP-level settings/policies, like allowing some domains or blocking others, or keeping track of the domains. That is the main reason proxies were required. I don't know how much that still matters these days with HTTPS. The fundamental problem is treating HTTP as TCP, both in Istio and in the proxy.

It's definitely possible to support HTTP port without changing anything in Envoy, but it's much more complex solution. I was testing redirecting traffic from HttpConnectionManager to a TcpProxy listening on unix domain socket and it worked fine.

Yes, it is more complex, but I think it is necessary. There are some changes to use an internal redirection and further optimize.

So what do you think to start with support only for TLS traffic and then investigate support for plain TCP and HTTP in another pull request? Tunneling TLS is the most important case and I don't want to block it with discussions about plain HTTP.

I agree. Let's just document in the API that it will not apply to HTTP or H2 for now - and not include any of the examples treating http as tcp.

jewertow · 2022-04-19T12:23:06Z

networking/v1alpha3/destination_rule.proto

+
+  // Configuration of tunneling TCP over other transport or application layers
+  // for the host configured in the DestinationRule.
+  // Tunnel settings can be applied to TCP or TLS routes and can't be applied to HTTP routes.


I will remove this note once I submit support for HTTP routes.

costinm · 2022-04-19T15:26:46Z

networking/v1alpha3/destination_rule.proto

+    string target_host = 2 [(google.api.field_behavior) = REQUIRED];
+
+    // Specifies a port to which the downstream connection is tunneled.
+    uint32 target_port = 3 [(google.api.field_behavior) = REQUIRED];


I know we have traditionally done things this way, but virtually everyone uses host:port or URLs; there is no need to be so verbose and ask the user to act as a URL parser. Why not:

// Specify the host to which connection is tunneled, as host:port or host or URL
string target

HTTPS_PROXY is a host:port.

I know using URL is controversial in Istio for some reason, so treat this as a rant, not blocking.

To be honest, it doesn't matter to me at all, so I would like to know the opinion of the API maintainers.
What do you think, @linsun?

I think targetHost and targetPort are pretty simple for users. I could be convinced to use the form below, but I feel that targetHost and targetPort offer more clarity.

target: {host:port}
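
As a rough illustration of the trade-off (an assumption-laden sketch, not code from this PR): with a single `target` string, the control plane has to do the splitting itself. In Go that would typically go through `net.SplitHostPort`, which also handles bracketed IPv6 literals. `parseTarget` is a hypothetical helper name.

```go
package main

import (
	"fmt"
	"net"
	"strconv"
)

// parseTarget splits a combined "host:port" value into the two pieces that
// targetHost and targetPort would carry separately.
// Hypothetical helper for illustration only.
func parseTarget(target string) (string, uint32, error) {
	host, portStr, err := net.SplitHostPort(target)
	if err != nil {
		return "", 0, err
	}
	// Ports are 16-bit; reject 0 and anything that does not parse.
	port, err := strconv.ParseUint(portStr, 10, 16)
	if err != nil || port == 0 {
		return "", 0, fmt.Errorf("invalid port %q", portStr)
	}
	return host, uint32(port), nil
}

func main() {
	for _, t := range []string{"proxy.example.com:3128", "[::1]:8080", "proxy.example.com"} {
		host, port, err := parseTarget(t)
		fmt.Printf("%q -> host=%q port=%d err=%v\n", t, host, port, err)
	}
}
```

With separate fields the port is mandatory by construction; with a combined string, a value without a port (the last input above) has to be rejected or given a default at validation time.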

costinm · 2022-04-19T15:29:27Z

networking/v1alpha3/destination_rule.proto

+    string protocol = 1 [(google.api.field_behavior) = REQUIRED];
+
+    // Specifies a host to which the downstream connection is tunneled.
+    // Target host must be an FQDN.


Why not an IP? HTTPS_PROXY does allow this, and it is common to not have a DNS name for proxies (many times they are used over VPN or on hosts where the DNS resolver is hard to change, since setting the proxy can be done by non-root users).

That's true, thanks. I just forgot to mention it. I added this information and pushed a new commit.
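
A minimal sketch of accepting either form for the target host, assuming a simple syntactic check is enough. This is not Istio's validation logic; `classifyTargetHost` and `validLabel` are hypothetical helpers, and the DNS check only enforces the usual label rules (letters, digits, hyphens, 1 to 63 characters per label).

```go
package main

import (
	"fmt"
	"net"
	"strings"
)

// classifyTargetHost reports whether s looks like an IP address ("ip"),
// a DNS name ("dns"), or neither ("invalid").
// Hypothetical helper for illustration only.
func classifyTargetHost(s string) string {
	if net.ParseIP(s) != nil {
		return "ip"
	}
	if s == "" || len(s) > 253 {
		return "invalid"
	}
	// Allow one trailing dot, as in a fully qualified name.
	for _, label := range strings.Split(strings.TrimSuffix(s, "."), ".") {
		if !validLabel(label) {
			return "invalid"
		}
	}
	return "dns"
}

// validLabel checks a single DNS label: 1-63 characters, letters, digits
// or hyphens, and no hyphen at either end.
func validLabel(l string) bool {
	if l == "" || len(l) > 63 || l[0] == '-' || l[len(l)-1] == '-' {
		return false
	}
	for _, c := range l {
		ok := c >= 'a' && c <= 'z' || c >= 'A' && c <= 'Z' || c >= '0' && c <= '9' || c == '-'
		if !ok {
			return false
		}
	}
	return true
}

func main() {
	for _, h := range []string{"10.0.0.1", "2001:db8::1", "proxy.example.com", "bad host", ""} {
		fmt.Printf("%q -> %s\n", h, classifyTargetHost(h))
	}
}
```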

TunnelSettings enables tunneling TCP traffic over other transport or application layers. Istio will initially support tunneling TCP over HTTP or H2 using CONNECT or POST methods, but the supported protocols list might be extended in the future. At the beginning tunnel settings will be applicable to TCP or TLS routes only, but support for HTTP routes is also on the roadmap.

Signed-off-by: Jacek Ewertowski <jewertow@redhat.com>

linsun

LGTM

Thank you @jewertow for your hard work on this!

jewertow requested review from linsun, louiscryan, nrjpoddar, howardjohn and ericvn as code owners March 15, 2022 20:59

istio-testing added the needs-rebase, size/S, and needs-ok-to-test labels Mar 15, 2022

jewertow force-pushed the destination-rule-tunneling-api branch from cf83a1f to d7d8f5e Compare March 15, 2022 21:14

istio-testing removed the needs-rebase label Mar 15, 2022

hzxuzhonghu reviewed Mar 17, 2022

istio-testing added the needs-rebase label Mar 18, 2022

linsun reviewed Mar 21, 2022

jewertow force-pushed the destination-rule-tunneling-api branch from 2d3616f to 366b826 Compare March 28, 2022 17:30

istio-testing removed the needs-rebase label Mar 28, 2022

jewertow mentioned this pull request Mar 29, 2022

Tunneling outbound traffic istio/istio#37968

Merged

howardjohn reviewed Mar 29, 2022

costinm requested changes Mar 29, 2022

jewertow requested review from linsun and costinm March 31, 2022 13:44

costinm reviewed Apr 1, 2022

howardjohn reviewed Apr 8, 2022

nrjpoddar reviewed Apr 11, 2022

jewertow commented Apr 19, 2022

jewertow force-pushed the destination-rule-tunneling-api branch from 47c5a20 to 362698b Compare April 19, 2022 13:32

costinm reviewed Apr 19, 2022

jewertow force-pushed the destination-rule-tunneling-api branch from 362698b to 793b356 Compare April 19, 2022 15:48

linsun approved these changes Apr 20, 2022

istio-testing merged commit b6a03a9 into istio:master Apr 20, 2022
