Conversation

@gseddon commented May 6, 2025

Hi,
A bigger change here but I hope it is valuable! We want to contribute HTTP/3 support to Oha. This is an initial CR which refactors the client.rs to use common functions for http1 and http2 load generation.
It also introduces a HttpWorkType enum which is matched on to choose which functionality to execute, rather than the previous is_http2() function.

This lays the groundwork for adding a H3 work type in a following PR, and a http3 load generation common function.
This will be feature gated due to its experimental nature.
This has been split off into a separate PR to make them easier to understand and review individually. It's nice to be able to have a refactor PR which doesn't need to change any tests - it is internal structural changes only.

This PR has had an initial review at gseddon#1.
The HTTP/3 branch is at gseddon#2; I will open that PR once this one is merged.

@hatoo (Owner) commented May 7, 2025

Hi, HttpWorkType looks good.
But please leave the load generation functions as they were.
Your changes are bad for performance, and it's not working on my machine:

cargo run -- -z 10s http://127.0.0.1:3000
# stop after 50 requests ... 

@gseddon (Author) commented May 12, 2025

Hi, I fixed the performance bug for HTTP1 with 'deadline' enabled. Good catch, sorry. As for performance impact, these changes should have minimal effect. I've done a few runs comparing before and after and found no difference at all.
The only part I've split out into functions is the setup of the initial loop for each of the load generation patterns. These only get called once per load generation run, so the impact is essentially nothing.
Do you want me to perform some more benchmarking and come back with data?

@gseddon (Author) commented May 21, 2025

Would it be better if I split this into smaller changes?

@hatoo (Owner) commented May 21, 2025

Would it be better if I split this into smaller changes?

Sorry for the delay.
Yes, it would be very helpful if you split this into smaller PRs.

@hatoo (Owner) commented May 24, 2025

The only part I've split out into functions is the setup of the initial loop for each of the load generation patterns. These only get called once per load generation run, so the impact is essentially nothing.

That's not true for fn work(). It now uses a channel instead of an AtomicUsize, and that introduces some overhead.

In my environment, the difference is noticeable.

❯ cargo run --profile release-ci -- -n 10000000 -c 1000 http://localhost:3000
    Finished `release-ci` profile [optimized] target(s) in 0.15s
     Running `target/release-ci/oha -n 10000000 -c 1000 'http://localhost:3000'`
Summary:
  Success rate: 100.00%
  Total:        16.9223 secs
  Slowest:      0.1381 secs
  Fastest:      0.0000 secs
  Average:      0.0017 secs
  Requests/sec: 590935.7094
❯ oha -n 10000000 -c 1000 http://localhost:3000 # v1.8.0
Summary:
  Success rate: 100.00%
  Total:        16.3692 secs
  Slowest:      0.1259 secs
  Fastest:      0.0000 secs
  Average:      0.0016 secs
  Requests/sec: 610903.7964

Of course, the performance gap is small, but I care about it.

@hatoo (Owner) commented May 24, 2025

Could you make this PR contain only the HttpWorkType changes?
We can discuss the refactoring in other PRs. I think the general idea of the refactoring is good. Thank you.

@gseddon (Author) commented May 29, 2025

Hi, so I removed the refactoring for the work_until function that used the endless_emitter, because that had more of a slowdown than the other paths. What are your thoughts on the latest changes?
I think having a 1% performance difference on the 'slow' path is not so bad, given the amount of code that is removed. The 'fast' path should still be just as fast.
Also, what server are you using to test oha against? I want to use the same one so I can make sure we're measuring the same thing.

@hatoo (Owner) commented May 30, 2025

Could you limit this PR to just the HttpWorkType changes?
I really care about the performance of all paths.

For this code,

async fn parallel_work_http1(
    n_connections: usize,
    rx: AsyncReceiver<Option<Instant>>,
    report_tx: kanal::Sender<Result<RequestResult, ClientError>>,
    client: Arc<Client>,
    deadline: Option<std::time::Instant>,
) -> Vec<tokio::task::JoinHandle<()>> {

The rx: AsyncReceiver<Option<Instant>> adds some overhead for workers without latency correction, because they always send None, but the size of that message isn't zero.

I test against https://github.com/hatoo/sandbag

@gseddon (Author) commented May 30, 2025

Ok, I can do that. One thought to discuss, though: is it a good thing for oha to have different performance depending on how a user invokes it? For example, all of the qps paths will perform, say, 1% worse because they use the channels. So the user might see different performance from their application depending on whether they use -q or not, or even between the -n and -z flags. Would it be better to have the same function used for all of these paths so that the user gets consistent performance? Also, if you then optimise that one function, all of the load tests automatically get faster.

@hatoo (Owner) commented May 31, 2025

is it a good thing for oha to have different performance depending on the way a user invokes it? For example, all of the qps paths will perform say 1% worse because they use the channels. So the user might see a different performance of their application depending on whether they use -q or not, or even perhaps between the -n and -z flags. So is it better to have the same function used for all of these paths so that the user gets consistent performance?

Yeah, performance variation depending on command line flags is not good.
But keeping the code optimal is more important to me.

Also then if you performance optimise that one function, then all of the load tests automatically get faster.

I'm not saying I'll refuse all refactoring. I think we can refactor the work* functions to use a common function while keeping the current performance by using generics.

@hatoo (Owner) commented Jun 15, 2025

Done at #746

@hatoo closed this Jun 15, 2025