
Conversation

joshuay03 (Contributor) commented Dec 30, 2024

Description

Context:

With fork_worker un-commented and the original sleep, the test times out for me locally, but it doesn't on this branch. We go from ~10 restarts to ~3 restarts in a non-fork_worker test like this one, which I think is perfectly reasonable for what these tests are asserting.

Fixing this is necessary to unblock #3297.

cc/ @MSP-Greg

Your checklist for this pull request

  • I have reviewed the guidelines for contributing to this repository.
  • I have added (or updated) appropriate tests if this PR fixes a bug or adds a feature.
  • My pull request is 100 lines added/removed or less so that it can be easily reviewed.
  • If this PR doesn't need tests (docs change), I added [ci skip] to the title of the PR.
  • If this closes any issues, I have added "Closes #issue" to the PR description or my commit messages.
  • I have updated the documentation accordingly.
  • All new and existing tests passed, including Rubocop.

@joshuay03 changed the title to Reduce TestIntegration#restart_does_not_drop_connections restart frequency on Dec 30, 2024
@joshuay03 force-pushed the fix-timing-out-phased-restart-fork-worker-test branch from 1732a15 to ac6bdca on December 30, 2024 02:00
@github-actions bot added the waiting-for-review (Waiting on review from anyone) label on Dec 30, 2024
@joshuay03 force-pushed the fix-timing-out-phased-restart-fork-worker-test branch 4 times, most recently from cfaf63a to d6155e8, on December 30, 2024 03:46
Comment on lines 43 to 46
```diff
  def test_phased_restart_does_not_drop_connections_threads_fork_worker
    restart_does_not_drop_connections num_threads: 10, total_requests: 3_000,
-     signal: :USR1 #, config: 'fork_worker', log: true
+     signal: :USR1, config: 'fork_worker'
  end
```
Member

If this is the only test that needs sleep 1 in restart_does_not_drop_connections, what about passing that in from the test? That would make it clearer why we need to sleep 1 second.

Or does it always make sense to sleep 1 second? I'm not sure what we're waiting for; that should probably be explained better in the comment above the sleep.
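
Something along these lines is what I have in mind; the restart_delay keyword and the helper's parameter list below are only a sketch, not the actual signature in TestIntegration:

```ruby
# Sketch only: the restart_delay keyword and these parameter names are assumed.
def restart_does_not_drop_connections(num_threads:, total_requests:,
                                       signal:, config: nil,
                                       restart_delay: 0.15)
  # ... boot the server, spawn the client threads, then in the restart loop:
  #   sleep restart_delay   # pause before signalling the next restart
end

def test_phased_restart_does_not_drop_connections_threads_fork_worker
  restart_does_not_drop_connections num_threads: 10, total_requests: 3_000,
                                    signal: :USR1, config: 'fork_worker',
                                    restart_delay: 1 # makes the longer wait explicit at the call site
end
```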

Contributor Author

> Or does it always make sense to sleep 1 second? I'm not sure what we're waiting for; that should probably be explained better in the comment above the sleep.

IMO it always makes sense to sleep 1 second. The sleep is just how long we wait before signalling another restart. All the tests that use #restart_does_not_drop_connections set a total_requests of at most 3000. With a sleep of 0.15 we restart ~10 times, so something like 300 requests before each restart. With a sleep of 1, ~3 times, so ~1000 requests before each restart.

Seeing as the tests are to ensure connections aren't dropped:

  1. I don't think the number of restarts really matters, as long as there's at least one
  2. I don't think (hot/phased) restarting puma every 300 requests consecutively is a realistic scenario anyway

Essentially, this is an arbitrary threshold that happens to time out one particular test, and I don't think adjusting it to the new value for all tests affects their accuracy.
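
For reference, a minimal sketch of the loop I have in mind (simplified, not the actual TestIntegration helper): the client threads work through total_requests on their own, and this loop only decides how often another restart gets signalled.

```ruby
# Simplified sketch, not Puma's real helper: `done` is a callable that returns
# true once the client threads have finished their requests.
def drive_restarts(pid, signal, pause:, done:)
  restarts = 0
  until done.call
    Process.kill signal, pid   # :USR1 for a phased restart, :USR2 for a hot restart
    restarts += 1
    sleep pause                # 0.15 => ~10 restarts over 3000 requests; 1 => ~3
  end
  restarts
end
```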

Member

Agree, thanks for explaining!

joshuay03 (Contributor Author) commented Jan 2, 2025

There were still 1–2 macOS failures each run with the increased sleep. I tried increasing it further and also added an assertion on the restart count to ensure there are always at least 2 restarts, and that assertion started failing, i.e. with that long a sleep we only restart once. I instead went for an exponential back-off, which seems to yield a more consistent and desirable result.
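
Roughly the shape of the back-off, for illustration only; the starting delay and factor below are placeholders rather than the exact values in the diff:

```ruby
# Placeholder values; the actual change picks its own starting delay and factor.
def signal_restarts_with_backoff(pid, signal, done:, delay: 0.5, factor: 2)
  restarts = 0
  until done.call
    Process.kill signal, pid
    restarts += 1
    sleep delay
    delay *= factor            # exponential back-off between restart signals
  end
  restarts
end
```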

The remaining failure seems to be unrelated; I'd appreciate it if someone could rerun to confirm.

Member

Nice, thanks.

Yes, unrelated: #3478 (comment)

@joshuay03 force-pushed the fix-timing-out-phased-restart-fork-worker-test branch 8 times, most recently from eb47e82 to e2191b5, on January 2, 2025 09:44
@joshuay03 force-pushed the fix-timing-out-phased-restart-fork-worker-test branch from e2191b5 to b1f5f48 on January 2, 2025 09:46
MSP-Greg (Member) commented Jan 3, 2025

@joshuay03

Good day. Thanks for working on this. The test probably evolved from code tested locally; that can often be an issue with GHA CI. I'm sure you've had test code that ran fine locally but failed when run in GHA.

I did a revamp of the code, where the number of restarts can be specified. See:

master...MSP-Greg:puma:00-hot-restart

I've run it in CI a few times, and it seems more reliable than the current code. Any thoughts? Feel free to use the code. I haven't worked on the fork_worker issue...
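
As a rough paraphrase of the approach (the names and structure below are illustrative, not the branch's actual code), the loop is driven by an explicit restart count rather than a tuned sleep:

```ruby
# Illustrative only: run a fixed number of restarts, waiting for each to finish,
# instead of tuning how long to sleep between signals.
def restart_does_not_drop_connections(pid, restart_count: 2, signal: :USR2)
  restart_count.times do
    Process.kill signal, pid
    sleep 1                    # stand-in for "wait until the restart completes"
  end
  # ... then join the client threads and assert no connection was dropped ...
end
```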

@dentarg mentioned this pull request on Jan 10, 2025
@dentarg merged commit a478d4d into puma:master on Jan 10, 2025
86 of 87 checks passed
@dentarg removed the waiting-for-review (Waiting on review from anyone) label on Jan 10, 2025