Conversation

@HorlogeSkynet (Owner) commented May 22, 2020

This patch is mainly inspired by the work of @ingrinder (see 2bbc2dae).
You may notice execution up to twice as fast.

This behavior can be disabled with the new `parallel_loading` configuration option.

See #68.
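For context, the idea can be sketched roughly as follows. This is a minimal, hypothetical example: `load_entries`, `entry_factories`, and the `parallel_loading` flag are illustrative names, not Archey's actual internals.

```python
# Hypothetical sketch of parallel entry loading (illustrative names,
# not Archey's actual API).
from concurrent.futures import ThreadPoolExecutor


def load_entries(entry_factories, parallel_loading=True):
    """Instantiate each entry, optionally in parallel.

    Entry construction is mostly I/O-bound (reading /proc files,
    running external commands), so threads overlap the waiting time
    instead of paying for it sequentially.
    """
    if not parallel_loading:
        # Sequential fallback, controlled by the configuration option.
        return [factory() for factory in entry_factories]
    with ThreadPoolExecutor() as executor:
        # `map` preserves input order, so output stays deterministic.
        return list(executor.map(lambda factory: factory(), entry_factories))
```

With many entries each blocking on I/O for tens of milliseconds, overlapping them in a thread pool is what yields the "up to twice as fast" figure mentioned above.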

How has this been tested?

Locally, and via the test cases.

Types of changes:

  • New feature (non-breaking change which adds functionality)
  • Breaking change [?] (fix or feature that would cause existing functionality to change)

Checklist:

  • [ ] [IF NEEDED] I have updated the README.md file accordingly;
  • [ ] [IF NEEDED] I have updated the test cases (which pass) accordingly;
  • My changes look good;
  • I agree that my code may be modified in the future;
  • My code follows the code style of this project (PEP 8).


Co-authored-by: Michael Bromilow <12384431+ingrinder@users.noreply.github.com>
@HorlogeSkynet HorlogeSkynet added the enhancement ⬆️ Implements a new feature, fixes or improves existing ones label May 22, 2020
@HorlogeSkynet HorlogeSkynet added this to the v4.8.0 milestone May 22, 2020
@HorlogeSkynet HorlogeSkynet requested a review from ingrinder May 22, 2020 10:26
@HorlogeSkynet HorlogeSkynet self-assigned this May 22, 2020
@HorlogeSkynet HorlogeSkynet linked an issue May 22, 2020 that may be closed by this pull request
@ingrinder (Collaborator) commented:

Since we're executing I/O-heavy work, should we use `len(Entries)` (or maybe a static value, so 18 currently) as our `ThreadPoolExecutor`'s `max_workers`? This would mean that in a case where nearly all entries were blocking on I/O, we'd still get at least another thread for another entry to begin its processing while waiting on the others, which should result in the quickest possible execution time. We don't really do anything CPU-heavy that would be avoided by not spawning threads, and the improvements in Python 3.8 and above also mean we never spawn more threads than necessary to complete all of the work in parallel.

@HorlogeSkynet (Owner, Author) commented May 24, 2020

You're actually right 👍
So what about mixing our needs with the "new" Python 3.8+ behavior?

```diff
             executor = cm_stack.enter_context(
-                ThreadPoolExecutor(max_workers=((os.cpu_count() or 1) * 5))
+                ThreadPoolExecutor(max_workers=min(len(enabled_entries), (os.cpu_count() or 1) + 4))
             )
```

Reference: python/cpython#13618
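For reference, Python 3.8 changed `ThreadPoolExecutor`'s default `max_workers` from `(os.cpu_count() or 1) * 5` to `min(32, (os.cpu_count() or 1) + 4)`, a heuristic tuned for I/O-bound work. The sizing above combines that cap with the number of enabled entries; a hypothetical stand-alone sketch (`pool_size` is an illustrative helper, not code from this patch):

```python
import os


def pool_size(enabled_entry_count):
    """Hypothetical helper mirroring the worker sizing discussed above.

    Never spawns more workers than there are entries (no idle threads),
    while bounding the count by CPU count + 4, the heuristic Python 3.8
    adopted for ThreadPoolExecutor's default max_workers.
    """
    return min(enabled_entry_count, (os.cpu_count() or 1) + 4)
```

So with only two entries enabled, two threads are started; with dozens of entries, the pool stays near the CPU count rather than growing five-fold.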

@ingrinder (Collaborator) previously approved these changes May 26, 2020 and left a comment:

I was worried that limiting threads based on CPU count could slow down execution for the longer-running entries; however, after some testing (and fun playing with matplotlib! 😆) on an older Core 2 Duo machine, it looks like it's just fine 👍.

@ingrinder ingrinder dismissed their stale review May 26, 2020 02:00

Just a moment - just noticed an (admittedly stupid) edge case we should deal with.

...maybe *someone* just wants the logo?
@HorlogeSkynet (Owner, Author) commented:
Nice catch! Thanks 🙂

@HorlogeSkynet HorlogeSkynet merged commit f74dda9 into master May 26, 2020
@HorlogeSkynet HorlogeSkynet deleted the feature/multithreading branch May 26, 2020 07:02
Successfully merging this pull request may close these issues:

[QUESTION] [FEATURE] Multithreading?