Make ncnn memory budget configurable #2070
Conversation
(I'll fix the Pyright error ASAP.)
CI is now passing.
@@ -125,7 +133,7 @@ def estimate_cpu():
     schema_id="chainner:ncnn:upscale_image",
     name="Upscale Image",
     description="Upscale an image with NCNN. Unlike PyTorch, NCNN has GPU support on all devices, assuming your drivers support Vulkan. \
-    Select a manual number of tiles if you are having issues with the automatic mode.",
+    Select a manual number of tiles or set a memory budget limit if you are having issues with the automatic mode.",
"or set a memory budget limit"

1. This doesn't tell users how to do that.
2. This solution is factually incorrect. Setting a memory budget won't help at all in GPU mode.
I'm not sure I follow; why would it not help in Vulkan mode?
Sorry, my bad. I didn't see the `heap_budget` in `min(heap_budget, vkdev.get_heap_budget() * 1024 * 1024 * 0.8)`.
Point 1 still stands, though.
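For reference, the quoted expression amounts to something like the following sketch. The wrapper function is a hypothetical name, not code from the PR; the `min(...)` expression itself is quoted above, and ncnn's `VulkanDevice.get_heap_budget()` reports MiB:

```python
def effective_vram_budget(heap_budget: int, vkdev) -> int:
    # Cap the user-configured budget (bytes) at 80% of the device's
    # reported heap budget; get_heap_budget() returns MiB, hence the
    # * 1024 * 1024 conversion to bytes.
    return int(min(heap_budget, vkdev.get_heap_budget() * 1024 * 1024 * 0.8))
```

So the user's budget does apply on the Vulkan path, but it can only lower the cap, never raise it past what the device reports.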
I don't see instructions on switching between GPUs in the backend metadata (though there is a mention of this in the README), which is why I didn't add similar instructions here. Would you be open to adding a note in the backend docs metadata about making sure the integrated GPU isn't selected, along with a note about setting a memory budget? (It seems to me that documenting both in-app would be a UX improvement.)
@@ -46,6 +47,7 @@ export const SettingsProvider = memo(({ children }: React.PropsWithChildren<unkn
     const useIsFp16 = useMemoArray(useLocalStorage('is-fp16', false));
     const usePyTorchGPU = useMemoArray(useLocalStorage('pytorch-gpu', 0));
     const useNcnnGPU = useMemoArray(useLocalStorage('ncnn-gpu', 0));
+    const useNcnnBudgetLimit = useMemoArray(useLocalStorage('ncnn-budget-limit', 1024 ** 5));
Why did you make the default 1 Petabyte?
The intent was to pick a default that is practically equivalent to "no limit". I find it very unlikely that any usage will go over 1 PiB; 1 TiB is low enough that some users might hit it (some of my test workflows hit 600 GiB or so, and I'm not as much of a power user as some people).
Wdym "might hit it"? This limit is for RAM/VRAM, right? Who even has 1 TB of RAM? I don't think people will be running chaiNNer on supercomputers any time soon.
My workstation has around 300 GiB of RAM, and I typically run chaiNNer in a VM on that workstation that has around 200 GiB of RAM assigned to it. I wouldn't be surprised if there exists some power user out there with 1 TiB of RAM (my mainboard supports up to 2 TiB), and I'm trying to avoid surprising behavior in such (admittedly rare) situations.
That said -- am I correct in assuming that your concern here is that 1 PiB looks "weird" in the GUI as the default, as opposed to having a concern about the default being "no effective additional limit"? If so, would it be OK to instead modify the backend to special-case a limit of 0 so that 0 means "no additional limit", and make 0 the default?
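If that's acceptable, the backend side of the proposal could be as small as this sketch (the function name and the sentinel handling illustrate the suggestion above; this is not existing code):

```python
def resolve_budget_limit(budget_limit: int) -> int:
    # Proposed sentinel: a stored value of 0 means "no additional limit",
    # so return an effectively unbounded cap and let the automatic
    # estimate govern on its own.
    if budget_limit <= 0:
        return 2**63 - 1
    return budget_limit
```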
42b1a92 to 1dc6422 (compare)
Putting this on hold until #2088 is merged.
)
else:
    # Empirically determined
Since you have so much RAM, I'm wondering if you'd be willing to run some tests. I've done some testing on NCNN VRAM estimation in the past (though only with ESRGAN), and I can already tell you that this isn't really correct, and neither is what we already have in place. I've only got 8GB VRAM, so I could never extend my tests far enough to gather fully complete data.
What I can say is that:
- Model size is not strongly correlated with VRAM usage. A model can be made larger simply by performing more convolutions, which does not matter for VRAM usage because they are executed in sequence. The only thing model size definitely correlates with is how much VRAM it takes to store the model itself.
- Individual weight sizes have a correlation with VRAM usage when running a model.
- Scale needs to be accounted for, which our estimation does not currently do. This estimation is based around a 4x scale, but an 8x model will blow past it.
I abandoned this back in the day because there seemed to be further factors I couldn't account for with the data I had, but maybe we can finally figure it out. Unfortunately, I seem to have deleted the set of different scale/nf/nb ESRGAN models I generated for these tests. If I can remember how I generated them all, I could send them to you.
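To make the shape of the problem concrete, an estimator along the lines of the points above might look like the sketch below. The functional form and every coefficient are placeholders to be fitted from measured runs, not results anyone has established:

```python
def estimate_vram_bytes(
    weight_bytes: int,  # bytes to store the model weights themselves
    tile_px: int,       # square tile edge length in pixels
    scale: int,         # model scale factor (2x, 4x, 8x, ...)
    a: float,           # fitted coefficient: per-input-pixel cost
    b: float,           # fitted coefficient: per-output-pixel cost
    c: float,           # fitted constant overhead
) -> float:
    # Fixed cost for holding the model, plus a term for the input tile
    # and a scale-dependent term for the output, per the points above.
    input_cost = a * tile_px * tile_px
    output_cost = b * (tile_px * scale) ** 2
    return weight_bytes + input_cost + output_cost + c
```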
@theflyingzamboni I would definitely be interested. Can we split this out into its own issue so that it doesn't get lost once this PR is merged?
Superseded by #2351
#1867 didn't support automatic estimation of tile size, which was not great for UX. This PR adds that missing support by allowing the user to choose a custom memory budget. Choosing a memory budget is likely to be better UX than choosing a tile size, since users are more likely to know how much RAM or VRAM they have than to know which tile size will work with their hardware and the model they picked. This should also work fine with Vulkan (and is likely to help with this UX issue), though I wasn't able to test that.
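The idea behind deriving a tile size from a budget, roughly: given a memory budget and an estimate of how many bytes a given model consumes per input pixel, the largest safe square tile follows directly. A minimal sketch under that linear-growth assumption (both the function and the per-pixel figure are illustrative, not the PR's actual code):

```python
import math

def max_tile_size(budget_bytes: int, bytes_per_input_pixel: float) -> int:
    # Largest square tile that fits the budget, assuming memory use grows
    # roughly linearly with the number of input pixels. The per-pixel
    # figure would come from a model-specific estimate.
    if bytes_per_input_pixel <= 0:
        raise ValueError("bytes_per_input_pixel must be positive")
    return int(math.sqrt(budget_bytes / bytes_per_input_pixel))

# e.g. an 8 GiB budget at a hypothetical 500 KiB per input pixel:
# max_tile_size(8 * 1024**3, 500 * 1024) -> 129 px tiles
```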