Support Multipass shaders (buffer A ...) #30

Vipitis · 2024-05-09T22:21:58Z

update:
this branch will be split into smaller PRs:

support wgpu v0.18.1 #39
refactor inputs Input.py refactor #40
refactor passes Passes refactor #41
improve shadercode snippets Improve and simplify shadercode #42
buffer pass support Add multipass pass support (buffers A-D) #43
resizing for buffers (included above)
profiling (optional, likely not)

part of #4

approximately 17.5% of public Shadertoys are multipass. Multipass allows up to 4 buffers (A through D) to be rendered as a texture. These can also be used to store data and enable quite some more experiences.

Some of the challenges include timing as well as cross inputs.
Buffer passes can seemingly take the exact same inputs as the main "Image" renderpass, including other buffers (and themselves?)

This PR starts to bloat a little and contains some refactor for the whole channel input concept... still in flux

Instead, will try to implement BufferTexture as a ShadertoyChannel subclass so it can hold for example the sampler settings.
Additionally, there will likely be a RenderPass base class and subclasses for Image, Buffer(a-d) and later cube and sound.
So the main Shadertoy class contains several render passes, and all of these get their inputs(channels) attached.
I even started to try and sketch it out - but will have to sleep through this for a few more days... my concepts change every day but I need to just try and work on the ideas for a bit.

The render order should be Buffer A through D and then Image. So you can keep temporal data, by using itself has an input.

TODOs:

Refactor common code to ShadertoyChannel base class
~~additional tests cases for inferred input types, empty channels~~ (caching conflict with pytests)
dynamic headers
~~test coverage for examples in readme!~~(different PR)
Working buffer
working multiple buffers!
tests for buffers
multipass examples
~~(maybe) some debug mode where you can render the buffers to canvas?~~ (you can use RenderDoc with "capture child processes")

Vipitis · 2024-05-24T00:10:34Z

Little update:
It seems to sorta work. But a bunch of stuff is still broken:

missing sampler filters fail to recreate some behavior, example: https://www.shadertoy.com/view/MfyXzV
having unsupported channels break the layout, example: https://www.shadertoy.com/view/ssjyWc (layout fixed by detecting channels in the common code, shader still broken, fixed in float32)
resizing the window breaks the nbytes of the previous frame - so maybe there needs to be a hook to the on_resize function to fill the new space (similar to behaviour when going fullscreen on the website)

new breaking examples found, that might be unrelated to this PR, but I will note them down for later reference:

some issue with resolution(maybe filters), but could be some of those fetching functions: https://www.shadertoy.com/view/X3c3WN (also has horrible performance?)
not sure: https://www.shadertoy.com/view/4X33zH (fixed by ensuring the order of channels is correct)
~~kinda has similarities to the issue with nested loops (Index value issue in GLSL front nested loop gfx-rs/wgpu#5246)~~ https://www.shadertoy.com/view/MdX3Rr turns out this issue exist because the alpha of the buffer pass stores motion vectors inside a negative float. Meaning we likely need to use float format for the buffer texture. I will give it a try...

Vipitis · 2024-06-07T22:25:02Z

I think I finally fixed the compatibility issue. There is some small visual issues which look like precision problems to me (not sure yet). And the performance is horrible it seems... Please let me know if you find any shaders that are broken (not due to missing features, wgpu bugs)

Will work on tests, examples and documentation to hopefully get this ready for next week.

E: found this one seemingly broken: >wgpu-shadertoy https://www.shadertoy.com/view/tsKXR3
detailed example of precision of this alpha channel is different: https://www.shadertoy.com/view/wsjSDt

Vipitis · 2024-06-21T19:25:18Z

I think this is finally ready for review - and I welcome some feedback.

This PR refactors the whole inputs/channels to be easily extendable with the missing channel types.
Multipass shaders are quite complex, but I learned quite a lot to get this running, but I think my implementation is what is minimally required for wgpu (with the redoing the sampler and pipeline).
resizing is really janky, since you download the texture and pad it with numpy only to upload it again - but I wanted to mirror the website (this includes breaking quite a few shaders)

wgpu_shadertoy/inputs.py

hmaarrfk · 2024-07-21T14:42:11Z

Cool stuff!

almarklein · 2024-09-30T09:57:02Z

wgpu_shadertoy/shadertoy.py

-            device=self._device, format=wgpu.TextureFormat.bgra8unorm
+            device=self._device, format=self._format


Can also set to None to have it select the preferred format. Less code, unless you need self._format. Note that the format is also accessible in texture.format on the texture obtained via get_current_texture().

hm, I had a look and the only other solution I can think of is to make it a property of the ImageRenderPass, as it's needed to create the render pipeline. The awful part is that at init time for the Image class, the canvas(_present_context) might not be accessible via the parent instance of Shadertoy.
It could be returned by this method instead of passed via an attribute. It could be useful to have available when translating the snapshot back into RGB.

Vipitis · 2024-10-31T16:46:34Z

My thesis is nearly submitted, so I can spent time again getting this finished.

For the performance issue I want to try the following: ping pong the render target to reduce the number of copies (which are likely the slowdown).

Also it might be the case that buffers don't followed the A-B-C-D order but instead the order in which they were added. Which I believe might be recorded in the json we get from the API. I will try to find some examples that proof the behavior.

E: maybe this can be merged as a MVP and performance improvements can be made in follow up PRs instead

Vipitis · 2024-11-21T00:11:19Z

CI failures are addressed in #36 which isn't included in this branch

Vipitis · 2024-12-10T23:37:11Z

found some more time to figure out the performance issues. I read somewhere that the queue is smart and might figure out of some copies and be avoided. Doesn't seem to make much difference. To avoid the guesswork I added some rudimentary profiling which still needs some work. But looking at the data already shows that something is up.

the slower rendertimes on the right are full screen (instead of small window). But the constant spikes are a problem - will find the time to run more experiments in the following days and try different systems (to rule out weirdness of Intel GPUs). Seems like something(I suspect the copies) is causing the GPU to hang for ~60m.

Vipitis · 2024-12-20T00:25:25Z

while making the vertex and fragment code templates simpler - I still need to do a yflip for the image pass. I couldn't find any setup for the gui canvas to accepted a flipped image. This works mostly fine, however I have been informed of dFdy which will behave different due to this flip. Here is an example showing the problem.
: it should be the same red version across the whole canvas, but only the part reading from buffer texture is, the image pass (which is flipped in the vertex stage) doesn't behave.

Perhaps there is a solution to this down the line in rendercanvas? I would love to migrate to that once this PR gets merged.

almarklein · 2024-12-20T09:43:40Z

Perhaps there is a solution to this down the line in rendercanvas?

No, the definition of the viewport is simply part of the wgpu spec (https://wgpu-py.readthedocs.io/en/stable/guide.html#coordinate-system). IIRC this is reversed from OpenGL.

however I have been informed of dFdy which will behave different due to this flip.

Aha, I gues that one will be multiplied with -1 ... 🤔 can we tell users to use a custom dFdy that we provide?

Vipitis · 2024-12-20T18:29:07Z

can we tell users to use a custom dFdy that we provide?

Rewrite rules are one option, but can be really complicated, so there might be better ideas. Doing a whole renderpass or compute pass just to flip the out image also seems wasteful. Plus there might be other functionality that is currently broken which I don't know about yet.

I will track this as a standalone issue as it already exists before this PR too.

Vipitis · 2025-01-19T00:27:29Z

Since this is a massive PR and by now the git history is sorta messed up. Would it help to split this up into a few smaller pieces?

I am thinking: bug fixes, wgpu v0.18.1 support, input refactor/API stuff, buffer/Multipass(will have the improved shader code snippets), fixes to resizing, and optionally the profiling stuff (which I think isn't even reliable right now).

My goal is to get this reviewed and merged to have a release v0.2 done before 5th February for a paper(my thesis) deadline which links to this branch.

Korijn · 2025-01-19T08:12:15Z

Makes sense! It's a lot easier to review and test in smaller batches

Vipitis · 2025-05-07T22:05:39Z

closed as #43 has been merged instead. The branch will stay because some other projects still point to it.

Vipitis mentioned this pull request May 9, 2024

Missing compatibility features with Shadertoy webseite #4

Open

16 tasks

Vipitis marked this pull request as ready for review June 21, 2024 19:13

Vipitis changed the title ~~[WIP] Support Multipass shaders (buffer A ...)~~ Support Multipass shaders (buffer A ...) Jun 24, 2024

Vipitis commented Jun 25, 2024

View reviewed changes

wgpu_shadertoy/inputs.py Show resolved Hide resolved

Vipitis requested a review from Korijn July 2, 2024 23:10

Vipitis mentioned this pull request Jul 21, 2024

Update wgpu dependency to match pygfx #32

Merged

Vipitis mentioned this pull request Jul 21, 2024

Use ruff --check because ruff suggests it #33

Merged

Vipitis added 19 commits July 26, 2024 00:40

Initial texture channel refactor

786f8a2

small clarification on .snapshot usage

c49f43d

keep base channels working

5414d0d

consider renderpasses in main

d4c943a

add renderpass classe stubs

a07c201

refactor some code to the channel classes

5242c70

start move to ImagePass for main image code and channels

e667479

start draw_buffer function

451f9f4

split up _prepare_render

18c0990

initialize buffers with zero

71d97d4

move prepare_render function to passes

f182699

static buffer pass working?

60e8a3a

put passes into it's own file

e5b67fe

naive update textures function

2bcbac8

fix color and orientation

e10aa81

fix type annotations

8529aeb

only update dynamic channels

8e2b577

refactor duplicate code to method

cfc388f

add row padding, resizing still broken

040d9e9

almarklein reviewed Sep 30, 2024

View reviewed changes

Vipitis added 2 commits November 21, 2024 00:50

fix missing buffer pass

039ece8

fix example

2c36068

Vipitis added 4 commits November 25, 2024 20:38

fix gamma

a16ee5f

submit the command buffers once

452013f

add simple profiling

5b5befd

make profiling optional

1ce5500

Vipitis added 3 commits December 18, 2024 00:07

fix profiling for fewer passes

47e561c

refactor glsl vertex

f67a247

refactor wgsl vertex and fragment

7544126

Vipitis mentioned this pull request Dec 20, 2024

Y-Flip in Image pass causes inconsistencies #38

Open

Vipitis added 3 commits December 31, 2024 00:46

simplify GLSL uniforms

91d364e

merge branch main into multipass

a55566e

ruff format

263f5f7

Vipitis marked this pull request as draft January 20, 2025 00:30

This was referenced Jan 20, 2025

support wgpu v0.18.1 #39

Merged

Input.py refactor #40

Merged

Passes refactor #41

Merged

Vipitis mentioned this pull request Jan 28, 2025

Improve and simplify shadercode #42

Merged

Vipitis mentioned this pull request Mar 17, 2025

Add multipass pass support (buffers A-D) #43

Merged

8 tasks

Vipitis closed this May 7, 2025

		device=self._device, format=wgpu.TextureFormat.bgra8unorm
		device=self._device, format=self._format

Support Multipass shaders (buffer A ...) #30

Support Multipass shaders (buffer A ...) #30

Uh oh!

Conversation

Vipitis commented May 9, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

TODOs:

Uh oh!

Vipitis commented May 24, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Vipitis commented Jun 7, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Vipitis commented Jun 21, 2024

Uh oh!

Uh oh!

hmaarrfk commented Jul 21, 2024

Uh oh!

almarklein Sep 30, 2024

Choose a reason for hiding this comment

Uh oh!

Vipitis Oct 3, 2024

Choose a reason for hiding this comment

Uh oh!

Vipitis commented Oct 31, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Vipitis commented Nov 21, 2024

Uh oh!

Vipitis commented Dec 10, 2024

Uh oh!

Vipitis commented Dec 20, 2024

Uh oh!

almarklein commented Dec 20, 2024

Uh oh!

Vipitis commented Dec 20, 2024

Uh oh!

Vipitis commented Jan 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Korijn commented Jan 19, 2025

Uh oh!

Vipitis commented May 7, 2025

Uh oh!

Uh oh!

Vipitis commented May 9, 2024 •

edited

Loading

Vipitis commented May 24, 2024 •

edited

Loading

Vipitis commented Jun 7, 2024 •

edited

Loading

Vipitis commented Oct 31, 2024 •

edited

Loading

Vipitis commented Jan 19, 2025 •

edited

Loading