
WebGPU Support#713

Draft
gkjohnson wants to merge 302 commits into main from webgpu-pathtracer

Conversation

@gkjohnson
Owner

@gkjohnson gkjohnson commented Feb 4, 2026

cc @TheBlek

I've branched from #705 and changed things around quite a bit to address the storage buffer limitations by using storage textures, and organized kernels into dedicated classes. So canvas resize etc. all works. I have also separated the "MegaKernel" from the "PathTracerCore" so it will be easier to follow the differences and dependencies between the implementations.

Next I'm going to look into some of the ideas around a ray queue we'd discussed previously. Then we can try some timing to see how things pan out.


Relatedly, this write-up will be interesting for a wavefront path tracer:

https://developer.blender.org/docs/features/cycles/kernel_scheduling/

Plans
  • Add async generation
  • Adjust queue sizes based on needs, allow more control over the exact number of rays handled per frame
  • Smarter background, env caching
  • Adjust wavefront tile-based ray accumulation - just track a head pointer to a pixel and march forward?
  • Test across browsers, mobile devices
    • Use custom float texture interpolation
  • Optimize render to screen material
  • Issue announcing deprecation and future removal of WebGL version, close WebGL-related issues
  • WebGLPathTracer parity
    • Add PBR materials
    • Next event estimation
    • Fog (subsurface scattering?)
    • Perform opacity testing DURING BVH traversal
  • Cleanup
  • Move compute data class to three-mesh-bvh
  • Use global variable declarations for wgslTagFn variables (use .global member)
  • Consider "getShapecastFn" API - allow for more flexibility in function APIs? Remove requirement for return struct?
    • make function signatures consistent (pass a pointer to "transformStruct")
  • Test and support negative scaling especially for face culling
Future
  • Inspect size limits for geometry (non indexed crab)
    • Convert geometry definitions to textures
    • Allow for expanding storage sizes (needs to be specified at construction time)
  • Add "debug" views (sample count, completion visualization, etc)
  • Reuse uniforms, data across backend swaps
  • Add support for partially-updating / fast-updating bvh data
    • just transforms
    • just visibility
    • rewrite / refit bvhs
    • just materials
    • per-mesh, material, etc?
    • adjust ObjectBVH to sort objects based on uuid before construction for reliable ordering
  • wgslTagFn, wgslFn, etc for general usage: add support for an on-the-fly transpiled code node so nodes can be used in both backends.
  • consider adding "normalMatrix" to transform struct to avoid recalculation
  • provide "minimal" structs to reduce fetching overhead from storage buffers (eg a struct with only position for bvh traversal). The same buffer can be bound with different struct layouts.
  • concurrent handling of rays for the same pixel
  • Add variance detection
  • Add "completion" detection
  • Bidirectional path tracing
  • Custom material BSDF, definitions
  • WebGPU Denoiser
    • Render normal, albedo to separate array texture layers
  • Improved texture usage

@gkjohnson gkjohnson marked this pull request as draft February 4, 2026 13:13
@TheBlek TheBlek mentioned this pull request Feb 5, 2026
@gkjohnson
Owner Author

@TheBlek - I'm going to call this "done" as a first pass, for now. There are some workarounds for three.js issues which are marked in TODOs, but it's working fairly well. One of the features I'm liking the most is how scalable it is - we can reduce the number of rays processed per frame based on framerate, and the page can remain responsive since the whole 7+ bounce path doesn't need to finish in a single pass. Curious to hear your thoughts.

The overall approach works like so:

  1. Iterate over all pixels in a tiled format and push rays to trace onto a ring buffer work queue. We only iterate over a tile if there's enough space in the queue to add rays for all pixels in the tile (even though in practice we may be skipping some). Rays that have been added to the queue have their pixels marked as "active" to avoid adding multiple rays for the same pixel. We also issue a compute call for every tile but use indirect dispatch buffers to "cancel" unneeded generation when the queue has become full.

  2. Trace rays in the work queue against the BVH. If there is no hit then accumulate the color in the final target buffer, increment the sample count, and mark the pixel as "inactive". If it does hit then add it to the "hitQueue". Then increment the ray queue ring buffer head pointer forward.

  3. Process the hits. If we have reached the maximum bounce count then terminate the ray, mark the pixel as inactive, and increment the sample count. Otherwise add a scatter ray back to the ray queue. Then go back to step 1 to "top up" the queue with any inactive pixels and start again.
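The three steps above can be sketched on the CPU like so. This is a minimal JavaScript sketch with hypothetical names (`RingQueue`, `renderPass`, etc.) - in the actual implementation these stages run as separate WGSL compute kernels with the queues living in storage buffers, and step 1 generates rays tile-by-tile with indirect dispatch rather than per pixel:

```javascript
// Ring buffer work queue; head/tail are monotonically increasing and
// wrapped with modulo on access, mirroring the GPU-side head pointer.
class RingQueue {
  constructor(capacity) {
    this.items = new Array(capacity);
    this.capacity = capacity;
    this.head = 0; // next item to consume
    this.tail = 0; // next free slot
  }
  get size() { return this.tail - this.head; }
  get space() { return this.capacity - this.size; }
  push(item) { this.items[this.tail++ % this.capacity] = item; }
  pop() { return this.items[this.head++ % this.capacity]; }
}

function finishSample(pixel) {
  // Accumulation into the target buffer would happen here on the GPU.
  pixel.samples += 1;
  pixel.active = false;
}

// One frame of the wavefront loop. traceFn(ray) returns true on a BVH hit.
function renderPass(pixels, rayQueue, hitQueue, maxBounces, traceFn) {
  // Step 1: "top up" the queue with camera rays for inactive pixels.
  for (const p of pixels) {
    if (!p.active && rayQueue.space > 0) {
      p.active = true;
      rayQueue.push({ pixel: p, depth: 0 });
    }
  }
  // Step 2: trace queued rays; a miss accumulates and finishes the sample.
  while (rayQueue.size > 0) {
    const ray = rayQueue.pop();
    if (traceFn(ray)) hitQueue.push(ray);
    else finishSample(ray.pixel);
  }
  // Step 3: process hits; terminate at the bounce limit, else scatter a
  // new ray back onto the ray queue for the next pass.
  while (hitQueue.size > 0) {
    const ray = hitQueue.pop();
    if (ray.depth + 1 >= maxBounces) finishSample(ray.pixel);
    else rayQueue.push({ pixel: ray.pixel, depth: ray.depth + 1 });
  }
}
```

Because scatter rays are re-queued rather than traced recursively, a bounded amount of work per `renderPass` call keeps the page responsive even for long bounce chains.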

--

A few things that need to be considered or added to aid with performance at some point:

  • Add support for a maximum sample count to stop adding and working on rays for pixels that have already "finished".

  • We'll want some method for detecting that a minimum of X samples have finished across the image so that we can determine when it's ready to show and avoid displaying a partially-finished render. Probably a simple compute pass that checks all pixels and writes to a storage buffer we can read back whether any pixel has not passed the threshold.

  • Adding some kind of "convergence detection" using a minimum sample count and the tracked variance of the samples. This will let a pixel be marked as "completed" if it converges early (diffuse surfaces, unlit surfaces, background, etc.) so we can skip rays for these cases and focus on the pixels that need more samples to converge.

  • Related to the above point: we'll eventually reach a point where only a few hundred pixels or fewer are left to process, at which point it would be best to dispatch multiple rays per pixel, and we'll need to handle the race condition of rays writing to the same pixel. This will probably involve adding a special kernel that resolves multiple rays writing to the same pixel.
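The convergence detection idea could be sketched with an online mean/variance estimate (Welford's algorithm) kept per pixel. This is just a sketch under assumed names and thresholds (`MIN_SAMPLES`, `REL_VARIANCE_THRESHOLD` are illustrative, not values from this PR); on the GPU these accumulators would live in a storage texture and the check would run in a small compute kernel:

```javascript
// Hypothetical per-pixel convergence check using Welford's online
// mean/variance update over sample luminance.
const MIN_SAMPLES = 16;             // illustrative minimum sample count
const REL_VARIANCE_THRESHOLD = 1e-3; // illustrative relative threshold

function createAccumulator() {
  return { count: 0, mean: 0, m2: 0, converged: false };
}

function addSample(acc, luminance) {
  // Welford update: numerically stable running mean and sum of squared
  // deviations (m2), so variance = m2 / (count - 1).
  acc.count += 1;
  const delta = luminance - acc.mean;
  acc.mean += delta / acc.count;
  acc.m2 += delta * (luminance - acc.mean);

  if (acc.count >= MIN_SAMPLES) {
    const variance = acc.m2 / (acc.count - 1);
    // Scale the threshold by mean^2 (plus epsilon) so bright pixels
    // aren't held to a stricter absolute bound than dark ones.
    acc.converged = variance <= REL_VARIANCE_THRESHOLD * (acc.mean * acc.mean + 1e-6);
  }
  return acc.converged;
}
```

A converged pixel would then be skipped during ray generation, leaving queue capacity for the noisy pixels that still need samples.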

--

I'll wait to see where you're going before putting too much more work into this path tracing logic specifically. I may look at some of the other points I mentioned in #705 (comment) when I have time.

@gkjohnson
Owner Author

Here's another article from AMD on GPU-based path tracing that might have some good insight, as well:

https://gpuopen.com/download/2025_RT_TechReport.pdf

@gkjohnson
Owner Author

An open BSDF implementation from Adobe - maybe good for reference: https://github.com/adobe/openpbr-bsdf

It only implements unidirectional path tracing, though - not bidirectional. I'm not sure how complicated it would be to derive the sibling PDF needed for bidirectional.

TheBlek and others added 5 commits March 10, 2026 12:32
WebGPUPathTracer: Calculate detailed sample counts
WebGPUPathTracer: Improve build performance, fix failures when rendering an empty scene
WebGPUPathTracer: Get megakernel working in Firefox, Safari