Conversation
Pull request overview
Upgrades the @geti/smart-tools web ONNX Runtime dependency to a newer onnxruntime-web version to enable the webgpu execution provider, and updates the Segment Anything pipeline/session setup to work with the new runtime behavior (including worker-safe tensor passing).
Changes:
- Bump `onnxruntime-web` to `~1.24.3` (and update the root `package-lock.json` accordingly).
- Update WASM/runtime configuration and default execution providers to prefer `webgpu`, plus serialize `InferenceSession.run()` calls.
- Make Segment Anything encoder output structured-clone safe by returning a plain-object tensor representation and reconstructing the tensor in the decoder.
Reviewed changes
Copilot reviewed 5 out of 6 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| web_ui/packages/smart-tools/src/utils/wasm-utils.ts | Adjusts session defaults (WebGPU+CPU) and switches to a new WASM asset path/type. |
| web_ui/packages/smart-tools/src/segment-anything/session.ts | Updates ORT env/session creation usage and adds run-call serialization to avoid concurrent run() on a single session. |
| web_ui/packages/smart-tools/src/segment-anything/segment-anything-encoder.ts | Converts encoder output tensor into a serializable representation suitable for worker transfers. |
| web_ui/packages/smart-tools/src/segment-anything/segment-anything-decoder.ts | Reconstructs an ort.Tensor from the serialized encoder output for decoder execution. |
| web_ui/packages/smart-tools/package.json | Bumps onnxruntime-web dependency version. |
| web_ui/package-lock.json | Locks updated onnxruntime-web/onnxruntime-common and transitive dependency versions. |
Files not reviewed (1)
- web_ui/package-lock.json: Language not supported
```diff
 numThreads: number;
 executionProviders: string[];
-wasmRoot?: string | Record<string, string>;
+wasmRoot?: string | { wasm?: string };
```
SessionParameters.wasmRoot is now typed as string | { wasm?: string }, but the rest of the code treats this as a concrete wasmPaths value (and passes it into ORT env config). Making wasm optional allows {} / undefined at the type level and tends to be incompatible with ORT’s usual Record<string, string>/string wasmPaths typing under strict mode. Consider making wasm required (e.g., { wasm: string }) or reverting to a broader Record<string, string>-style type that matches what ORT expects.
```diff
-wasmRoot?: string | { wasm?: string };
+wasmRoot?: string | { wasm: string };
```
```diff
 env.wasm.numThreads = this.params.numThreads;
 env.wasm.wasmPaths = this.params.wasmRoot;
 env.wasm.simd = true;
+// Suppress expected "some nodes not assigned to WebGPU EP" warnings —
+// ORT intentionally keeps shape-related ops on CPU for performance.
+env.logLevel = 'error';
```
env.logLevel = 'error' changes the global ONNX Runtime logging level for the entire app/package, not just this session. This can hide important warnings/errors elsewhere (and makes diagnosing WebGPU fallback issues harder). Consider scoping this (save/restore the previous logLevel) or making it configurable via the session parameters instead of forcing it in init().
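The save/restore approach suggested here can be sketched generically. The `withLogLevel` helper and `EnvLike` shape below are assumptions for illustration, not ORT API; the real code would pass onnxruntime-web's `env` object.

```typescript
// Sketch: scope a log-level change to one async operation and
// restore the previous level afterwards, even if the operation
// throws. `withLogLevel`/`EnvLike` are illustrative names.
type LogLevel = 'verbose' | 'info' | 'warning' | 'error' | 'fatal';

interface EnvLike {
    logLevel?: LogLevel;
}

async function withLogLevel<T>(
    env: EnvLike,
    level: LogLevel,
    fn: () => Promise<T>
): Promise<T> {
    const previous = env.logLevel;
    env.logLevel = level;
    try {
        return await fn();
    } finally {
        // Restore whatever level the rest of the app was using.
        env.logLevel = previous;
    }
}
```

With this shape, the warning suppression applies only around the noisy call (e.g. session creation), instead of globally for the whole package lifetime.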
```typescript
const encoderResult: SerializableTensor = {
    data: new Float32Array((await gpuTensor.getData()) as Float32Array),
```
data: new Float32Array((await gpuTensor.getData()) as Float32Array) creates an extra copy of the tensor data (and will double-copy again when structured-cloned), which can be expensive for SAM embeddings. If getData() already returns a Float32Array, consider using it directly (and/or using Comlink transfer of the underlying buffer) to avoid redundant allocations.
```diff
-const encoderResult: SerializableTensor = {
-    data: new Float32Array((await gpuTensor.getData()) as Float32Array),
+const tensorData = (await gpuTensor.getData()) as Float32Array;
+const encoderResult: SerializableTensor = {
+    data: tensorData,
```
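Beyond dropping the redundant `new Float32Array(...)`, the structured-clone copy itself can be avoided by moving the underlying buffer via a transfer list (the mechanism `Comlink.transfer()` wraps for worker messages). A hedged sketch, assuming a tensor shape like the one in this PR; `SerializableTensor` fields and `toTransferable` are illustrative:

```typescript
// Sketch: reuse the Float32Array from getData() directly and detach
// its buffer on send instead of cloning it. Illustrative names.
interface SerializableTensor {
    data: Float32Array;
    dims: number[];
    type: 'float32';
}

function toTransferable(tensorData: Float32Array, dims: number[]): SerializableTensor {
    // No defensive copy: the transfer list below will move this
    // buffer to the receiving realm instead of duplicating it.
    return { data: tensorData, dims, type: 'float32' };
}

// structuredClone with a transfer list detaches the source buffer:
// after the call, the sender's view is empty and the receiver owns
// the memory — zero copies of the embedding data.
const payload = toTransferable(new Float32Array(4), [1, 4]);
const moved = structuredClone(payload, { transfer: [payload.data.buffer] });
```

Note the trade-off: after transfer the encoder-side array is detached, so this only works if the encoder does not need to reuse the embedding locally.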
📝 Description
Enables the `webgpu` execution provider, which in early tests showed a 10-12x speed improvement.

✨ Type of Change
Select the type of change your PR introduces:
🧪 Testing Scenarios
Describe how the changes were tested and how reviewers can test them too:
✅ Checklist
Before submitting the PR, ensure the following: