
ONNX support to run the QualT5 models for inference on CPU#1

Open
sheineking wants to merge 4 commits into terrierteam:main from sheineking:onnx_qualt5

Conversation

@sheineking

  • ONNXQualT5 class for inference on CPU
  • Add utility code for exporting (to new cache directory) and loading

On a sample of 4,000 web documents, the maximum absolute deviation in quality score between the original qt5-tiny and the ONNX-version was 0.01.
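A deviation check like the one described above can be sketched as follows (a minimal illustration, assuming both models' scores are available as arrays; the values shown are hypothetical, not the PR's actual scores):

```python
import numpy as np

def max_abs_deviation(scores_a, scores_b):
    """Maximum absolute difference between two score arrays."""
    a = np.asarray(scores_a, dtype=np.float64)
    b = np.asarray(scores_b, dtype=np.float64)
    return float(np.max(np.abs(a - b)))

# Hypothetical example scores for original vs. ONNX model:
orig = [0.91, 0.42, 0.13]
onnx = [0.905, 0.425, 0.128]
print(max_abs_deviation(orig, onnx))  # ~0.005
```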

@cmacdonald
Contributor

Great idea, thank you. Presumably this is faster on CPU than normal inference? Do you have timings on the same 4000 docs, say?

Can we make the onnx dependencies optional? We normally do that with pyproject.toml. Do you need an example?

@sheineking
Author

Thanks a lot for the quick response. Yes, it is faster on CPU. I have the timings only in the context of the OWS pipeline Resilipipe, building on the module that Ariane developed. This comes with the following limitations:

  1. The timings are based on all processing steps in the pipeline
  2. Texts are processed individually rather than in batches

The different setups I tried are:

  • qt5-tiny on CPU: ~0.7 records / s (Tested only on 100 records)
  • qt5-tiny on GPU: ~55 records / s (Tested on all 4,000 records)
  • qt5-tiny with ONNX: ~50 records / s (Tested on all 4,000 records)
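Per-item processing, as in point 2 above, forgoes the throughput gains of batched inference. A minimal chunking helper (hypothetical, not part of the PR) that would let texts be fed to the model in batches:

```python
def batched(items, batch_size):
    """Yield consecutive chunks of at most batch_size items."""
    for i in range(0, len(items), batch_size):
        yield items[i:i + batch_size]

texts = [f"doc {i}" for i in range(10)]
sizes = [len(b) for b in batched(texts, 4)]
print(sizes)  # [4, 4, 2]
```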

I will do a separate comparison and update the results.

And yes, I will add a pyproject.toml to make the onnx dependencies optional.

@sheineking
Author

I added the dependencies as an optional extra in setup.py and removed the eager import from `__init__.py`, so the ONNX dependencies are now optional.
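An optional-dependency setup of this kind typically uses `extras_require`; a sketch of what the setup.py change might look like (the extra name "onnx" and the unpinned requirements are assumptions, not the PR's exact contents):

```python
# setup.py fragment -- extra name and requirements are illustrative
setup(
    name="pyterrier-quality",
    ...,
    extras_require={
        "onnx": ["onnx", "onnxruntime"],
    },
)
```

Users who want ONNX support would then install with `pip install pyterrier-quality[onnx]`, while the base install stays free of the ONNX dependencies.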

@cmacdonald
Contributor

Thanks. Given the context, I'll let @seanmacavaney give a final review.

@cmacdonald cmacdonald closed this Sep 19, 2025
@cmacdonald cmacdonald reopened this Sep 19, 2025
@sheineking
Author

sheineking commented Sep 19, 2025

And here are the updated times when applying only QualT5.transform on batches of webpage text. The model is qt5-tiny for all three settings.

```python
qmodel = QualT5("pyterrier-quality/qt5-tiny")
df = qmodel.transform(df)
```

  • GPU: 4,000 records in 2.08 s
  • CPU: 400 records in 117.5 s
  • ONNX: 400 records in 4.47 s

- Support ONNX session options
- Move expensive imports to model export
- Create separate package for ONNX code
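The "ONNX session options" mentioned in the commit notes map onto onnxruntime's `SessionOptions` API; a configuration sketch (the thread count and optimization level below are illustrative defaults, not the PR's actual choices):

```python
import onnxruntime as ort

# Session options control CPU threading and graph optimizations;
# these values are assumptions for illustration only.
opts = ort.SessionOptions()
opts.intra_op_num_threads = 4
opts.graph_optimization_level = ort.GraphOptimizationLevel.ORT_ENABLE_ALL

# The options are then passed when constructing the inference session:
# session = ort.InferenceSession("qt5-tiny.onnx", sess_options=opts,
#                                providers=["CPUExecutionProvider"])
```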