fix: qwen3.5-plus-kv-cache.ipynb to use KVCache #981

Merged
rasbt merged 1 commit into rasbt:main from casinca:qwen3.5-fix-KVcache
Mar 21, 2026

casinca (Contributor) commented Mar 17, 2026

Hello Sebastian,

The Qwen3.5 KV cache notebook was not actually using the KV cache: KVCache was never instantiated or used in generate_text_basic_stream.

The output trace is the same as in the Qwen3.5 (no cache) notebook, which implies the KV cache could not have been in use:
~Generation speed: 9.00 tokens/sec

After the fix:
~Generation speed: 15.16 tokens/sec, and the test passes.
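
For readers unfamiliar with this kind of bug, here is a minimal pure-Python sketch of the difference between a generation loop that forgets to create a cache and one that uses it. It is not the notebook's code: ToyKVCache, ToyModel, generate_no_cache, and generate_with_cache are hypothetical names made up for illustration, and the "model" just sums token IDs. The point it tries to show is that both loops emit the same tokens, so the output alone does not reveal the missing cache; only the amount of work per step (and hence tokens/sec) differs.

```python
class ToyKVCache:
    """Hypothetical stand-in for a KV cache; it just remembers seen token IDs."""

    def __init__(self):
        self.seen = []


class ToyModel:
    """Toy 'model': predicts (sum of context tokens) % 10 and counts its work."""

    def __init__(self):
        self.positions_processed = 0

    def __call__(self, token_ids, cache=None):
        # Work is proportional to the number of tokens passed in.
        self.positions_processed += len(token_ids)
        if cache is None:
            context = list(token_ids)
        else:
            cache.seen.extend(token_ids)
            context = cache.seen
        return sum(context) % 10


def generate_no_cache(model, token_ids, max_new_tokens):
    token_ids = list(token_ids)
    for _ in range(max_new_tokens):
        # The full sequence is re-processed on every step.
        next_token = model(token_ids)
        yield next_token
        token_ids.append(next_token)


def generate_with_cache(model, token_ids, max_new_tokens):
    cache = ToyKVCache()  # the step that is easy to forget
    step_input = list(token_ids)  # first call processes the whole prompt
    for _ in range(max_new_tokens):
        next_token = model(step_input, cache=cache)
        yield next_token
        step_input = [next_token]  # later calls pass only the new token


prompt = [1, 2, 3]

no_cache_model = ToyModel()
no_cache_tokens = list(generate_no_cache(no_cache_model, prompt, 4))

cached_model = ToyModel()
cached_tokens = list(generate_with_cache(cached_model, prompt, 4))

print(no_cache_tokens == cached_tokens)    # same generated tokens
print(no_cache_model.positions_processed)  # grows quadratically with length
print(cached_model.positions_processed)    # grows linearly with length
```

In this toy setting the uncached loop processes 18 positions and the cached loop 6 for the same four generated tokens, which is the same kind of gap the tokens/sec figures above point to.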

rasbt (Owner) commented Mar 21, 2026

Arg, must have forgotten to update that cell. Good catch, and thanks for the PR!

rasbt merged commit 9320a5e into rasbt:main on Mar 21, 2026
13 checks passed
casinca deleted the qwen3.5-fix-KVcache branch on March 25, 2026 10:19