Skip to content

CUDA: reduce MMQ stream-k overhead#22298

Merged
JohannesGaessler merged 2 commits intoggml-org:masterfrom
JohannesGaessler:cuda-mmq-fastdiv-8
Apr 25, 2026
Merged

CUDA: reduce MMQ stream-k overhead#22298
JohannesGaessler merged 2 commits intoggml-org:masterfrom
JohannesGaessler:cuda-mmq-fastdiv-8

Commits

Commits on Apr 23, 2026

Commits on Apr 25, 2026