
llama-fit-params: free memory target per device #18679

Merged
JohannesGaessler merged 1 commit into ggml-org:master from
JohannesGaessler:llama-fp-targets-per-gpu
Jan 8, 2026

llama-fit-params: free memory target per device#18679
JohannesGaessler merged 1 commit intoggml-org:masterfrom
JohannesGaessler:llama-fp-targets-per-gpu

Conversation

@JohannesGaessler (Contributor)
Fixes #18390.

This PR extends llama_params_fit with the option to specify a free-memory target per device instead of a single global value. If the user specifies only a single number, that number is broadcast across all devices.

@JohannesGaessler merged commit 64848de into ggml-org:master on Jan 8, 2026
75 checks passed
gary149 pushed a commit to gary149/llama-agent that referenced this pull request Jan 8, 2026


Development

Successfully merging this pull request may close these issues.

Feature Request: Allow per-device memory margin for --fit-target
