common,server : fix custom preset dedup against cached models by angt · Pull Request #25235 · ggml-org/llama.cpp

angt · 2026-07-02T10:21:39Z

Overview

Revert the canonical_tag() case-folding in preset.cpp that collapsed distinct alias presets (e.g. Q4_K_XL-thinking and Q8_0-thinking into THINKING) and broke downstream tools keying by name.

Instead, add cached_base_name() in server-models.cpp so a custom preset section whose quant tag carries a convenience prefix merges into the cached entry the cache stores without that prefix. Presets whose tag suffix is not a cached quant keep their own name as a separate alias row.

Also track custom_names during the merge so the '*' marker lands on the actual mapping key rather than the unmerged custom name.

Additional information

This is another solution that is #25226.

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: YES

Revert the canonical_tag() case-folding in preset.cpp that collapsed distinct alias presets (e.g. Q4_K_XL-thinking and Q8_0-thinking into THINKING) and broke downstream tools keying by name. Instead, add cached_base_name() in server-models.cpp so a custom preset section whose quant tag carries a convenience prefix merges into the cached entry the cache stores without that prefix. Presets whose tag suffix is not a cached quant keep their own name as a separate alias row. Also track custom_names during the merge so the '*' marker lands on the actual mapping key rather than the unmerged custom name.

ngxson · 2026-07-03T11:39:37Z

IMO it might be less error-prone if the dedup logic firstly resolves the model to exact local path. I'll will try this direction and push a fix if it works

angt requested review from a team as code owners July 2, 2026 10:21

github-actions Bot added the server label Jul 2, 2026

angt mentioned this pull request Jul 2, 2026

server : don't list cached models when a preset is used #25226

Open

angt linked an issue Jul 2, 2026 that may be closed by this pull request

Misc. bug: preset names changed and some presets ignored #25150

Open

angt requested review from ggerganov and ngxson July 3, 2026 08:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

common,server : fix custom preset dedup against cached models#25235

common,server : fix custom preset dedup against cached models#25235
angt wants to merge 1 commit into
ggml-org:masterfrom
angt:common-server-fix-custom-preset-dedup-against-cached-models

angt commented Jul 2, 2026

Uh oh!

ngxson commented Jul 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

angt commented Jul 2, 2026

Overview

Additional information

Requirements

Uh oh!

ngxson commented Jul 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants