[fix] regression introduced by #45534 by eustlb · Pull Request #46456 · huggingface/transformers

eustlb · 2026-06-05T17:04:38Z

What does this PR do?

A continuation of #46400

Verified by running this script on this branch:

model                 lm_head   lm==embed   weights tied  config tie  match
---------------------------------------------------------------------------
qwen2_audio           yes       False       False         False       OK
voxtral               yes       False       False         False       OK
voxtral_realtime      no        n/a         True          True        OK
glmasr                yes       True        False         True        MISMATCH <<<
granite_speech        no        n/a         True          True        OK
granite_speech_plus   no        n/a         True          True        OK
audioflamingo3        yes       False       False         False       OK
musicflamingo         yes       False       False         False       OK
vibevoice_asr         yes       False       False         False       OK

lm_head — yes/no: whether the checkpoint has a separate lm_head.* tensor in its safetensors (has_lm_head).
lm==embed — True/False/n/a: when an lm_head exists, whether lm_head.weight is bitwise identical to embed_tokens.weight. n/a when there's no separate lm_head.
weights tied — True/False: what the actual checkpoint implies (not has_lm_head → no separate head means weights are tied).
config tie — True/False: what the config class resolves tie_word_embeddings to (fallback False).
match — OK if config tie == weights tied, else MISMATCH <<< (the regression check).
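
The script itself is not shown in this thread. As a rough illustration only, the column definitions above could be computed along these lines. This is a minimal sketch, not the script that produced the table: `TieReport`, `check_tying`, and the tensor names in the usage lines are hypothetical, and tensors are stood in for by raw `bytes` so the example runs without loading real safetensors files.

```python
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass
class TieReport:
    """One table row (hypothetical structure, for illustration only)."""
    has_lm_head: bool                # "lm_head" column
    lm_equals_embed: Optional[bool]  # "lm==embed" column; None stands for n/a
    weights_tied: bool               # "weights tied" column
    config_tie: bool                 # "config tie" column

    @property
    def match(self) -> str:
        # The regression check: config and checkpoint must agree.
        return "OK" if self.config_tie == self.weights_tied else "MISMATCH <<<"


def _find(tensors: Mapping[str, bytes], suffix: str) -> Optional[bytes]:
    """Return the first tensor whose name ends with `suffix`, else None."""
    for name, data in tensors.items():
        if name.endswith(suffix):
            return data
    return None


def check_tying(tensors: Mapping[str, bytes], config_tie: Optional[bool]) -> TieReport:
    """Apply the column definitions to one checkpoint.

    `tensors` maps tensor names to raw bytes (an assumption made to keep the
    sketch self-contained); `config_tie` is the resolved tie_word_embeddings
    value, with None falling back to False.
    """
    lm_head = _find(tensors, "lm_head.weight")
    embed = _find(tensors, "embed_tokens.weight")
    has_lm_head = lm_head is not None
    # Bitwise comparison only makes sense when a separate head exists.
    lm_equals_embed = (lm_head == embed) if has_lm_head else None
    # No separate head in the checkpoint means the weights are tied.
    weights_tied = not has_lm_head
    return TieReport(has_lm_head, lm_equals_embed, weights_tied, bool(config_tie))


# A glmasr-like case: a separate head that is bitwise identical to the embeddings.
glmasr_like = {
    "language_model.embed_tokens.weight": b"\x01\x02",  # hypothetical tensor name
    "lm_head.weight": b"\x01\x02",
}
print(check_tying(glmasr_like, config_tie=True).match)
```

Under these definitions a checkpoint with a separate but identical head is still reported as a mismatch when the config ties the weights, which is the situation the glmasr row shows.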

For glmasr, the hub weights do have an lm_head, but it is bitwise identical to embed_tokens, so we keep weight tying.

HuggingFaceDocBuilderDev · 2026-06-05T17:17:15Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

hmellor · 2026-06-05T17:47:13Z

For the models with no LM head, why set tie_word_embeddings=True?

hmellor · 2026-06-05T17:48:29Z

Should this be propagated to modeling_glmasr.py?

And tie_word_embeddings updated?

It's already propagated, but since _tied_weights_keys has been removed from AudioFlamingo3ForConditionalGeneration, it needs to be re-added here (so the modeling stays the same).

eustlb · 2026-06-06T10:34:37Z

"no" in the lm_head column of the table above means there is no lm_head in the hub weights, so tie_word_embeddings must be set to True.

github-actions · 2026-06-06T10:50:52Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: audioflamingo3, glmasr, musicflamingo, qwen2_audio, vibevoice_asr, voxtral, voxtral_realtime

eustlb · 2026-06-06T11:02:10Z

run-slow: audioflamingo3, glmasr, musicflamingo, qwen2_audio, vibevoice_asr, voxtral, voxtral_realtime

github-actions · 2026-06-06T11:02:28Z

CI Dashboard: View test results in Grafana

github-actions · 2026-06-06T11:02:51Z

Workflow Run ⚙️💔 This comment contains run-slow, but unknown error occurred and the workflow run aborted!

fix

ab195fc

hmellor mentioned this pull request Jun 5, 2026

Bump Transformers version to 5.10.2 vllm-project/vllm#41359 (Open, 15 tasks)

fix

ac481d8

hmellor added the "for patch" label (Tag issues / labels that should be included in the next patch) Jun 5, 2026

hmellor reviewed Jun 5, 2026

unnecessary and misleading

91edb56

Merge branch 'main' into fix-tie-words-embeddings-regression

c8881ea

eustlb wants to merge 4 commits into main from fix-tie-words-embeddings-regression

3 participants