fix[vLLM x v5]: Default untied embeddings in AudioFlamingo3 and VibeVoice by harshaljanjani · Pull Request #46400 · huggingface/transformers

harshaljanjani · 2026-06-04T06:59:09Z

What does this PR do?

→ Fixes vllm-project/vllm#39330 (comment)
→ Corrects the tie_word_embeddings default for AudioFlamingo3 and VibeVoice for integration with the vLLM Transformers backend. This only causes issues with vLLM and not Transformers because Transformers always loads checkpoint weights regardless of tie_word_embeddings (or in other words, it's a hint for init-time tying, not a directive as in vLLM to my understanding).
→ Made sure this doesn't cause any regressions in tests/models/audioflamingo3/ and tests/models/vibevoice_asr/

Model-wise behavior

→ AudioFlamingo3 and VibeVoice: These models break under the current default; their checkpoints contain distinct lm_head and embed_tokens weights (with a max difference of ~1.0 to 1.7). Substitution produces garbage outputs.
→ GraniteSpeech: Works fine as-is because it genuinely ties its embeddings and has no separate lm_head in the checkpoint.
→ GLM-ASR: Works fine as-is. It has separate lm_head and embed_tokens, but they are bitwise identical (max diff = 0.0), so tying them produces the same result.

cc: @eustlb @vasqu

Code Agent Policy

I confirm that this is not a pure code agent PR.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you fix any necessary existing tests?

github-actions · 2026-06-04T07:00:24Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: audioflamingo3, vibevoice_asr

vasqu

LGTM but cc @ebezzam to double-check (I think you were involved in those models?)

In general iirc, we only tie if we also really detect missing keys so really might as well be skipping those parts

ebezzam

@vasqu thanks for the ping. Those lines were added with the refactor from #45534
Probably accidental, so I think we can remove altogether (rather than setting to False) as you suggest.

Unless @eustlb you had found a reason to set them to True?

eustlb

Yep, it slipped through... Thanks a lot for catching @harshaljanjani ! Agree that simply removing the field is better here

github-actions · 2026-06-04T15:25:26Z

CI Dashboard: View test results in Grafana

HuggingFaceDocBuilderDev · 2026-06-04T15:36:01Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

fix: Default untied embeddings in AudioFlamingo3 and VibeVoice

57982e2

harshaljanjani mentioned this pull request Jun 4, 2026

feat[vLLM × v5]: Add audio support for the Transformers backend vllm-project/vllm#39330

Open

7 tasks

harshaljanjani marked this pull request as ready for review June 4, 2026 07:13

github-actions Bot requested review from ArthurZucker and Rocketknight1 June 4, 2026 07:14

vasqu approved these changes Jun 4, 2026

View reviewed changes

ebezzam reviewed Jun 4, 2026

View reviewed changes

eustlb approved these changes Jun 4, 2026

View reviewed changes

vasqu enabled auto-merge June 4, 2026 15:25

vasqu added this pull request to the merge queue Jun 4, 2026

Merged via the queue into huggingface:main with commit b56c3cc Jun 4, 2026
24 checks passed

harshaljanjani deleted the fix/untied-embeddings-audioflamingo3-vibevoice branch June 4, 2026 16:30

eustlb mentioned this pull request Jun 5, 2026

[fix] regression introduced by #45534 #46456

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix[vLLM x v5]: Default untied embeddings in AudioFlamingo3 and VibeVoice#46400

fix[vLLM x v5]: Default untied embeddings in AudioFlamingo3 and VibeVoice#46400
vasqu merged 1 commit into
huggingface:mainfrom
harshaljanjani:fix/untied-embeddings-audioflamingo3-vibevoice

harshaljanjani commented Jun 4, 2026

Uh oh!

github-actions Bot commented Jun 4, 2026

Uh oh!

vasqu left a comment

Uh oh!

ebezzam left a comment

Uh oh!

eustlb left a comment

Uh oh!

github-actions Bot commented Jun 4, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Jun 4, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

harshaljanjani commented Jun 4, 2026

What does this PR do?

Model-wise behavior

Code Agent Policy

Before submitting

Uh oh!

github-actions Bot commented Jun 4, 2026

Uh oh!

vasqu left a comment

Choose a reason for hiding this comment

Uh oh!

ebezzam left a comment

Choose a reason for hiding this comment

Uh oh!

eustlb left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented Jun 4, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Jun 4, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants