Modified qwen_2.5 modelling file to allow replicate_kv_script to work for custom num_kv_heads. by quic-dhirajku · Pull Request #595 · quic/efficient-transformers

quic-dhirajku · 2025-10-18T06:57:04Z

Edited the replicate_kv_heads script so that it can load the VLM and export it properly after the KV_heads update.

Export goes through successfully, but the full model needs to be exported for it to work due to TF version issues.
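For context, here is a minimal sketch of what replicating KV heads means for a grouped-query attention projection weight. It is not the code from this PR or from the replicate_kv_heads script. The function names, the toy shapes, and the list-of-rows weight layout are all assumptions made for illustration. The idea is that each KV head's block of rows is repeated so that the copies sit next to each other, which keeps every query head attending to the same KV values after num_kv_heads is increased.

```python
def replicate_kv_heads(weight, num_kv_heads, head_dim, repeat):
    """Repeat each KV head's block of rows `repeat` times, copies kept adjacent.

    `weight` is a list of rows with num_kv_heads * head_dim rows
    (hypothetical layout, mirroring a k_proj / v_proj weight matrix).
    """
    if len(weight) != num_kv_heads * head_dim:
        raise ValueError("weight must have num_kv_heads * head_dim rows")
    out = []
    for h in range(num_kv_heads):
        block = weight[h * head_dim:(h + 1) * head_dim]
        for _ in range(repeat):
            out.extend(list(row) for row in block)
    return out


def kv_head_for_query(q_head, num_attention_heads, num_kv_heads):
    """Index of the KV head a query head reads from under grouped-query attention."""
    return q_head // (num_attention_heads // num_kv_heads)


# Toy configuration (assumed values, not taken from any real model config).
num_attention_heads, num_kv_heads, head_dim, hidden, repeat = 8, 2, 2, 3, 2
new_num_kv_heads = num_kv_heads * repeat

# The new KV head count must still divide the number of query heads.
if num_attention_heads % new_num_kv_heads != 0:
    raise ValueError("num_attention_heads must be divisible by the new num_kv_heads")

weight = [[float(r * hidden + c) for c in range(hidden)]
          for r in range(num_kv_heads * head_dim)]
new_weight = replicate_kv_heads(weight, num_kv_heads, head_dim, repeat)
```

With this layout, query head `q` maps to an identical block of rows before and after replication, so only the stored KV head count changes.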

… for custom num_kv_heads.

Edited the replicate_kv_heads script to allow loading the VLM and export properly after KV_heads update.

Signed-off-by: quic-dhirajku <quic_dhirajku@quicinc.com>

Export goes through successfully, need to export full model for it to work due to TF version issues.

quic-hemagnih · 2026-02-24T09:20:39Z

Hi @quic-dhirajku, as discussed, please raise a ticket for SIT to verify the perf hit. If we are good on perf and CI, let's go ahead and merge it.

In parallel, we can work on the PR below:
#625

quic-dhirajku requested review from ochougul, quic-amitraj, quic-hemagnih and quic-rishinr as code owners October 18, 2025 06:57

quic-rishinr mentioned this pull request Nov 4, 2025

[WIP]: Add early support for KV replication in VLMs #594

Closed

quic-dhirajku closed this Apr 13, 2026

quic-dhirajku wants to merge 1 commit into quic:main from quic-dhirajku:qwen_2.5_vl_replicate_kv_heads
