Fix for fp16/bf16 export & compile in qwen3vl & qwen3vlmoe models by qcdipankar · Pull Request #980 · quic/efficient-transformers

qcdipankar · 2026-05-12T16:44:15Z

Added fix for fp16 export in qwen3 and qwen3vl modeling files.

asmigosw · 2026-05-13T08:39:34Z

Please convert all the nodes and IO info datatype except the logits in custom dtype:

final_mask = torch.ones((seq_len, seq_len), dtype=torch.float32)
IOInfo(name="pixel_values", datatype=torch.float32, shape=("batch_size", 3, "image_size", "image_size")),

asmigosw · 2026-05-19T11:13:51Z

Can you please change the below dtype also at line 115:
self._set_cos_sin_cache(seq_len=self.original_max_seq_len, device=self.inv_freq.device, dtype=torch.get_default_dtype())

Change dtype = config.torch_dtype as torch.get_default_dtype() sets the ROPE weights to default float32.

Please make same changes in both modelling file and check for any other dtype which is in float32 and make it take from torch_dtype passed by user.

asmigosw

LGTM

Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com>

qcdipankar self-assigned this May 12, 2026

quic-rishinr requested review from asmigosw and quic-rishinr May 13, 2026 04:22

qcdipankar force-pushed the fp16_bug_fix_qwen3vl branch from ef89c88 to 0a0db36 Compare May 18, 2026 10:14

qcdipankar changed the title ~~Fix for fp16 export in qwen3vl & qwen3vlmoe models~~ Fix for fp16/bf16 export & compile in qwen3vl & qwen3vlmoe models May 19, 2026

qcdipankar force-pushed the fp16_bug_fix_qwen3vl branch from 4a0dbf7 to bcbdffe Compare May 19, 2026 14:43

asmigosw reviewed May 19, 2026

View reviewed changes

Comment thread QEfficient/transformers/models/qwen3_vl_moe/modeling_qwen3_vl_moe.py Outdated

asmigosw approved these changes May 20, 2026

View reviewed changes

qcdipankar force-pushed the fp16_bug_fix_qwen3vl branch from e55232b to d926081 Compare May 23, 2026 10:57

quic-rishinr added the 1.22 Release 1.22 candidate label May 25, 2026

quic-rishinr changed the base branch from main to release/v1.22.0_tmp May 25, 2026 17:02

qcdipankar added 11 commits May 25, 2026 22:34

Fix for fp16 export in qwen3vl & qwen3vlmoe models

b85d106

Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com>

Cleaning done 1

bb8a909

Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com>

Addressing the review changes

432d0b0

Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com>

Addressed the review comments

a02455e

Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com>

Cleaning done 2

c0f95c0

Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com>

Cleaning Done 3

7e42b79

Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com>

Minor Fix and cleaning

75e36b2

Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com>

Cleaning Done 4

87f4538

Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com>

Removing spli model io flag as qgenie is raising it as a concern

a0af1c6

Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com>

Adding split model io flag to example script

627d537

Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com>

Added spli flag to qwen3vl example script

374b6ed

Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com>

quic-rishinr force-pushed the fp16_bug_fix_qwen3vl branch from d926081 to 374b6ed Compare May 25, 2026 17:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix for fp16/bf16 export & compile in qwen3vl & qwen3vlmoe models#980

Fix for fp16/bf16 export & compile in qwen3vl & qwen3vlmoe models#980
qcdipankar wants to merge 11 commits into
release/v1.22.0_tmpfrom
fp16_bug_fix_qwen3vl

qcdipankar commented May 12, 2026

Uh oh!

asmigosw commented May 13, 2026

Uh oh!

asmigosw commented May 19, 2026

Uh oh!

Uh oh!

asmigosw left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

qcdipankar commented May 12, 2026

Uh oh!

asmigosw commented May 13, 2026

Uh oh!

asmigosw commented May 19, 2026

Uh oh!

Uh oh!

asmigosw left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants