Gemma4 by tchawada · Pull Request #966 · quic/efficient-transformers

tchawada · 2026-05-06T10:32:21Z

End to end pipeline for gemma4

quic-hemagnih · 2026-05-12T10:02:22Z

@tchawada can you please resolve the conflicts and update the PR.

quic-hemagnih

Can we add the test case in the CI

quic-hemagnih · 2026-05-25T10:08:55Z

+        skip_model_io=True,
+        **compiler_kwargs,
+    )
+    breakpoint()


remove breakpoint

quic-hemagnih · 2026-05-26T05:24:08Z

 for i in range(num_chunks):
    chunk_inputs["input_ids"] = lang_inputs["input_ids"][:, i * PREFILL_SEQ_LEN : (i + 1) * PREFILL_SEQ_LEN]
    chunk_inputs["position_ids"] = lang_inputs["position_ids"][..., i * PREFILL_SEQ_LEN : (i + 1) * PREFILL_SEQ_LEN]
+    breakpoint()


remove breakpoints @qcdipankar Please check and remove all the breakpoints

qcdipankar · 2026-05-26T06:29:57Z

        return self.model.config.__dict__


+class _QEffAutoModelForImageTextToTextLanguageOnlyCompat:


@tchawada why are we making a new AutoModel for lang part in text can't we handle using the same conditional generation as that of other VLM's?

I am changing this

qcdipankar · 2026-05-26T06:32:14Z

+        #     and vision_onnx_path is None
+        # )
+        # need_export_lang = self.lang_model.onnx_path is None and lang_onnx_path is None
+        # if need_export_vision or need_export_lang:


this patch is different and changes the variables and the meaning of their usage for diss mode can we go back to mainline variables

qcdipankar · 2026-05-26T06:32:46Z

            if k
-            in {"pixel_values", "image_masks", "image_input_idx", "valid_idx", "aspect_ratio_ids", "aspect_ratio_mask"}
+            in {
+                "pixel_values",


why is image position ids required here?

Without them, the output is not meaningful

qcdipankar · 2026-05-26T06:33:36Z


        target_dtype = getattr(self.model.config, "torch_dtype", torch.float32)
+        convert_to_fp16 = CUSTOM_IO_DTYPE_MAP[target_dtype] == "float16"
+        print(convert_to_fp16)


remove the print from here

Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>

Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com>

Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>

tchawada force-pushed the gemma4 branch from 8465e7e to a7a06f0 Compare May 7, 2026 07:44

quic-hemagnih requested changes May 12, 2026

View reviewed changes

tchawada force-pushed the gemma4 branch 2 times, most recently from ebd6845 to 4e97579 Compare May 19, 2026 11:05

qcdipankar force-pushed the gemma4 branch from 4e97579 to 0427e64 Compare May 20, 2026 04:23

tchawada force-pushed the gemma4 branch from 3e3ebbe to 32aa012 Compare May 20, 2026 06:08

quic-rishinr added the 1.22 Release 1.22 candidate label May 22, 2026

quic-rishinr requested a review from vbaddi May 25, 2026 10:06

quic-hemagnih reviewed May 25, 2026

View reviewed changes

Comment thread examples/image_text_to_text/models/gemma_vision/gemma4_diss.py Outdated

quic-hemagnih reviewed May 25, 2026

View reviewed changes

quic-hemagnih requested changes May 25, 2026

View reviewed changes

quic-hemagnih reviewed May 26, 2026

View reviewed changes

Comment thread QEfficient/transformers/models/gemma4/modeling_gemma4.py

quic-hemagnih requested changes May 26, 2026

View reviewed changes

qcdipankar marked this pull request as draft May 26, 2026 05:47

qcdipankar requested changes May 26, 2026

View reviewed changes

tchawada added 13 commits May 26, 2026 12:41

Gemma4 full code

301c274

Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>

Updated pyproject.toml

2d73666

Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>

Aligning modeling_auto with main

03d878c

Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>

Rebased the branch

fefb5d3

Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>

Gemma4 full code

02a6e4f

Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>

Aligning modeling_auto with main

e43c101

Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>

updating modeling_qeff

24780d3

Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>

Solving lint error

313fdef

Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>

solving lint error

3f44569

Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>

solving lint error

036f797

Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>

Rebasing the branch

f7dc244

Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>

Add support for testing with fewer layers configuration

37f4fbd

Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>

Add support for testing with fewer layers configuration

0278b20

Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>

tchawada and others added 9 commits May 26, 2026 12:44

Add support for testing with fewer layers configuration

d27b8e5

Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>

Solving DCO error

200fd3c

Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>

Solving DCO error

bb14cc2

Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>

Optimized Prefill Added to Gemma4

79274fa

Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com>

adding diss support to gemma4

d33b6b2

Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com>

Added example script for gemma4 diss

da945f0

Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com>

Minor changes

386e1a9

Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com>

Aligning modeling_auto with main

28701c9

Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>

Aligning modeling_auto with main

4c2b1d1

Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>

tchawada force-pushed the gemma4 branch from f1f3719 to 4c2b1d1 Compare May 26, 2026 13:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Gemma4#966

Gemma4#966
tchawada wants to merge 22 commits into
quic:mainfrom
tchawada:gemma4

tchawada commented May 6, 2026

Uh oh!

quic-hemagnih commented May 12, 2026

Uh oh!

quic-hemagnih left a comment

Uh oh!

Uh oh!

quic-hemagnih May 25, 2026

Uh oh!

quic-hemagnih May 26, 2026

Uh oh!

Uh oh!

qcdipankar May 26, 2026

Uh oh!

tchawada May 26, 2026

Uh oh!

Uh oh!

qcdipankar May 26, 2026

Uh oh!

qcdipankar May 26, 2026

Uh oh!

tchawada May 26, 2026

Uh oh!

qcdipankar May 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		return self.model.config.__dict__


		class _QEffAutoModelForImageTextToTextLanguageOnlyCompat:

Conversation

tchawada commented May 6, 2026

Uh oh!

quic-hemagnih commented May 12, 2026

Uh oh!

quic-hemagnih left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants