Add Gemma4 layer-wise unit tests #3905
Conversation
Codecov Report: ✅ All modified and coverable lines are covered by tests.
The Pull Request introduces comprehensive layer-wise unit tests for the Gemma 4 vision components, comparing the MaxText (JAX) implementation against the PyTorch reference from the transformers library. The tests cover all key layers, including VisionEntry, RotaryEmbedding, Attention, EncoderBlock, and the full VisionEncoderLayer, ensuring numerical parity across frameworks.
🔍 General Feedback
- Completeness: The coverage is excellent, including both individual components and end-to-end encoder tests.
- Numerical Parity: Most individual layer tests use a tight tolerance (1e-3), which is a strong indicator of implementation correctness.
- Standardized Helpers: The use of shared multimodal test utilities and clear weight-copying functions makes the tests easy to follow and maintain.
- CI Status: Note that the new test file is added to pytest.ini's ignore list, which is consistent with the stated TODO to transform these into PyTorch-free tests for CI compatibility.
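The parity checks described above follow a simple pattern. A minimal numpy sketch of the comparison semantics (the helper name mirrors the PR's `assert_all_close_jax_torch`, but this stand-in assumes away the dtype/device conversion the real helper presumably handles):

```python
import numpy as np

# Hypothetical stand-in for the shared assert helper: both framework outputs
# are converted to numpy and compared elementwise with
# |actual - desired| <= atol + rtol * |desired|  (np.testing semantics).
def assert_all_close(actual, desired, rtol, atol, error_msg=""):
    np.testing.assert_allclose(
        np.asarray(actual), np.asarray(desired),
        rtol=rtol, atol=atol, err_msg=error_msg,
    )

# A per-layer check at the tight 1e-3 tolerance passes for a 1e-4-scale mismatch:
ref = np.array([1.0, 2.0, 3.0], dtype=np.float32)
out = ref + 1e-4
assert_all_close(out, ref, rtol=1e-3, atol=1e-3)
```

A mismatch larger than the combined tolerance would raise an `AssertionError` carrying `error_msg`, which is what makes the per-layer failures in such tests easy to localize.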
```python
    float32_logits=True,
    float32_qk_product=True,
)
```
🟢 This variable is defined but not used in the pyconfig.initialize call below. Consider removing it if it's not needed.
```python
base_config_path = os.path.join(MAXTEXT_REPO_ROOT, "src", "maxtext", "configs", "base.yml")
jax_config = pyconfig.initialize(
```
```python
jax_inputs, torch_inputs = create_random_jax_torch(batch_size, seq_len, self.config.hidden_size_for_vit)

torch_output = torch_model(torch_inputs)
jax_output = jax_model(jax_inputs)
```
🟡 The relative tolerance rtol=5e-2 (5%) is quite high for a unit test comparing float32 implementations. While the cumulative error in a 27-layer model and the scaling by sqrt(d_model) in VisionExit might justify some variance, it would be ideal to see if this can be tightened (e.g., to 1e-3 or 1e-2) to ensure higher precision matches.
If this tolerance is the tightest possible due to framework differences, consider adding a brief comment explaining the reason.
```python
jax_output = jax_model(jax_inputs)
assert_all_close_jax_torch(
    jax_output_squeezed,
    torch_lhs,
    rtol=5e-2,
    atol=5e-2,
    error_msg="Gemma4VisionEncoderLayer end-to-end outputs differ",
)
```
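One way to act on the tolerance suggestion above is to measure the observed error before asserting, so the chosen `rtol` can be justified (or tightened) with data. A small numpy sketch of such a diagnostic (hypothetical helper, not part of the PR):

```python
import numpy as np

def max_rel_error(actual, desired, eps=1e-12):
    """Largest elementwise relative error between two outputs; logging this
    next to the assert shows how much slack a tolerance like rtol=5e-2 has."""
    actual = np.asarray(actual, dtype=np.float64)
    desired = np.asarray(desired, dtype=np.float64)
    return float(np.max(np.abs(actual - desired) / (np.abs(desired) + eps)))

# Example: a uniform 1e-3 relative mismatch sits far below a 5e-2 ceiling,
# which would suggest the end-to-end tolerance could be tightened.
desired = np.ones((4, 8))
actual = desired * (1.0 + 1e-3)
err = max_rel_error(actual, desired)
```

If the measured error across seeds stays near the per-layer 1e-3 level, the end-to-end tolerance can likely be reduced; if not, the measured value itself is the comment-worthy justification for keeping 5e-2.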
aireenmei left a comment:
Thanks for adding tests!
```python
# =============================================================================

def copy_rmsnorm_weights(torch_norm, jax_norm):
```
Seems not very specific to gemma4? Should we move to tests.utils.multimodal_test_utils for re-use?
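To illustrate why this helper generalizes, here is a framework-neutral sketch of what a shared version might look like (dict layouts and key names are assumptions; the PR's helper operates on actual module objects rather than plain dicts):

```python
import numpy as np

def copy_rmsnorm_weights(torch_state, jax_params, torch_key="weight", jax_key="scale"):
    """Copy an RMSNorm scale from a PyTorch-style state dict into a Flax-style
    params dict. The parameter is a 1-D vector with identical layout in both
    frameworks, so no transpose or reshape is needed -- which is why the
    helper is not Gemma4-specific and could live in shared test utilities."""
    w = np.asarray(torch_state[torch_key])
    assert w.ndim == 1, "RMSNorm scale is expected to be a 1-D vector"
    jax_params[jax_key] = w.copy()
    return jax_params

# Usage with plain numpy stand-ins for the real tensors:
state = {"weight": np.arange(4, dtype=np.float32)}
params = copy_rmsnorm_weights(state, {})
```

Because nothing here depends on Gemma4's shapes or naming, moving it to `tests.utils.multimodal_test_utils` as suggested would let other model families reuse it unchanged.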
```python
seq_len = 42 * 60
dummy_shape = (batch_size, seq_len, self.config.hidden_size_for_vit)

jax_model = JaxGemma4Attention(
```
How about JaxGemma4VisionAttention?
Description
Add layer-wise unit tests comparing MaxText and PyTorch implementations for Gemma 4 vision components, including
VisionEntry, Gemma4VisionRotaryEmbedding, Gemma4Attention, Gemma4EncoderBlock, VisionExit, Gemma4VisionEncoderLayer, and Gemma4VisionProjector (still offline).

TODO: A follow-up PR to transform these layer-wise tests to be PyTorch-free and runnable on CI.
Tests
Checklist
Before submitting this PR, please make sure (put X in square brackets):
gemini-review label.