NNX migration prep (2/N): NNX utils and sharding utilities by ecnal-cienet · Pull Request #3470 · AI-Hypercomputer/maxtext

ecnal-cienet · 2026-03-20T18:38:24Z

Description

Note: This is the first in a series of NNX migration PRs. Pure NNX training is not yet implemented — all NNX code paths currently raise NotImplementedError. This PR only introduces the structural scaffolding needed for subsequent patches to plug in NNX logic without modifying shared infrastructure.
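The scaffolding pattern described in this note can be sketched roughly as follows. This is an illustration only, not the PR's code: `Config`, `init_state_linen`, `init_state_nnx`, and `select_init_state_fn` are hypothetical names, and only the `pure_nnx` flag and the `NotImplementedError` behaviour come from the description.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Config:
  """Hypothetical stand-in for the MaxText config; only pure_nnx matters here."""
  pure_nnx: bool = False


def init_state_linen(config: Config) -> dict:
  """Placeholder for the existing Linen initialization path."""
  return {"framework": "linen", "step": 0}


def init_state_nnx(config: Config) -> dict:
  """Placeholder NNX path: scaffolding only, so it raises for now."""
  raise NotImplementedError("Pure NNX training is not yet implemented.")


def select_init_state_fn(config: Config) -> Callable[[Config], dict]:
  """Pick the initializer from the flag so shared callers stay unchanged."""
  return init_state_nnx if config.pure_nnx else init_state_linen
```

With the flag left at its default, `select_init_state_fn(Config())` returns the Linen initializer; with `pure_nnx=True` the returned function raises until a later patch fills it in.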

- NNX sharding utilities (maxtext_utils_nnx.py): Functions to manipulate NNX model shardings using abstract model state: get_named_sharding_nnx, set_named_sharding_nnx, get_partition_spec_nnx, and memory movement helpers (move_memory_to_host / move_memory_to_device).
- get_abstract_state NNX path: Added get_abstract_state_nnx to maxtext_utils.py, which uses nnx.get_abstract_model to return a flat nnx.State (rather than a full TrainStateNNX), and updated get_abstract_state to dispatch to it when pure_nnx=True.
- maxtext_utils.get_mesh_from_config(): Extracted mesh creation into a standalone function with unit tests.
- Unit tests: Added tests/unit/maxtext_utils_nnx_test.py and extended tests/unit/maxtext_utils_test.py to cover the new mesh and sharding utilities.
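For readers unfamiliar with the underlying concepts, the sketch below shows how a mesh, partition specs, named shardings, and an abstract (shape-only) array relate in plain JAX. It does not reproduce the PR's functions or use the NNX helpers; `make_mesh`, `to_named_sharding`, the axis names, and the parameter layout are all assumptions made for illustration.

```python
import jax
import numpy as np
from jax.sharding import Mesh, NamedSharding, PartitionSpec as P


def make_mesh(axis_names=("data", "model")):
  """Build a mesh over all visible devices, placing them on the first axis."""
  shape = (-1,) + (1,) * (len(axis_names) - 1)
  devices = np.array(jax.devices()).reshape(shape)
  return Mesh(devices, axis_names)


def to_named_sharding(mesh, spec_tree):
  """Map a pytree of PartitionSpec leaves to NamedSharding leaves."""
  return jax.tree_util.tree_map(
      lambda spec: NamedSharding(mesh, spec),
      spec_tree,
      is_leaf=lambda x: isinstance(x, P),  # treat each spec as one leaf
  )


mesh = make_mesh()

# Hypothetical layout: shard kernel rows over "data", replicate the bias.
specs = {"dense": {"kernel": P("data", None), "bias": P()}}
shardings = to_named_sharding(mesh, specs)

# Abstract array: shape, dtype, and target sharding, with no data allocated.
abstract_kernel = jax.ShapeDtypeStruct(
    (8 * jax.device_count(), 4),
    np.float32,
    sharding=shardings["dense"]["kernel"],
)
```

Working on abstract values like `abstract_kernel` is what lets sharding decisions be made before any parameters are materialized on devices.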

Note on Flax deprecation warnings:
Flax v0.12 emits DeprecationWarning for .value access and VariableState. These are intentionally left unaddressed because post-training currently requires Flax v0.11 compatibility.
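For context only: the Flax v0.12 warning for `.value` recommends `variable.get_value()`. A version-tolerant accessor is one way code can read a variable under either release. The sketch below is hypothetical and is not something this PR adds; `OldStyleVariable` and `NewStyleVariable` are stand-in classes, not Flax types, and `read_variable` is an invented helper name.

```python
class OldStyleVariable:
  """Stand-in for a variable that only exposes `.value`."""

  def __init__(self, value):
    self.value = value


class NewStyleVariable:
  """Stand-in for a variable that exposes `get_value()`."""

  def __init__(self, value):
    self._value = value

  def get_value(self):
    return self._value


def read_variable(variable):
  """Prefer get_value() when available, otherwise fall back to .value."""
  getter = getattr(variable, "get_value", None)
  if callable(getter):
    return getter()
  return variable.value
```

Both `read_variable(OldStyleVariable(3))` and `read_variable(NewStyleVariable(3))` return the wrapped value without the caller needing to know which interface is present.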

Tests

pytest tests/unit/maxtext_utils_nnx_test.py tests/unit/maxtext_utils_test.py -v

Checklist

Before submitting this PR, please make sure (put X in square brackets):

I have performed a self-review of my code. For an optional AI review, add the gemini-review label.
I have necessary comments in my code, particularly in hard-to-understand areas.
I have run end-to-end tests and provided workload links above if applicable.
I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation.


codecov · 2026-03-20T18:42:25Z

Codecov Report

❌ Patch coverage is 64.13793% with 52 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/maxtext/trainers/pre_train/train_compile.py	33.33%	14 Missing and 4 partials ⚠️
...rc/maxtext/utils/generate_param_only_checkpoint.py	30.00%	5 Missing and 2 partials ⚠️
src/maxtext/utils/train_utils.py	68.18%	5 Missing and 2 partials ⚠️
src/maxtext/layers/train_state_nnx.py	50.00%	5 Missing ⚠️
src/maxtext/utils/maxtext_utils_nnx.py	86.84%	2 Missing and 3 partials ⚠️
src/maxtext/utils/lora_utils.py	20.00%	4 Missing ⚠️
src/maxtext/trainers/pre_train/train.py	50.00%	1 Missing and 1 partial ⚠️
src/maxtext/utils/maxtext_utils.py	90.90%	1 Missing and 1 partial ⚠️
src/maxtext/utils/model_creation_utils.py	66.66%	2 Missing ⚠️


- Add utils to manipulate the NNX shardings with abstract state of a model
  - also add unit tests for the utils
- Extract mesh creation function to maxtext_utils.get_mesh_from_config()
  - also add unit tests for this func

Note: flax v0.12 has DeprecationWarning in multiple places:
- DeprecationWarning: '.value' access is now deprecated. Use variable.get_value() or variable[...] (for [Array]).
- DeprecationWarning: 'VariableState' was removed, this is just an alias to 'Variable'. Plase use 'Variable' directly instead.

But since the code needs to work with post-training, which currently requires flax v0.11, we didn't change code for these warnings.

NNX migration preparation: pure_nnx flag and init_state_fn

062aa7a

- pure_nnx: a flag to choose pure NNX logic when NNX and Linen models co-exist.
- init_state_fn: a function to initialize the model state for training. It will be set to a different function for NNX and Linen.

ecnal-cienet changed the title ~~Feat/migrate nnx utils~~ NNX migration prep (1/N): Migrate MaxText Utils Mar 20, 2026

ecnal-cienet changed the title ~~NNX migration prep (1/N): Migrate MaxText Utils~~ NNX migration prep (2/N): Migrate MaxText Utils Mar 20, 2026

ecnal-cienet force-pushed the feat/migrate-nnx-utils branch from 7669e8e to 4fc37b6 Compare March 20, 2026 21:54

ecnal-cienet force-pushed the feat/migrate-nnx-utils branch from 4fc37b6 to 722386f Compare March 21, 2026 00:57

ecnal-cienet changed the title ~~NNX migration prep (2/N): Migrate MaxText Utils~~ NNX migration prep (2/N): NNX utils and sharding utilities Mar 21, 2026

ecnal-cienet wants to merge 2 commits into main from feat/migrate-nnx-utils


2 participants
