Update the Ironwood offloading description #2997

RissyRan · 2026-01-22T21:37:46Z

Description

Harini reached out and pointed out those xla_tpu_enable_sparse_core_collective_offload* flags are enabled by default on Ironwood, not like v6e or v5p (I also cross checked in this doc). Reword the document a little bit and recommend customers to tune flags if needed.

# Original:
Enable SparseCore offloading for collectives: By setting the appropriate [XLA flags](https://github.com/AI-Hypercomputer/maxtext/blob/ed517cf80d9aa81f76e236c5516dacebfe39e96d/benchmarks/xla_flags_library.py#L70-L116), you can offload collective operations (like All-Reduce, All-Gather, etc.) to the SparseCores. These operations then run in parallel with the TensorCore computations, effectively hiding communication latency and improving Model Flop Utilization (MFU).

# Updated:
Leverage SparseCore offloading: By default, collective operations (like All-Reduce, All-Gather, etc.) are offloaded to SparseCore, allowing them to run in parallel with TensorCore computations. This effectively hides communication latency and improving Model Flop Utilization (MFU). Also, you can maximize throughput tuning those [XLA flags](https://github.com/AI-Hypercomputer/maxtext/blob/ed517cf80d9aa81f76e236c5516dacebfe39e96d/benchmarks/xla_flags_library.py#L70-L116).

Also format change to pass the mdformat check.

Tests

Manual check

Checklist

Before submitting this PR, please make sure (put X in square brackets):

I have performed a self-review of my code. For an optional AI review, add the gemini-review label.
I have necessary comments in my code, particularly in hard-to-understand areas.
I have run end-to-end tests tests and provided workload links above if applicable.
I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation.

docs/guides/optimization/custom_model.md

harini-sridhar

LGTM

codecov · 2026-01-22T23:27:15Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

NuojCheng

Thank you Ran!

RissyRan requested review from A9isha, bvandermoon, gagika, gobbleturk, jacoguzo, jiangjy1982, richjames0, shralex and vipannalla as code owners January 22, 2026 21:37

RissyRan force-pushed the sc_offload_doc branch 2 times, most recently from 12dfd5c to 95cdb41 Compare January 22, 2026 21:51

RissyRan assigned NuojCheng and gobbleturk Jan 22, 2026

harini-sridhar reviewed Jan 22, 2026

View reviewed changes

docs/guides/optimization/custom_model.md Outdated Show resolved Hide resolved

Update the Ironwood offloading description

824c566

RissyRan force-pushed the sc_offload_doc branch from 95cdb41 to 824c566 Compare January 22, 2026 23:12

RissyRan requested review from NuojCheng, Obliviour, SujeethJinesh, khatwanimohit, mitalisi, notabee, shauryagup and suexu1025 as code owners January 22, 2026 23:12

harini-sridhar reviewed Jan 22, 2026

View reviewed changes

gobbleturk approved these changes Jan 22, 2026

View reviewed changes

NuojCheng approved these changes Jan 22, 2026

View reviewed changes

RissyRan added the pull ready label Jan 23, 2026

RissyRan unassigned gobbleturk Jan 23, 2026

RissyRan unassigned NuojCheng Jan 23, 2026

copybara-service bot merged commit 4bcee99 into main Jan 23, 2026
33 checks passed

copybara-service bot deleted the sc_offload_doc branch January 23, 2026 02:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update the Ironwood offloading description #2997

Update the Ironwood offloading description #2997

Uh oh!

RissyRan commented Jan 22, 2026 •

edited

Loading

Uh oh!

Uh oh!

harini-sridhar left a comment

Uh oh!

codecov bot commented Jan 22, 2026

Uh oh!

NuojCheng left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Update the Ironwood offloading description #2997

Update the Ironwood offloading description #2997

Uh oh!

Conversation

RissyRan commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Tests

Checklist

Uh oh!

Uh oh!

harini-sridhar left a comment

Choose a reason for hiding this comment

Uh oh!

codecov bot commented Jan 22, 2026

Codecov Report

Uh oh!

NuojCheng left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

RissyRan commented Jan 22, 2026 •

edited

Loading