
Add Qwen3-30b-a3b RL recipe#3896

Open
SurbhiJainUSC wants to merge 1 commit into main from qwen3_rl_recipe

Conversation

@SurbhiJainUSC
Collaborator

@SurbhiJainUSC SurbhiJainUSC commented May 13, 2026

Description

This PR introduces a new RL training recipe for the Qwen3-30b-a3b model.

  • Why this change is being made: to provide a standardized, optimized RL configuration for fine-tuning the Qwen3-30b-a3b architecture within MaxText.
  • Problem being solved: an out-of-the-box recipe for this model size and architecture reduces the manual setup required for RL runs.
  • Implementation: adds the configuration and script files needed to support this pipeline.

Tests

Tested on the Ironwood cluster.

Checklist

Before submitting this PR, please make sure (put X in square brackets):

  • I have performed a self-review of my code. For an optional AI review, add the gemini-review label.
  • I have necessary comments in my code, particularly in hard-to-understand areas.
  • I have run end-to-end tests and provided workload links above if applicable.
  • I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation.

@codecov

codecov Bot commented May 13, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.


Review comment on the recipe script (quoted lines):

load_parameters_path=$MAXTEXT_CKPT_PATH"

# Workload Creation
xpk workload create-pathways \
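For context, the quoted lines are the start of a Pathways workload launch. A minimal sketch of what such a launch typically looks like is below; the cluster name, workload name, TPU type, and training command are hypothetical placeholders (not values from this PR), and flag names should be verified against the xpk version in use:

```shell
# Hypothetical sketch of launching a MaxText RL run via xpk Pathways.
# CLUSTER_NAME, WORKLOAD_NAME, and the TPU type are placeholders; check
# available flags with `xpk workload create-pathways --help`.
xpk workload create-pathways \
  --cluster="${CLUSTER_NAME}" \
  --workload="${WORKLOAD_NAME}" \
  --tpu-type=v5p-256 \
  --num-slices=1 \
  --command="python3 MaxText/train.py MaxText/configs/base.yml \
      load_parameters_path=${MAXTEXT_CKPT_PATH}"
```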
Collaborator


Should we still use XPK in the new recipes, considering its upcoming deprecation?
