[Debug] add ONLY_CALC_MISMATCH_RATIO and support cuda_device by hhaAndroid · Pull Request #1847 · InternLM/xtuner

hhaAndroid · 2026-05-27T04:20:15Z

新特性

支持 ONLY_CALC_MISMATCH_RATIO 可以仅仅跑 RL 的 mismatch。不需要真的训练，适用于 debug + 少卡情况。比如 qwen35b 之前需要 8 卡，现在只需要 2 卡即可跑
支持 CUDA_VISIBLE_DEVICES 方便指定任意卡数 debug
支持 TRAIN_BATCH_SIZE，方便想快速 debug 或者在卡数减少情况下 debug 而无需修改配置

以上新特性都是为了更方便的 debug。用法如下：

export ONLY_CALC_MISMATCH_RATIO=1
export TRAIN_BATCH_SIZE=8
export CUDA_VISIBLE_DEVICES=2,3

bash examples/v1/scripts/run_rl.sh examples/v1/config/rl_qwen3p5_vl_35B_grpo_mixdata.py "lmdeploy" $QWEN3P5_VL_MODEL_PATH $META_DATA_PATH

可以进一步配合如下参数，实现更高效 debug

export DEBUG_ROLLOUT_DIR='work_dirs_rl/debug_rollout'
export DEBUG_TRAIN=False # True
export DEBUG_ROLLOUT=False  # True

add ONLY_CALC_MISMATCH_RATIO and support cuda_device

152bc83

hhaAndroid changed the title ~~[Feature] add ONLY_CALC_MISMATCH_RATIO and support cuda_device~~ [Debug] add ONLY_CALC_MISMATCH_RATIO and support cuda_device May 27, 2026

YanhuiDua approved these changes May 27, 2026

View reviewed changes

hhaAndroid merged commit 5d7b104 into InternLM:main May 27, 2026
6 of 7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Debug] add ONLY_CALC_MISMATCH_RATIO and support cuda_device#1847

[Debug] add ONLY_CALC_MISMATCH_RATIO and support cuda_device#1847
hhaAndroid merged 1 commit into
InternLM:mainfrom
hhaAndroid:cuda_device

hhaAndroid commented May 27, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

hhaAndroid commented May 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

hhaAndroid commented May 27, 2026 •

edited

Loading