fix(datasets): align reasoning + assistant loss masks for left padding by khazic · Pull Request #2314 · NVIDIA-NeMo/Automodel

khazic · 2026-05-25T13:53:11Z

What does this PR do?

Fix loss-mask misalignment for left-padding tokenizers in
format_chat_template. Affects both the assistant mask and the
reasoning mask paths.

Background

_build_multiturn_assistant_mask and _build_reasoning_mask compute
their span indices from unpadded (padding=False) tokenizations
because they walk the formatted text to find role boundaries. The mask
arrays they return, however, are sized to the padded input_ids. When
tokenizer.padding_side == "left" (e.g. Phi-3, some Qwen-VL configs),
the actual content is right-aligned in input_ids, so the mask
positions fall in the left-padding region. The subsequent
attention_mask-based zeroing then wipes them, and training
silently learns from an all-zero loss mask for ~80% of samples.
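
A toy illustration of the failure mode described above, using plain Python lists and made-up token ids. Nothing here is taken from formatting_utils.py; place_spans and zero_padding are hypothetical names used only for this sketch.

```python
def place_spans(span_mask_unpadded, padded_len):
    """Size an unpadded span mask to the padded length WITHOUT shifting it."""
    return span_mask_unpadded + [0] * (padded_len - len(span_mask_unpadded))


def zero_padding(mask, attention_mask):
    """Zero every mask position that the attention mask marks as padding."""
    return [m * a for m, a in zip(mask, attention_mask)]


# Four content tokens; the last two are the assistant span (unpadded indices 2..3).
content_ids = [11, 12, 13, 14]
span_mask = [0, 0, 1, 1]

# Left padding: four pad tokens first, content right-aligned.
pad_len = 4
input_ids = [0] * pad_len + content_ids
attention_mask = [0] * pad_len + [1] * len(content_ids)

# Unshifted: the span still sits at indices 2..3, inside the padding region,
# so the attention_mask-based zeroing removes it entirely.
misaligned = place_spans(span_mask, len(input_ids))
loss_mask_buggy = zero_padding(misaligned, attention_mask)

# Shifted right by pad_len: the span lines up with the content tokens.
aligned = [0] * pad_len + span_mask
loss_mask_fixed = zero_padding(aligned, attention_mask)

print(loss_mask_buggy)  # [0, 0, 0, 0, 0, 0, 0, 0]
print(loss_mask_fixed)  # [0, 0, 0, 0, 0, 0, 1, 1]
```

In this example the mask is wiped completely because pad_len is at least as large as the end of the span; with less padding, part of the span would survive but at the wrong positions.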

Relationship to #2312

#2312 fixes this for the assistant mask path with an inline shift.
This PR is a strict superset of that fix:

- Extracts the shift into a reusable _maybe_shift_mask_for_left_padding
  helper.
- Applies it to both the assistant mask path (same site as #2312)
  and the reasoning mask path (which #2312 missed).
- Adds unit tests for the helper (8 cases covering happy / no-op /
  edge paths).

#2312 has since landed; the rebase conflict at the assistant-mask site has
been resolved by replacing the inline shift with the helper call.

Changelog

components/datasets/llm/formatting_utils.py:
- Add _maybe_shift_mask_for_left_padding(mask, tokenizer, attention_mask)
  helper. Three short-circuit guards make it a no-op for
  right-padding tokenizers, missing attention_mask, and
  pad_len == 0.
- Replace the inline shift from #2312 with a call to the helper after
  _build_multiturn_assistant_mask.
- Call it after _build_reasoning_mask (the bug #2312 missed).

tests/unit_tests/datasets/llm/test_shift_mask_left_padding.py:
- 8 unit tests using a SimpleNamespace fake tokenizer (no HF
  dependency), covering right-padding no-op, left-padding shift,
  zero pad_len no-op, attention_mask=None, missing
  padding_side attribute (default right), all-padding,
  single-content-token.
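
For readers without the diff open, here is a minimal sketch of what a helper with the three guards listed above could look like. It is not the code merged in this PR: it assumes mask and attention_mask are plain Python lists of 0/1 of equal length, and it derives pad_len by counting zeros in attention_mask, which is an assumption about how the real helper computes it.

```python
from types import SimpleNamespace


def maybe_shift_mask_for_left_padding(mask, tokenizer, attention_mask):
    """Sketch only: shift a token-level mask right by the left-padding length."""
    # Guard 1: right-padding tokenizers (or no padding_side attribute) are a no-op.
    if getattr(tokenizer, "padding_side", "right") != "left":
        return mask
    # Guard 2: without an attention mask the padding length is unknown.
    if attention_mask is None:
        return mask
    # Assumption: padding positions are exactly the zeros of attention_mask.
    pad_len = len(attention_mask) - sum(attention_mask)
    # Guard 3: nothing to shift when the sample has no padding.
    if pad_len == 0:
        return mask
    # Prepend pad_len zeros and drop the same number of trailing entries,
    # so the result keeps the padded length.
    return [0] * pad_len + list(mask[: len(mask) - pad_len])


# Fake tokenizers in the style the tests describe (no HF dependency).
left_tok = SimpleNamespace(padding_side="left")
right_tok = SimpleNamespace(padding_side="right")
bare_tok = SimpleNamespace()  # no padding_side attribute

mask = [0, 1, 1, 0, 0]            # span at unpadded indices 1..2
attention_mask = [0, 0, 1, 1, 1]  # two left-pad tokens, three content tokens

shifted = maybe_shift_mask_for_left_padding(mask, left_tok, attention_mask)
print(shifted)  # [0, 0, 0, 1, 1]
```

Returning the input object itself from every guard is what makes the right-padding path observably unchanged; the left-padding path returns a new list.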

Verification

Unit tests pass locally:

pytest tests/unit_tests/datasets/llm/test_shift_mask_left_padding.py -v

For right-padding tokenizers (Llama / Qwen / Mistral / ...) the helper
returns the original list object unchanged -- no behavioural change.

For left-padding tokenizers (Phi-3, ...) the previously-silent
all-zero loss_mask is now correctly aligned with content positions.

Before your PR is "Ready for review"

Pre checks:

Contributor guidelines followed
Unit tests added (test_shift_mask_left_padding.py)
No documentation changes needed -- internal helper.

copy-pr-bot · 2026-05-25T13:53:14Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

HuiyingLi · 2026-05-25T13:58:25Z

/ok to test 861fa9d

HuiyingLi · 2026-05-25T14:03:44Z

/claude review

HuiyingLi · 2026-05-26T08:32:16Z

/claude review

_build_multiturn_assistant_mask and _build_reasoning_mask compute span indices from unpadded (padding=False) tokenizations, but the mask arrays are sized to the padded input_ids. When the tokenizer pads on the left, content is right-aligned and the mask positions are off by pad_len.

Extract a shared _shift_mask_for_left_padding helper that shifts any token-level mask right by the padding offset when padding_side="left". Apply it to both the assistant mask and the reasoning mask in format_chat_template. For right-padding tokenizers (the vast majority) the helper is a no-op.

Signed-off-by: khazic <khazzz1c@gmail.com>

HuiyingLi · 2026-05-26T09:20:46Z

/ok to test f35f088

khazic requested review from HuiyingLi, ZhiyuLi-Nvidia, adil-a, akoumpa, athitten, hemildesai, pthombre and zyzhou5 as code owners May 25, 2026 13:53

github-actions Bot added the community-request label May 25, 2026

khazic force-pushed the khazic/fix/reasoning-mask-left-padding branch from 861fa9d to f35f088 Compare May 26, 2026 09:14

HuiyingLi approved these changes May 26, 2026

HuiyingLi enabled auto-merge (squash) May 26, 2026 09:19

khazic wants to merge 1 commit into NVIDIA-NeMo:main from khazic:khazic/fix/reasoning-mask-left-padding

3 participants
