Fix WarmupCosineLR multi-group initialization #7969
Merged
delock merged 1 commit into deepspeedai:master on Apr 12, 2026
Conversation
Signed-off-by: Masahiro Tanaka <mtanaka@anyscale.com>
`WarmupCosineLR` returned a singleton pre-start LR list even when the optimizer had multiple parameter groups. Because scheduler initialization applies LRs with `zip(param_groups, lrs)`, only group 0 was updated, and later groups kept their base LR until the first optimizer step. The fix brings the pre-start scheduler outputs in line with the multi-group contract: `get_lr_ratio()` now returns a scalar `0.0`, and the pre-start LR list is zero-filled and sized to `self.org_lrs`.
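To make the failure mode concrete, here is a minimal, self-contained sketch of the `zip()` truncation and the sizing fix. The two-group SGD optimizer and the `pre_start_lrs` / `org_lrs` names are illustrative stand-ins for the scheduler's internals, not the DeepSpeed source:

```python
# Illustrative sketch (assumed names, not the DeepSpeed implementation):
# a one-element LR list is silently truncated by zip() when the optimizer
# has several parameter groups.
import torch

model_a = torch.nn.Linear(4, 4)
model_b = torch.nn.Linear(4, 4)
optimizer = torch.optim.SGD([
    {"params": model_a.parameters(), "lr": 0.1},
    {"params": model_b.parameters(), "lr": 0.2},
])

# Buggy behavior: a singleton list only touches group 0.
pre_start_lrs = [0.0]  # what the scheduler used to return before step 0
for group, lr in zip(optimizer.param_groups, pre_start_lrs):
    group["lr"] = lr
print([g["lr"] for g in optimizer.param_groups])  # [0.0, 0.2] -- group 1 untouched

# Fixed behavior: size the list to the number of groups
# (self.org_lrs in the PR's terms).
org_lrs = [g["lr"] for g in optimizer.param_groups]
pre_start_lrs = [0.0] * len(org_lrs)
for group, lr in zip(optimizer.param_groups, pre_start_lrs):
    group["lr"] = lr
print([g["lr"] for g in optimizer.param_groups])  # [0.0, 0.0] -- all groups updated
```

Sizing the list to `org_lrs` rather than special-casing the group count keeps the pre-start path on the same per-group contract the rest of the scheduler already follows.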