Skip to content

Pull requests: deepspeedai/DeepSpeed

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Simplify module_inject.transpose
#8028 opened May 26, 2026 by xbcReal Contributor Loading…
[Draft] Add On-Policy Distillation (OPSD) Trainer in DeepSpeed
#8027 opened May 26, 2026 by PKUWZP Collaborator Loading…
3 of 5 tasks
feat(zero): enable torch.func transforms on engine for ZeRO 0/1/2
#8026 opened May 25, 2026 by roycho96 Contributor Loading…
Add engine.coalesce_grad_reduction() for ZeRO 1/2/3 multi-backward
#7992 opened May 5, 2026 by roycho96 Contributor Loading…
Add Qwen 3.5 preset to AutoTP
#7978 opened Apr 16, 2026 by tohtana Collaborator Draft
Fix/warnings stacklevel mvapich runner
#7949 opened Apr 2, 2026 by nathon-lee Contributor Draft
Refactor/torch autocast encapsulate global state
#7946 opened Apr 2, 2026 by nathon-lee Contributor Loading…
Add AutoEP
#7938 opened Mar 31, 2026 by tohtana Collaborator Loading…
Add torch_xla TPU support for ZeRO-1/2
#7917 opened Mar 21, 2026 by PKUWZP Collaborator Loading…
doc: Remove suggestion to build extensions in parallel
#7899 opened Mar 12, 2026 by Flamefire Contributor Loading…
Fix subgroup optimizer metadata inconsistency
#7820 opened Jan 27, 2026 by st-bang97 Contributor Draft
Fix bf16 dtype mismatch in ZeRO-3 with zero_quantized_weights
#7792 opened Jan 18, 2026 by juyterman1000 Contributor Loading…
ProTip! Follow long discussions with comments:>50.