Skip to content

Pull requests: foundation-model-stack/fms-fsdp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Various dataloader updates and fixes
#144 opened May 23, 2025 by daviswer Collaborator Loading…
Add zloss and partial weight decay
#136 opened Apr 15, 2025 by daviswer Collaborator Loading…
Add support for Mamba-MoE
#129 opened Jan 23, 2025 by lchu6 Contributor Loading…
Add support for FIM training
#125 opened Jan 10, 2025 by daviswer Collaborator Loading…
Suppress spammy warnings
#120 opened Oct 10, 2024 by daviswer Collaborator Loading…
Remove embed_* variants of model architectures in speculator training
#115 opened Sep 10, 2024 by sahilsuneja1 Collaborator Loading…
Minimal implementation of muP scaling for Llama
#98 opened Jul 22, 2024 by daviswer Collaborator Loading…
ProTip! Exclude everything labeled bug with -label:bug.