Pull requests: NVIDIA-NeMo/RL
beep boop 🤖: Bumping NeMo-RL to v0.6.1
CI
Relating to CI
#2549
opened May 22, 2026 by
nemo-automation-bot
Bot
beep boop 🤖: Bumping NeMo-RL to v0.6.1
#2548
opened May 22, 2026 by
nemo-automation-bot
Bot
beep boop 🤖: Bumping NeMo-RL to v0.6.1
#2547
opened May 22, 2026 by
nemo-automation-bot
Bot
ci: pin FW-CI-templates to NVIDIA-NeMo/FW-CI-templates#480
CI:docs
Run doctest
CI
Relating to CI
#2546
opened May 22, 2026 by
ko3n1g
Contributor
feat: make only_unmask_final configurable in SFT
#2543
opened May 22, 2026 by
ashors1
Contributor
4 tasks
ci: switch sglang to prebuilt PyPI wheels (v0.5.11)
CI:L1
Run doctests, unit tests, and functional tests
[feat:] Add CISPO loss
community-request
Documentation
Improvements or additions to documentation
waiting-on-maintainers
Waiting on maintainers to respond
#2531
opened May 19, 2026 by
pengdurice
Contributor
1 of 4 tasks
feat: PPO with MCore
CI:Lfast
Runs a fast test suite and reuses the nightly `main` container (but syncs dependencies to the PR's version)
#2530
opened May 19, 2026 by
bg51717
Contributor
4 tasks done
feat: add AsyncNemoGymRolloutManager for gym per-prompt rollouts
CI:Lfast
Runs a fast test suite and reuses the nightly `main` container (but syncs dependencies to the PR's version)
#2528
opened May 19, 2026 by
yuki-97
Contributor
1 task done
refactor(distillation): migrate DistillationConfig, DistillationSaveS…
CI:L1
Run doctests, unit tests, and functional tests
#2527
opened May 19, 2026 by
NolenLiang
Contributor
4 tasks
refactor(rm): migrate RMConfig, RMSaveState, RMValMetrics to BaseModel
CI:L1
Run doctests, unit tests, and functional tests
#2526
opened May 19, 2026 by
NolenLiang
Contributor
4 tasks
refactor(sft): migrate SFTConfig, SFTSaveState to BaseModel
CI:L1
Run doctests, unit tests, and functional tests
#2525
opened May 19, 2026 by
NolenLiang
Contributor
4 tasks
refactor(dpo): migrate DPOConfig, DPOSaveState, DPOValMetrics to Base…
CI:L1
Run doctests, unit tests, and functional tests
#2524
opened May 19, 2026 by
NolenLiang
Contributor
4 tasks
refactor(loss): migrate DPOLossConfig, DistillationLossConfig, DraftC…
CI:L1
Run doctests, unit tests, and functional tests
#2520
opened May 18, 2026 by
NolenLiang
Contributor
4 tasks
refactor(grpo): migrate TypedDict configs to pydantic BaseModel
CI:L1
Run doctests, unit tests, and functional tests
#2518
opened May 18, 2026 by
NolenLiang
Contributor
4 tasks
feat: fix the vLLM DP path
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#2517
opened May 18, 2026 by
guyueh1
Contributor
4 tasks
feat(sft): make only_unmask_final configurable in SFTConfig
CI:Lfast
Runs a fast test suite and reuses the nightly `main` container (but syncs dependencies to the PR's version)
#2516
opened May 17, 2026 by
yuki-97
Contributor
1 task done
fix: fix preserving dataset merge
CI:Lfast
Runs a fast test suite and reuses the nightly `main` container (but syncs dependencies to the PR's version)
#2515
opened May 17, 2026 by
yuki-97
Contributor
1 task done
fix(nemo-gym): clamp max_new_tokens to prompt + output <= max_model_len
CI:Lfast
Runs a fast test suite and reuses the nightly `main` container (but syncs dependencies to the PR's version)
#2514
opened May 17, 2026 by
yuki-97
Contributor
2 tasks done
fix(vllm): serialise AsyncMPClient input_socket sends to prevent zmq race
CI:Lfast
Runs a fast test suite and reuses the nightly `main` container (but syncs dependencies to the PR's version)
community-request
#2513
opened May 17, 2026 by
kaloyan-inherent