Skip to content

Pull requests: NVIDIA-NeMo/RL

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

beep boop 🤖: Bumping NeMo-RL to v0.6.1 CI Relating to CI
#2549 opened May 22, 2026 by nemo-automation-bot Bot Loading…
beep boop 🤖: Bumping NeMo-RL to v0.6.1
#2548 opened May 22, 2026 by nemo-automation-bot Bot Loading…
beep boop 🤖: Bumping NeMo-RL to v0.6.1
#2547 opened May 22, 2026 by nemo-automation-bot Bot Loading…
ci: pin FW-CI-templates to NVIDIA-NeMo/FW-CI-templates#480 CI:docs Run doctest CI Relating to CI
#2546 opened May 22, 2026 by ko3n1g Contributor Loading…
fix: set TIS type for Automodel GRPO release recipes
#2544 opened May 22, 2026 by zpqiu Contributor Draft
4 tasks done
feat: make only_unmask_final configurable in SFT
#2543 opened May 22, 2026 by ashors1 Contributor Loading…
4 tasks
feat: SDPO community-request
#2538 opened May 21, 2026 by bogdansalyp Contributor Draft
4 tasks
ci: switch sglang to prebuilt PyPI wheels (v0.5.11) CI:L1 Run doctests, unit tests, and functional tests
#2535 opened May 20, 2026 by kajalj22 Contributor Draft
4 tasks
[feat:] Add CISPO loss community-request Documentation Improvements or additions to documentation waiting-on-maintainers Waiting on maintainers to respond
#2531 opened May 19, 2026 by pengdurice Contributor Loading…
1 of 4 tasks
feat: PPO with MCore CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2530 opened May 19, 2026 by bg51717 Contributor Loading…
4 tasks done
feat: add AsyncNemoGymRolloutManager for gym per-prompt rollouts CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2528 opened May 19, 2026 by yuki-97 Contributor Loading…
1 task done
refactor(distillation): migrate DistillationConfig, DistillationSaveS… CI:L1 Run doctests, unit tests, and functional tests
#2527 opened May 19, 2026 by NolenLiang Contributor Loading…
4 tasks
refactor(rm): migrate RMConfig, RMSaveState, RMValMetrics to BaseModel CI:L1 Run doctests, unit tests, and functional tests
#2526 opened May 19, 2026 by NolenLiang Contributor Loading…
4 tasks
refactor(sft): migrate SFTConfig, SFTSaveState to BaseModel CI:L1 Run doctests, unit tests, and functional tests
#2525 opened May 19, 2026 by NolenLiang Contributor Loading…
4 tasks
refactor(dpo): migrate DPOConfig, DPOSaveState, DPOValMetrics to Base… CI:L1 Run doctests, unit tests, and functional tests
#2524 opened May 19, 2026 by NolenLiang Contributor Loading…
4 tasks
ci: build container on Azure for main builds CI Relating to CI
#2521 opened May 18, 2026 by kajalj22 Contributor Draft
1 of 4 tasks
refactor(loss): migrate DPOLossConfig, DistillationLossConfig, DraftC… CI:L1 Run doctests, unit tests, and functional tests
#2520 opened May 18, 2026 by NolenLiang Contributor Loading…
4 tasks
refactor(grpo): migrate TypedDict configs to pydantic BaseModel CI:L1 Run doctests, unit tests, and functional tests
#2518 opened May 18, 2026 by NolenLiang Contributor Loading…
4 tasks
feat: fix the vLLM DP path CI:L2 Run doctests, unit tests, functional tests, and convergence tests
#2517 opened May 18, 2026 by guyueh1 Contributor Loading…
4 tasks
feat(sft): make only_unmask_final configurable in SFTConfig CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2516 opened May 17, 2026 by yuki-97 Contributor Loading…
1 task done
fix: fix preserving dataset merge CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2515 opened May 17, 2026 by yuki-97 Contributor Loading…
1 task done
fix(nemo-gym): clamp max_new_tokens to prompt + output <= max_model_len CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2514 opened May 17, 2026 by yuki-97 Contributor Loading…
2 tasks done
fix(vllm): serialise AsyncMPClient input_socket sends to prevent zmq race CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version) community-request
#2513 opened May 17, 2026 by kaloyan-inherent Loading…
ProTip! Add no:assignee to see everything that’s not assigned.