Skip to content

Pull requests: NVIDIA-NeMo/RL

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix: allow zero grad norm in dtensor policies for consistency with Megatron CI:L1 Run doctests, unit tests, and functional tests
#1618 opened Dec 9, 2025 by smahdavi4 Loading…
[Draft] Add dapo recipe and test
#1617 opened Dec 9, 2025 by ZhiyuLi-Nvidia Loading…
4 tasks
📝 Add docstrings to feat/train-yaml-add-swanlab
#1616 opened Dec 9, 2025 by coderabbitai bot Loading…
fix: swanlab logger error caused by define_metric CI:L1 Run doctests, unit tests, and functional tests community-request
#1615 opened Dec 9, 2025 by Zeyi-Lin Loading…
1 of 4 tasks
feat: Support for Ray spinup within Gym
#1613 opened Dec 9, 2025 by pjin-nvidia Draft
4 tasks
Remove policy offload for async grpo dtensor
#1608 opened Dec 7, 2025 by smahdavi4 Loading…
train on transitions
#1606 opened Dec 6, 2025 by cmunley1 Draft
4 tasks
feat: add support from building images using vllm from private repos CI:L1 Run doctests, unit tests, and functional tests documentation Improvements or additions to documentation
#1605 opened Dec 6, 2025 by terrykong Loading…
4 tasks
Megatron refactor POC
#1592 opened Dec 2, 2025 by ashors1 Draft
4 tasks
docs: get started section documentation Improvements or additions to documentation
#1582 opened Dec 1, 2025 by lbliii Loading…
feat: add SGLang rollout backend, part1 community-request
#1580 opened Nov 30, 2025 by PrinsYin Loading…
4 tasks
fix: Fix Fp8 sequence padding for PP>1 case CI:L2 Run doctests, unit tests, functional tests, and convergence tests
#1579 opened Nov 29, 2025 by guyueh1 Loading…
4 tasks
feat: Support top-p and top-k CI:L1 Run doctests, unit tests, and functional tests
#1578 opened Nov 27, 2025 by zhandaz Loading…
3 of 4 tasks
feat: genrm rlhf
#1576 opened Nov 27, 2025 by yfw Draft
4 tasks
chore: Bump vllm to 0.11.2, torch to 2.9, transformers to 4.57.1 CI:L1 Run doctests, unit tests, and functional tests CI Relating to CI
#1563 opened Nov 24, 2025 by yfw Loading…
4 tasks
feat: LoRA SFT support for DTensorV2 path CI:L1 Run doctests, unit tests, and functional tests documentation Improvements or additions to documentation
#1556 opened Nov 21, 2025 by samodi-nv Loading…
2 tasks done
fix: remove sft-qwen2.5-fsdp2tp8sp from nighlies CI:L0 Run doctests and unit tests
#1555 opened Nov 20, 2025 by ahmadki Loading…
ProTip! Filter pull requests by the default branch with base:main.