-
Notifications
You must be signed in to change notification settings - Fork 181
Pull requests: NVIDIA-NeMo/RL
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix: Set validation accuracy to mean of rewards to handle non-[0,1] r…
#1619
opened Dec 9, 2025 by
alexandery-nvidia
Loading…
4 tasks
fix: allow zero grad norm in dtensor policies for consistency with Megatron
CI:L1
Run doctests, unit tests, and functional tests
#1618
opened Dec 9, 2025 by
smahdavi4
Loading…
📝 Add docstrings to
feat/train-yaml-add-swanlab
#1616
opened Dec 9, 2025 by
coderabbitai
bot
Loading…
fix: swanlab logger error caused by Run doctests, unit tests, and functional tests
community-request
define_metric
CI:L1
#1615
opened Dec 9, 2025 by
Zeyi-Lin
Loading…
1 of 4 tasks
fix: Handle disabled validation in SFT training
community-request
#1611
opened Dec 8, 2025 by
sahgerlad
Loading…
fix: Support datasets saved with save_to_disk in ResponseDataset
community-request
#1610
opened Dec 8, 2025 by
sahgerlad
Loading…
feat: add support from building images using vllm from private repos
CI:L1
Run doctests, unit tests, and functional tests
documentation
Improvements or additions to documentation
#1605
opened Dec 6, 2025 by
terrykong
Loading…
4 tasks
feat: refactor dtensor v2 policy __init__ and introduce core types
#1588
opened Dec 2, 2025 by
hemildesai
•
Draft
4 tasks
docs: get started section
documentation
Improvements or additions to documentation
#1582
opened Dec 1, 2025 by
lbliii
Loading…
feat: add SGLang rollout backend, part1
community-request
#1580
opened Nov 30, 2025 by
PrinsYin
Loading…
4 tasks
fix: Fix Fp8 sequence padding for PP>1 case
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#1579
opened Nov 29, 2025 by
guyueh1
Loading…
4 tasks
feat: Support top-p and top-k
CI:L1
Run doctests, unit tests, and functional tests
#1578
opened Nov 27, 2025 by
zhandaz
Loading…
3 of 4 tasks
chore: update megatron dev (11/21/2025) / mbridge (11/28/2025)
#1568
opened Nov 25, 2025 by
yaoyu-33
Loading…
4 tasks
chore: Bump vllm to 0.11.2, torch to 2.9, transformers to 4.57.1
CI:L1
Run doctests, unit tests, and functional tests
CI
Relating to CI
#1563
opened Nov 24, 2025 by
yfw
Loading…
4 tasks
fix: fix Dtensor sharding error when bump up pytorch version
#1557
opened Nov 21, 2025 by
ZhiyuLi-Nvidia
Loading…
4 tasks
feat: LoRA SFT support for DTensorV2 path
CI:L1
Run doctests, unit tests, and functional tests
documentation
Improvements or additions to documentation
#1556
opened Nov 21, 2025 by
samodi-nv
Loading…
2 tasks done
fix: remove sft-qwen2.5-fsdp2tp8sp from nighlies
CI:L0
Run doctests and unit tests
#1555
opened Nov 20, 2025 by
ahmadki
Loading…
feat: refactor dtensor policy v2 into core modular functions
#1542
opened Nov 19, 2025 by
hemildesai
•
Draft
4 tasks
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.