Skip to content

Pull requests: NVIDIA-NeMo/RL

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix: lower DPO Qwen2.5-Math-7B accuracy threshold after torch 2.11 up… CI:L1 Run doctests, unit tests, and functional tests
#2676 opened Jun 3, 2026 by NolenLiang Contributor Loading…
4 tasks
chore: bump _code_freeze workflow to v1.4.2 CI Relating to CI
#2675 opened Jun 3, 2026 by ko3n1g Contributor Loading…
ci: Bump Megatron-Bridge to b236c13 CI:L1 Run doctests, unit tests, and functional tests
#2674 opened Jun 3, 2026 by svcnvidia-nemo-ci Loading…
Add nano_v3 vLLM reasoning parser plugin
#2673 opened Jun 2, 2026 by dpickem Loading…
4 tasks
fix(grpo): improve non-colocated refit handling CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version) super-v3
#2672 opened Jun 2, 2026 by macandro96 Contributor Loading…
4 tasks
feat: Add script to re-initialize near-zero HF embeddings
#2671 opened Jun 2, 2026 by ashors1 Contributor Draft
4 tasks
fix: Fix fp8 memory fragmentation
#2670 opened Jun 2, 2026 by ashors1 Contributor Draft
4 tasks
fix: resolve qwen3.5-35ba3b megatron ep16 OOM via TP=2 (#2619)
#2668 opened Jun 2, 2026 by sharonyu-115 Contributor Loading…
4 tasks
chore: bump transfomrers 5.5 CI:L1 Run doctests, unit tests, and functional tests
#2667 opened Jun 2, 2026 by yuekaizhang Contributor Loading…
fix(test): increase timeout for grpo-llama3.2-1b-1n4g nightly test CI:docs Run doctest
#2664 opened Jun 2, 2026 by kajalj22 Contributor Loading…
fix(security): bump deps for CVE remediation (June 2026)
#2663 opened Jun 2, 2026 by kajalj22 Contributor Draft
4 tasks
fix(grpo): penalize invalid tool call and malformed thinking CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version) super-v3
#2656 opened Jun 1, 2026 by macandro96 Contributor Loading…
4 tasks
fix(nrl-k8s): remove SA impersonation from dev pod RBAC check CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2655 opened Jun 1, 2026 by terrykong Collaborator Loading…
2 tasks done
refactor: refactor generation config CI:L1 Run doctests, unit tests, and functional tests Documentation Improvements or additions to documentation
#2653 opened Jun 1, 2026 by yuki-97 Contributor Draft
fix(grpo): release async replay-buffer reservations on batch consumption CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version) super-v3
#2651 opened May 31, 2026 by macandro96 Contributor Loading…
4 tasks
initial commit of linear ce fusion for grpo community-request Documentation Improvements or additions to documentation
#2647 opened May 30, 2026 by pengdurice Contributor Draft
4 tasks done
docs: add policy lifecycle guide for custom training loops community-request Documentation Improvements or additions to documentation waiting-on-maintainers Waiting on maintainers to respond
#2644 opened May 30, 2026 by lonexreb Loading…
2 of 3 tasks
feat(data): support dotted import paths in dataset_name community-request Documentation Improvements or additions to documentation
#2642 opened May 30, 2026 by lonexreb Loading…
3 of 5 tasks
chore: Remove unused converter type community-request waiting-on-maintainers Waiting on maintainers to respond
#2640 opened May 30, 2026 by MyviordDjaja Loading…
1 task
fix(vllm): selectively port NeMo Gym/vLLM cherry-pick fixes CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version) super-v3
#2639 opened May 30, 2026 by macandro96 Contributor Loading…
4 tasks
feat: add Claude skill for building new native RL environments CI:docs Run doctest
#2638 opened May 29, 2026 by terrykong Collaborator Loading…
3 tasks
fix: fix fp8_params
#2633 opened May 29, 2026 by ashors1 Contributor Loading…
4 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.