-
Notifications
You must be signed in to change notification settings - Fork 407
Pull requests: NVIDIA-NeMo/RL
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: Enable tqdm configuration for vllm generation
community-request
#2677
opened Jun 3, 2026 by
louisfaury
Loading…
2 of 4 tasks
fix: lower DPO Qwen2.5-Math-7B accuracy threshold after torch 2.11 up…
CI:L1
Run doctests, unit tests, and functional tests
#2676
opened Jun 3, 2026 by
NolenLiang
Contributor
Loading…
4 tasks
chore: bump Relating to CI
_code_freeze workflow to v1.4.2
CI
#2675
opened Jun 3, 2026 by
ko3n1g
Contributor
Loading…
ci: Bump Megatron-Bridge to b236c13
CI:L1
Run doctests, unit tests, and functional tests
#2674
opened Jun 3, 2026 by
svcnvidia-nemo-ci
Loading…
fix(grpo): improve non-colocated refit handling
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
super-v3
#2672
opened Jun 2, 2026 by
macandro96
Contributor
Loading…
4 tasks
fix: resolve qwen3.5-35ba3b megatron ep16 OOM via TP=2 (#2619)
#2668
opened Jun 2, 2026 by
sharonyu-115
Contributor
Loading…
4 tasks
chore: bump transfomrers 5.5
CI:L1
Run doctests, unit tests, and functional tests
#2667
opened Jun 2, 2026 by
yuekaizhang
Contributor
Loading…
test(converters): add CLI entry-point coverage for all three converte…
community-request
#2666
opened Jun 2, 2026 by
SakethKoona
Loading…
3 of 4 tasks
fix(test): increase timeout for grpo-llama3.2-1b-1n4g nightly test
CI:docs
Run doctest
#2664
opened Jun 2, 2026 by
kajalj22
Contributor
Loading…
fix(grpo): penalize invalid tool call and malformed thinking
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
super-v3
#2656
opened Jun 1, 2026 by
macandro96
Contributor
Loading…
4 tasks
fix(nrl-k8s): remove SA impersonation from dev pod RBAC check
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2655
opened Jun 1, 2026 by
terrykong
Collaborator
Loading…
2 tasks done
refactor: refactor generation config
CI:L1
Run doctests, unit tests, and functional tests
Documentation
Improvements or additions to documentation
fix(grpo): release async replay-buffer reservations on batch consumption
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
super-v3
#2651
opened May 31, 2026 by
macandro96
Contributor
Loading…
4 tasks
initial commit of linear ce fusion for grpo
community-request
Documentation
Improvements or additions to documentation
#2647
opened May 30, 2026 by
pengdurice
Contributor
•
Draft
4 tasks done
feat(grpo): add Mistral 3.x recipes for GRPO (closes #2542)
community-request
#2645
opened May 30, 2026 by
RudimentaryChef
•
Draft
4 of 8 tasks
docs: add policy lifecycle guide for custom training loops
community-request
Documentation
Improvements or additions to documentation
waiting-on-maintainers
Waiting on maintainers to respond
#2644
opened May 30, 2026 by
lonexreb
Loading…
2 of 3 tasks
feat(data): support dotted import paths in dataset_name
community-request
Documentation
Improvements or additions to documentation
#2642
opened May 30, 2026 by
lonexreb
Loading…
3 of 5 tasks
chore: Remove unused converter type
community-request
waiting-on-maintainers
Waiting on maintainers to respond
#2640
opened May 30, 2026 by
MyviordDjaja
Loading…
1 task
fix(vllm): selectively port NeMo Gym/vLLM cherry-pick fixes
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
super-v3
#2639
opened May 30, 2026 by
macandro96
Contributor
Loading…
4 tasks
feat: add Claude skill for building new native RL environments
CI:docs
Run doctest
#2638
opened May 29, 2026 by
terrykong
Collaborator
Loading…
3 tasks
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.