Pull requests: huggingface/trl

Pull requests list

fix(profiling): log ProfilingContext metrics to Trackio backend
#5979 opened Jun 8, 2026 by Anai-Guo
3 of 6 tasks
Fix broken code examples in docs (RLOO syntax, SFTConfig max_length)
#5970 opened Jun 7, 2026 by DaoyuanLi2816 Contributor
3 of 8 tasks
[GOLD] VLM support for GOLDTrainer
#5969 opened Jun 7, 2026 by Strongich
4 of 8 tasks
Secure trl vllm-serve server defaults
#5965 opened Jun 7, 2026 by LaPhilosophie
4 of 8 tasks
Fix reward function name fallback for callable objects
#5955 opened Jun 6, 2026 by msdsm
5 of 8 tasks
Align KTO with DPO: Support VLM
#5939 opened Jun 4, 2026 by albertvillanova Member
Delta weight sync for AsyncGRPO (sparse patches over HF Bucket)
#5937 opened Jun 4, 2026 by AmineDiro Member
1 task done
chore: update pr_style_bot.yml
#5921 opened Jun 2, 2026 by hf-security-analysis Bot
Add AMD/ROCm CI
#5918 opened Jun 2, 2026 by kashif Collaborator
Fix prepare_deepspeed crash with cpu offload optimizer
#5916 opened Jun 2, 2026 by roycho96 Contributor
2 of 8 tasks
AsyncGRPOTrainer: add PEFT/LoRA support
#5896 opened May 31, 2026 by rycerzes Contributor
5 of 8 tasks
AsyncGRPOTrainer: add ProcessorMixin handling
#5895 opened May 31, 2026 by rycerzes Contributor
5 of 8 tasks
AsyncGRPOTrainer: add sampling parameters (top_p, top_k, min_p, repetition_penalty)
#5894 opened May 31, 2026 by rycerzes Contributor
5 of 8 tasks
AsyncGRPOTrainer: add model_init_kwargs support
#5893 opened May 31, 2026 by rycerzes Contributor
5 of 8 tasks
async grpo native weight sync with vllm>=0.22.0
#5892 opened May 30, 2026 by AmineDiro Member
Simplify reference model handling in GRPO/RLOO
#5877 opened May 29, 2026 by albertvillanova Member