Pull requests: huggingface/trl

Pull requests list

Add prompt-learning guard for PEFT with Liger in GRPO
#6186 opened Jun 26, 2026 by albertvillanova Member
Align KTO collator keys with the DPO convention
#6182 opened Jun 25, 2026 by qgallouedec Member
Move _get_kl_completion_ids into _get_kl_dataset
#6181 opened Jun 25, 2026 by qgallouedec Member
Sort conditional imports
#6180 opened Jun 25, 2026 by qgallouedec Member
Remove RUNNING_NAME from KTO
#6179 opened Jun 25, 2026 by qgallouedec Member
Align data collators across DPO / SFT / Reward / KTO
#6178 opened Jun 25, 2026 by qgallouedec Member
Promote KTO to stable API
#6175 opened Jun 25, 2026 by albertvillanova Member
Guard compute_mfu against a zero denominator (world_size == 0)
#6174 opened Jun 25, 2026 by CharlesCNorton
2 of 8 tasks
Add dataset mixture fractions
#6171 opened Jun 24, 2026 by nZiben Draft
5 of 8 tasks
Support PEFT with Liger in DPO
#6159 opened Jun 24, 2026 by albertvillanova Member
Remove redundant get_kbit_device_map()
#6158 opened Jun 24, 2026 by qgallouedec Member
Fix chunked_nll patch hiding VLM kwargs from generate
#6156 opened Jun 23, 2026 by Strongich
3 of 7 tasks
SFT: Truncate during dataset preparation, not collation
#6155 opened Jun 23, 2026 by qgallouedec Member
fix: don't re-flatten vLLM server completion_ids in Online DPO
#6146 opened Jun 23, 2026 by vineethsaivs
4 of 8 tasks
test: add FSDP2 distributed coverage for AsyncGRPOTrainer
#6144 opened Jun 23, 2026 by behroozazarkhalili Collaborator
Add entropy regularization to GRPO
#6140 opened Jun 22, 2026 by albertvillanova Member
[codex] Fix OpenEnv example remote compatibility
#6138 opened Jun 22, 2026 by burtenshaw Collaborator Draft