-
Notifications
You must be signed in to change notification settings - Fork 2.2k
Comparing changes
Open a pull request
base repository: huggingface/trl
base: v0.20.0
head repository: huggingface/trl
compare: v0.21.0
- 18 commits
- 63 files changed
- 15 contributors
Commits on Jul 29, 2025
-
Configuration menu - View commit details
-
Copy full SHA for eb5d0fe - Browse repository at this point
Copy the full SHA eb5d0feView commit details -
Fix broken PEFT+TRL docs link in
using_llama_models.md
(#3794)Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 9269f9f - Browse repository at this point
Copy the full SHA 9269f9fView commit details
Commits on Jul 30, 2025
-
Configuration menu - View commit details
-
Copy full SHA for 25ce0f3 - Browse repository at this point
Copy the full SHA 25ce0f3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 72bbc6d - Browse repository at this point
Copy the full SHA 72bbc6dView commit details -
Add vLLM transformers backend to online methods (#3773)
Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> Co-authored-by: sergiopaniego <sergiopaniegoblanco@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 90c7876 - Browse repository at this point
Copy the full SHA 90c7876View commit details -
Correction parameter description (#3803)
Co-authored-by: lunzhongwang <lunzhongwang@soulapp.cn> Co-authored-by: LeonEricsson <70749762+LeonEricsson@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 9a1e6a4 - Browse repository at this point
Copy the full SHA 9a1e6a4View commit details
Commits on Jul 31, 2025
-
Configuration menu - View commit details
-
Copy full SHA for 3ae60cd - Browse repository at this point
Copy the full SHA 3ae60cdView commit details -
add xpu support for mergekit (#3800)
Signed-off-by: Yao, Matrix <matrix.yao@intel.com>
Configuration menu - View commit details
-
Copy full SHA for ab24000 - Browse repository at this point
Copy the full SHA ab24000View commit details -
GSPO parameters update from v2 (#3798)
Co-authored-by: LeonEricsson <70749762+LeonEricsson@users.noreply.github.com> Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 79c5797 - Browse repository at this point
Copy the full SHA 79c5797View commit details -
Configuration menu - View commit details
-
Copy full SHA for 294e8cb - Browse repository at this point
Copy the full SHA 294e8cbView commit details -
fix CI docs and grpo slow test (#3814)
Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for dbbc770 - Browse repository at this point
Copy the full SHA dbbc770View commit details
Commits on Aug 1, 2025
-
Performance optimization: Replace list comprehensions with tensor ope…
…rations in BCO and KTO trainers (#3813) Co-authored-by: chiliu <chiliu@paypal.com>
Configuration menu - View commit details
-
Copy full SHA for ead5aaf - Browse repository at this point
Copy the full SHA ead5aafView commit details -
Configuration menu - View commit details
-
Copy full SHA for 072d7dd - Browse repository at this point
Copy the full SHA 072d7ddView commit details
Commits on Aug 4, 2025
-
Add 'Post training a VLM for reasoning with GRPO using TRL' recipe to…
… Community tutorials (#3843)
Configuration menu - View commit details
-
Copy full SHA for 6776376 - Browse repository at this point
Copy the full SHA 6776376View commit details
Commits on Aug 5, 2025
-
[GRPO]: Fix Entropy Mask Threshold Calculation when using Multi-GPU t…
…raining (#3833) Co-authored-by: LeonEricsson <70749762+LeonEricsson@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 5d914a4 - Browse repository at this point
Copy the full SHA 5d914a4View commit details -
Co-authored-by: LeonEricsson <70749762+LeonEricsson@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 21060b2 - Browse repository at this point
Copy the full SHA 21060b2View commit details -
🌺 OpenAI GPT OSS & Harmony support (#3848)
Co-authored-by: Shirin Yamani <75791599+shirinyamani@users.noreply.github.com> Co-authored-by: Sergio Paniego Blanco <sergiopaniegoblanco@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 17393b8 - Browse repository at this point
Copy the full SHA 17393b8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 46d09bd - Browse repository at this point
Copy the full SHA 46d09bdView commit details
This comparison is taking too long to generate.
Unfortunately it looks like we can’t render this comparison for you right now. It might be too big, or there might be something weird with your repository.
You can try running this command locally to see the comparison on your machine:
git diff v0.20.0...v0.21.0