🎀 New defaults: preparing the new structure #3530
Conversation
Pull Request Overview
A preparatory PR to standardize and document overridden default training arguments across multiple trainer config classes.
- Unifies class docstrings to reference the full `TrainingArguments` docs and note potential default differences.
- Introduces or updates a "# Parameters whose default values are overridden from TrainingArguments" section in each config (a sketch of the pattern is shown after this list).
- Removes duplicate `learning_rate` definitions and centralizes metadata for overridden defaults.
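To make the convention concrete, here is a minimal sketch of a config class following this pattern. It is illustrative only: the class body is abbreviated, and the 2e-5 default comes from the learning-rate discussion later in this thread rather than from the diff itself.

```python
from dataclasses import dataclass, field

from transformers import TrainingArguments


@dataclass
class SFTConfig(TrainingArguments):
    """
    Configuration class for the SFT trainer.

    Only parameters specific to SFT training are listed here. For the full list of parameters,
    refer to the `TrainingArguments` documentation. Note that default values of some parameters
    may differ from those in `TrainingArguments`.
    """

    # Parameters whose default values are overridden from TrainingArguments
    learning_rate: float = field(
        default=2e-5,  # value taken from the discussion below; illustrative
        metadata={"help": "Initial learning rate for AdamW. Overrides the TrainingArguments default."},
    )

    # Trainer-specific parameters would follow here (omitted in this sketch).
```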
Reviewed Changes
Copilot reviewed 15 out of 15 changed files in this pull request and generated 1 comment.
| File | Description |
| --- | --- |
| trl/trainer/utils.py | Added class doc updates and override-defaults placeholder |
| trl/trainer/sft_config.py | Unified docstring, moved `learning_rate` override section |
| trl/trainer/rloo_config.py | Updated docstring to reference base classes and defaults note |
| trl/trainer/reward_config.py | Added doc intro and override-defaults placeholder |
| trl/trainer/prm_config.py | Unified docstring, migrated `learning_rate` override section |
| trl/trainer/ppo_config.py | Updated docstring to mention `OnPolicyConfig` and defaults note |
| trl/trainer/orpo_config.py | Unified docstring, updated override-defaults section |
| trl/trainer/online_dpo_config.py | Updated doc intro and override-defaults section |
| trl/trainer/kto_config.py | Unified docstring, added override-defaults placeholder |
| trl/trainer/iterative_sft_config.py | Updated docstring and added override-defaults placeholder |
| trl/trainer/grpo_config.py | Migrated `learning_rate` override section and cleaned duplicates |
| trl/trainer/gkd_config.py | Updated doc intro (missing override-defaults section) |
| trl/trainer/dpo_config.py | Introduced override-defaults section, removed duplicate field |
| trl/trainer/cpo_config.py | Unified docstring and override-defaults section |
| trl/trainer/bco_config.py | Added doc intro and override-defaults placeholder |
Comments suppressed due to low confidence (1)
trl/trainer/gkd_config.py:26
- This class omits the '# Parameters whose default values are overridden from TrainingArguments' section present in other configs. For consistency, either add the header (even if empty) or remove all such placeholders across classes.
class GKDConfig(SFTConfig):
trl/trainer/utils.py (Outdated)

@@ -1063,6 +1067,9 @@ class OnPolicyConfig(TrainingArguments):
            Whether to push the model to the Hub after training.
    """

    # Parameters whose default values are overridden from TrainingArguments
    # None yet
[nitpick] The placeholder comment '# None yet' is ambiguous. Consider either removing it when no defaults are overridden or replacing it with a clearer note like 'No default overrides currently'.
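Concretely, the replacement Copilot is suggesting would amount to swapping the placeholder line for a clearer note, e.g. (illustrative wording only, not code from this PR):

```python
    # Parameters whose default values are overridden from TrainingArguments
    # No default overrides currently
```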
Can you make a suggestion so that I can just commit?
I guess I'll have to take care of it myself 😢
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
I see this PR is setting the new learning rates, and for all the changed trainers they are almost identical at 1e-6, except for SFT, which is 2e-5. Curious to know why the SFT default differs?
On average, these values should work best. In any case, these are more standard values.
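As a rough usage sketch (assuming the new defaults land roughly as discussed above, about 1e-6 for most trainers and 2e-5 for SFT; the exact per-trainer values may differ), the overridden defaults are picked up automatically when a config is instantiated, and an explicit value still takes precedence:

```python
from trl import DPOConfig, SFTConfig

# Defaults follow each trainer's overridden values rather than TrainingArguments' 5e-5.
sft_args = SFTConfig(output_dir="sft-out")
dpo_args = DPOConfig(output_dir="dpo-out")
print(sft_args.learning_rate)  # expected: 2e-5, per the discussion above
print(dpo_args.learning_rate)  # expected: around 1e-6, per the discussion above

# An explicit value still overrides the trainer-specific default.
custom_args = SFTConfig(output_dir="sft-out", learning_rate=5e-5)
print(custom_args.learning_rate)  # 5e-5
```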
A preparatory PR to cleanly override the default values of the training arguments.