Skip to content

Conversation

kashif
Copy link
Collaborator

@kashif kashif commented Sep 15, 2024

What does this PR do?

recommendation for the KTO learning rate to be small. See: https://x.com/ethayarajh/status/1834780782126153763

@kashif kashif requested a review from lewtun September 15, 2024 10:31
@qgallouedec qgallouedec added the 🏋 KTO Related to KTO label Sep 15, 2024
@qgallouedec
Copy link
Member

What do you think about overriding the default lr of TrainingArgument in this case? It seems to be more convenient for the user.

cc @kawine @corbt

@dataclass
class KTOConfig(TrainingArguments):
    ...
    learning_rate: float = 5e-7

@kawine
Copy link
Contributor

kawine commented Sep 16, 2024

thanks for taking the initiative @qgallouedec ! I just had one suggestion re: the docs.

@kashif
Copy link
Collaborator Author

kashif commented Sep 16, 2024

@qgallouedec done

@qgallouedec qgallouedec changed the title [KT0] learning rate recomentations for kto [KTO] learning rate recomentations for kto Sep 16, 2024
@qgallouedec
Copy link
Member

Thanks, for the record, we should extend this to all our configs, when necessary

@qgallouedec qgallouedec merged commit cdafc93 into main Sep 16, 2024
10 checks passed
@qgallouedec qgallouedec deleted the kto-learning-rate branch September 16, 2024 10:24
yxliu-TAMU pushed a commit to mincheolseong/ECEN743-GRPO-Project-Proposal that referenced this pull request Apr 20, 2025
)

* learning rate recomentations for kto

* update from suggestion

* override default lr

* add tip tag

* Update trl/trainer/kto_config.py

---------

Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
🏋 KTO Related to KTO
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants