Add GSPO script examples (VLM/LLM) #3810

sergiopaniego · 2025-07-30T09:33:05Z

What does this PR do?

Adding training scripts for GSPO (VLM/LLM).
I'll train 2 models with them with a subset of the dataset to verify but they're quite similar to grpo and grpo_vlm.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a GitHub issue? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

HuggingFaceDocBuilderDev · 2025-07-30T09:38:33Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

examples/scripts/gspo.py

examples/scripts/gspo_vlm.py

qgallouedec · 2025-07-30T09:55:25Z

Super cool and useful! Some comments about the hyperparam

sergiopaniego added 2 commits July 30, 2025 11:30

Add GSPO script examples (VLM/LLM)

3cfb238

Typo fixed

0dd9b76

qgallouedec reviewed Jul 30, 2025

View reviewed changes

examples/scripts/gspo.py Outdated Show resolved Hide resolved

examples/scripts/gspo_vlm.py Outdated Show resolved Hide resolved

sergiopaniego added 5 commits July 30, 2025 12:21

Correct model

d7ec454

Merge branch 'main' of github.com:huggingface/trl into gspo-scripts

2067b62

Reorder examples

66ad107

Reorder examples

9c62541

Update params

ab00636

kashif approved these changes Jul 30, 2025

View reviewed changes

qgallouedec merged commit 3ae60cd into main Jul 31, 2025
11 checks passed

qgallouedec deleted the gspo-scripts branch July 31, 2025 02:07

LuisVasquezBSC pushed a commit to langtech-bsc/trl that referenced this pull request Aug 28, 2025

Add GSPO script examples (VLM/LLM) (huggingface#3810)

d95b967

LuisVasquezBSC pushed a commit to langtech-bsc/trl that referenced this pull request Aug 28, 2025

Add GSPO script examples (VLM/LLM) (huggingface#3810)

01e24b8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add GSPO script examples (VLM/LLM) #3810

Add GSPO script examples (VLM/LLM) #3810

Uh oh!

sergiopaniego commented Jul 30, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Jul 30, 2025

Uh oh!

Uh oh!

Uh oh!

qgallouedec commented Jul 30, 2025

Uh oh!

Uh oh!

Uh oh!

Add GSPO script examples (VLM/LLM) #3810

Add GSPO script examples (VLM/LLM) #3810

Uh oh!

Conversation

sergiopaniego commented Jul 30, 2025

What does this PR do?

Before submitting

Who can review?

Uh oh!

HuggingFaceDocBuilderDev commented Jul 30, 2025

Uh oh!

Uh oh!

Uh oh!

qgallouedec commented Jul 30, 2025

Uh oh!

Uh oh!

Uh oh!