⏯️ Fix logging when resuming from checkpoint GRPO #3185

qgallouedec · 2025-03-31T03:29:13Z

First training, (orange) from 0 to 100 timestep
Second training (purple), resume from timestep 40: curves overlap.

…ngface/trl into fix_resume_from_checkpoint

qgallouedec · 2025-04-01T21:22:44Z

trl/trainer/grpo_trainer.py

@@ -481,7 +480,7 @@ def data_collator(features):  # No data collation is needed in GRPO
            # vLLM specific sampling arguments
            self.guided_decoding_regex = args.vllm_guided_decoding_regex

-            self._last_loaded_step = 0  # tag to avoid useless loading during grad accumulation
+            self._last_loaded_step = -1  # tag to avoid useless loading during grad accumulation


Make sure that for the very first generation, the model loaded is the same as that of the model running on the server.

HuggingFaceDocBuilderDev · 2025-04-01T21:25:35Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

fix resume and num_tokens

ef84deb

lewtun mentioned this pull request Mar 31, 2025

Cannot Resume Training From a Trained Checkpoint. Is this a bug ? huggingface/open-r1#553

Closed

Merge branch 'main' into fix_resume_from_checkpoint

569949e

qgallouedec marked this pull request as ready for review April 1, 2025 21:18

qgallouedec added 2 commits April 1, 2025 21:20

Empty commit to trigger CI

d8b5835

Merge branch 'fix_resume_from_checkpoint' of https://github.com/huggi…

fc72d67

…ngface/trl into fix_resume_from_checkpoint

qgallouedec commented Apr 1, 2025

View reviewed changes

qgallouedec changed the title ~~⏯️ Fix resuming from checkpoint GRPO~~ ⏯️ Fix logging when resuming from checkpoint GRPO Apr 1, 2025

qgallouedec requested review from kashif, edbeeching, lewtun and shirinyamani April 2, 2025 16:41

Merge branch 'main' into fix_resume_from_checkpoint

ca1068e

kashif approved these changes Apr 4, 2025

View reviewed changes

qgallouedec merged commit 65308cf into main Apr 5, 2025
8 of 10 checks passed

qgallouedec deleted the fix_resume_from_checkpoint branch April 5, 2025 04:51

yxliu-TAMU pushed a commit to mincheolseong/ECEN743-GRPO-Project-Proposal that referenced this pull request Apr 20, 2025

⏯️ Fix logging when resuming from checkpoint GRPO (huggingface#3185)

4bb0032

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

⏯️ Fix logging when resuming from checkpoint GRPO #3185

⏯️ Fix logging when resuming from checkpoint GRPO #3185

Uh oh!

qgallouedec commented Mar 31, 2025

Uh oh!

qgallouedec Apr 1, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Apr 1, 2025

Uh oh!

Uh oh!

Uh oh!

⏯️ Fix logging when resuming from checkpoint GRPO #3185

⏯️ Fix logging when resuming from checkpoint GRPO #3185

Uh oh!

Conversation

qgallouedec commented Mar 31, 2025

Uh oh!

qgallouedec Apr 1, 2025

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented Apr 1, 2025

Uh oh!

Uh oh!

Uh oh!