After SFT, the model makes repetitions #520

Hello, I tried to use the following two datasets:

- [OpenThoughts-114k](https://huggingface.co/datasets/open-thoughts/OpenThoughts-114k)
- [OpenR1-Math-220k](https://huggingface.co/datasets/open-r1/OpenR1-Math-220k)

After SFT on Qwen2.5-Math-7B, the model trained with `packing=true` repeated itself severely during generation, while with `packing=false` the repetition was reduced. I have already applied the fix from [SFT model make repetitions during the inference phase](https://github.com/huggingface/open-r1/issues/492).
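To compare the `packing=true` and `packing=false` checkpoints with a number rather than by eye, something like the rough sketch below could be used. It is not part of open-r1 or TRL; the function name `repeated_ngram_fraction` and the choice of whitespace-split 4-grams are assumptions made only for illustration.

```python
from collections import Counter


def repeated_ngram_fraction(text: str, n: int = 4) -> float:
    """Return the fraction of word n-grams that duplicate an earlier n-gram.

    0.0 means no n-gram occurs twice; values near 1.0 mean the text is
    almost entirely a loop. Hypothetical helper, whitespace tokenization.
    """
    words = text.split()
    ngrams = [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]
    if not ngrams:
        # Text shorter than n words: nothing can repeat.
        return 0.0
    counts = Counter(ngrams)
    # Every occurrence beyond the first counts as a repetition.
    repeated = sum(c - 1 for c in counts.values())
    return repeated / len(ngrams)
```

Averaging this score over generations from the same prompts for each checkpoint should show whether the packed run really loops more.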

Any ideas?
