Hello, I tried to use the following two datasets: - [OpenThoughts-114k](https://huggingface.co/datasets/open-thoughts/OpenThoughts-114k) - [OpenR1-Math220k](https://huggingface.co/datasets/open-r1/OpenR1-Math-220k) After SFT on Qwen2.5-Math-7B, the model with `packing=true` had serious auto-regression, and the repetitions with `packing=false` was reduced. I have fixed [SFT model make repetitions during the inference phase](https://github.com/huggingface/open-r1/issues/492). Any ideas?