-
Notifications
You must be signed in to change notification settings - Fork 2.1k
✋ Prevent applying the chat template to tokenized datasets #2939
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
@DanFosing which version of TRL are you using? |
I experienced this issue with both v0.15.1 and with the alpha version downloaded using: |
Oh and I forgot to mention, max_seq_length didn't seem to work for me for some reason, the warning says it will be deprecated in v.0.20.0 but are you sure it wasn't deprecated already? (that's why I added a comment there in the code, but it's not related to the main fix) |
@DanFosing ok so kindly remove the |
we can fix the warning and say: removed in version |
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
|
i don't think there is a bug, but please correct me @DanFosing if I am mistaken, the issue is that it's doing this extra work when it's not needed. |
WDYM "didn't seem to work"? Same question, is an exception raised? if so what's the traceback? |
for clarification, the only extra work done is iterating through the dataset: Lines 218 to 221 in 5c05913
which is usually very fast |
That being said, I'm ok to add the |
@bot /style |
Style fixes have been applied. View the workflow run here. |
* Update sft_config.py * Update sft_trainer.py * Update sft_config.py * Update sft_trainer.py * Apply style fixes --------- Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
…ce#2939) * Update sft_config.py * Update sft_trainer.py * Update sft_config.py * Update sft_trainer.py * Apply style fixes --------- Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
…ce#2939) * Update sft_config.py * Update sft_trainer.py * Update sft_config.py * Update sft_trainer.py * Apply style fixes --------- Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Fixes: