
@yash-srivastava19
Contributor

#1751 mentioned that the TRL CLI is not correctly capturing torch_dtype. I thought the issue was urgent, so I quickly patched a hacky fix, which at least lets the SFTTrainer initialize.

Original issue:
On running the following command:

trl sft --model_name_or_path=facebook/opt-125m --dataset_name=imdb  --dataset_text_field=text --max_steps=1 --torch_dtype=bfloat16 --output_dir=./test

The error was that trl sft does not check whether the value is still a string before calling getattr(torch, model_init_kwargs["torch_dtype"]), which fails when a torch.dtype is passed instead.
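
For illustration, a minimal self-contained sketch of the failure mode, assuming the CLI has already resolved the flag into a torch.dtype before the trainer sees it:

import torch

# What the trainer effectively runs when torch_dtype arrives as a
# torch.dtype instead of a str: getattr requires a string attribute name.
dtype = torch.bfloat16
getattr(torch, dtype)  # raises TypeError: attribute name must be string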

The fix keeps the pipeline from breaking at this stage. Although it is a hacky fix, I'm willing to work on it further :)

The error after that comes from the transformers library, which isn't able to serialize the dtype object (traceback below):

Traceback (most recent call last):
...
TypeError: Object of type dtype is not JSON serializable
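
This second failure is easy to reproduce in isolation: the standard-library JSON encoder has no handler for torch.dtype objects. A minimal sketch (the str-based workaround below is a hypothetical illustration, not the actual transformers fix):

import json
import torch

# The default JSON encoder does not know how to handle torch.dtype.
try:
    json.dumps({"torch_dtype": torch.bfloat16})
except TypeError as err:
    print(err)  # Object of type dtype is not JSON serializable

# Hypothetical workaround: serialize the dtype back to a plain string.
print(json.dumps({"torch_dtype": str(torch.bfloat16).removeprefix("torch.")}))
# {"torch_dtype": "bfloat16"}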


@alvarobartt
Member

Hi @yash-srivastava19, thanks for this PR, but this is not how we should fix it. Ideally we should catch this either by checking that the received value is a torch.dtype, or by ensuring that the str provided as torch_dtype via the CLI is not transformed into a torch.dtype before instantiating the SFTTrainer, for example.

So a more suitable fix should be the following:

# Keep "auto", None, and already-resolved torch.dtype values as-is;
# otherwise treat the value as an attribute name on the torch module.
model_init_kwargs["torch_dtype"] = (
  model_init_kwargs["torch_dtype"]
  if model_init_kwargs["torch_dtype"] in ["auto", None]
  or isinstance(model_init_kwargs["torch_dtype"], torch.dtype)
  else getattr(torch, model_init_kwargs["torch_dtype"])
)
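
To make the behavior concrete, a small standalone sketch of the same logic; the helper name normalize_torch_dtype is hypothetical, pulled out purely for illustration:

import torch

def normalize_torch_dtype(value):
    # Same logic as the expression above: pass through "auto", None, and
    # values that are already a torch.dtype; resolve str names via getattr.
    if value in ["auto", None] or isinstance(value, torch.dtype):
        return value
    return getattr(torch, value)  # e.g. "bfloat16" -> torch.bfloat16

assert normalize_torch_dtype("bfloat16") is torch.bfloat16
assert normalize_torch_dtype(torch.float16) is torch.float16
assert normalize_torch_dtype("auto") == "auto"
assert normalize_torch_dtype(None) is None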

Anyway, I'll let the authors chime in with their thoughts and ideas about a potential fix. Thanks 🤗

@younesbelkada
Contributor

Thanks a lot for this! I second what @alvarobartt said above; we can change this fix to something like:

diff --git a/trl/trainer/sft_trainer.py b/trl/trainer/sft_trainer.py
index e739b2d..80e11ad 100644
--- a/trl/trainer/sft_trainer.py
+++ b/trl/trainer/sft_trainer.py
@@ -159,11 +159,13 @@ class SFTTrainer(Trainer):
             raise ValueError("You passed model_init_kwargs to the SFTConfig, but your model is already instantiated.")
         else:
             model_init_kwargs = args.model_init_kwargs
-            model_init_kwargs["torch_dtype"] = (
-                model_init_kwargs["torch_dtype"]
-                if model_init_kwargs["torch_dtype"] in ["auto", None]
-                else getattr(torch, model_init_kwargs["torch_dtype"])
-            )
+            torch_dtype = model_init_kwargs["torch_dtype"]
+
+            # Convert to `torch.dtype` if a str is passed
+            if isinstance(torch_dtype, str) and torch_dtype != "auto":
+                torch_dtype = getattr(torch, torch_dtype)
+
+            model_init_kwargs["torch_dtype"] = torch_dtype

         if infinite is not None:
             warnings.warn(

And it worked fine on my end! Would you be happy to apply these changes instead in this PR?
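
For reference, a standalone sketch of how the patched logic in the diff treats each kind of input (this loop is illustrative, not the actual trainer code):

import torch

# Recreation of the patched logic: only plain strings other than "auto"
# are converted; "auto", None, and torch.dtype values pass through.
for value in ["bfloat16", "auto", None, torch.float32]:
    torch_dtype = value
    if isinstance(torch_dtype, str) and torch_dtype != "auto":
        torch_dtype = getattr(torch, torch_dtype)
    print(repr(value), "->", repr(torch_dtype))
# 'bfloat16' -> torch.bfloat16
# 'auto' -> 'auto'
# None -> None
# torch.float32 -> torch.float32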

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@yash-srivastava19
Contributor Author

> Thanks a lot for this! I second what @alvarobartt said above; we can change this fix to something like: [...] And it worked fine on my end! Would you be happy to apply these changes instead in this PR?

Yes, that is a much better approach. Agreed.

@yash-srivastava19
Contributor Author

Was the JSON encoding error rectified as well, or does it persist even after the fix?

@younesbelkada
Contributor

Thanks! That's another issue we can fix in a follow-up PR!

@alvarobartt
Member

Hi @yash-srivastava19, a friendly ping to check on the status of this PR 👍🏻 Is it something you are still happy/comfortable to work on, or would you prefer us to take over instead? Just let us know, thanks 🤗

@alvarobartt
Member

Hi @yash-srivastava19, we'll be closing this PR in favour of #1807, and you've been included as a contributor there 🤗 Thanks a lot for the effort!

@alvarobartt closed this on Jul 5, 2024