ℹ️ Unify autocast behavior to torch.autocast and make it cover XPU #3541
Conversation
device-agnostic to cover xpu
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
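For context, the PR replaces CUDA-specific autocast usage with the device-agnostic `torch.autocast` context manager so XPU is covered too. A minimal sketch of the pattern (the device-selection logic here is illustrative, not the PR's exact code):

```python
import torch

# Pick an accelerator if one is present; fall back to CPU (illustrative logic).
if torch.cuda.is_available():
    device_type = "cuda"
elif hasattr(torch, "xpu") and torch.xpu.is_available():
    device_type = "xpu"
else:
    device_type = "cpu"

# torch.autocast takes the backend as a string, so the same code path
# works on CUDA, XPU, and CPU, unlike torch.cuda.amp.autocast.
with torch.autocast(device_type=device_type, dtype=torch.bfloat16):
    x = torch.randn(4, 4)
    y = x @ x  # matmul runs under the autocast policy where supported
```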
```diff
- training_args = DPOConfig(..., optimize_cuda_cache=True)
+ training_args = DPOConfig(..., optimize_device_cache=True)
```
There is no `optimize_cuda_cache` anymore, so update the doc here.
```diff
@@ -82,7 +82,7 @@ class ScriptArguments:
     batch_size=script_args.batch_size,
     mini_batch_size=script_args.mini_batch_size,
     gradient_accumulation_steps=script_args.gradient_accumulation_steps,
-    optimize_cuda_cache=True,
+    optimize_device_cache=True,
```
Same as above.
```diff
-    torch.cuda.empty_cache()
-elif torch_device == "xpu":
-    torch.xpu.empty_cache()
+backend_empty_cache(torch_device)
```
Use the device-agnostic utility from `transformers.testing_utils` rather than if/else.
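`backend_empty_cache` dispatches to the right backend internally; a simplified sketch of that kind of device-agnostic helper (the function name below is illustrative, not the actual transformers implementation) could look like:

```python
import torch

def empty_cache_for(device: str) -> None:
    """Release cached allocator memory on the given backend (simplified sketch)."""
    if device == "cuda" and torch.cuda.is_available():
        torch.cuda.empty_cache()
    elif device == "xpu" and hasattr(torch, "xpu") and torch.xpu.is_available():
        torch.xpu.empty_cache()
    # On CPU there is no caching allocator to clear, so this is a no-op.

empty_cache_for("cpu")  # safe no-op on machines without an accelerator
```

Centralizing the dispatch this way keeps test bodies free of per-backend if/else chains.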
```python
@unittest.skipIf(
    get_device_properties()[0] == "cuda" and get_device_properties()[1] < 8,
    "Skipping because bf16 not supported on CUDA GPU with capability < 8.0",
)
```
Add `skipIf` per the comments and remove the condition-less skip.
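The skip condition assumes `get_device_properties()` returns something like a `(device_type, major_capability)` tuple. An equivalent check written with plain `torch` (a sketch with an illustrative name, not the transformers helper) would be:

```python
import torch

def bf16_ok_on_this_device() -> bool:
    """Mirror of the skip condition: only CUDA devices with compute
    capability below 8.0 are gated; other backends pass through (sketch)."""
    if not torch.cuda.is_available():
        return True  # non-CUDA backends aren't gated by this check
    major, _minor = torch.cuda.get_device_capability()
    return major >= 8
```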
```diff
-if is_torch_xpu_available():
-    return f"xpu:{state.local_process_index}"
+if torch.cuda.is_available() or is_torch_xpu_available():
     return state.local_process_index
 elif is_torch_npu_available():
```
We don't need this workaround anymore; XPU now supports integer device indices.
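The dropped branch existed because XPU once required an explicit `"xpu:{index}"` string where CUDA accepted a bare integer. A quick sketch of why the unified integer path now works (the behavior shown is assumed from the comment above):

```python
import torch

# An explicit backend string plus integer index has always worked:
explicit = torch.device("xpu", 0)

# Recent PyTorch also resolves a bare integer ordinal against the active
# accelerator backend, so trainers can return state.local_process_index
# directly instead of formatting an f"xpu:{index}" string (illustrative).
bare = torch.device(0)
```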
Nice, thanks! Just one comment.
Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>
@qgallouedec is the test failing due to the CI issue?
Yes, fixing it in #3551
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Let's wait for #3553 to be merged
@kashif, please help review and comment, thanks very much.