enable activation offloading on XPU #3444

yao-matrix · 2025-05-13T23:11:17Z

No description provided.

Signed-off-by: Matrix Yao <matrix.yao@intel.com>

yao-matrix · 2025-05-13T23:15:30Z

trl/models/activation_offloading.py

+        )
+        # NOTE: xpu doesn't have `default_stream` API, use `current_stream` instead
+        self.s0 = (
+            torch.xpu.current_stream() if self.accelerator_type == "xpu" else torch.cuda.default_stream()


@gujinghui, pls help review this logic. Since xpu doesn't have default_stream, so i am using current_stream to WA.

LGTM. As long as the workload has no specific assumption on cuda default stream, the current stream is good enough to replace it for functionality.

yao-matrix · 2025-05-14T06:52:11Z

@kashif , pls help review, thx.

HuggingFaceDocBuilderDev · 2025-05-15T09:57:44Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

trl/models/activation_offloading.py

yao-matrix · 2025-05-19T07:48:27Z

seems the ci failures are not brought by my PR @kashif

Signed-off-by: Matrix Yao <matrix.yao@intel.com> Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>

yao-matrix added 2 commits May 12, 2025 23:56

enable activation offloading on XPU

67d3fb3

Signed-off-by: Matrix Yao <matrix.yao@intel.com>

Merge branch 'main' into activation-off-xpu

f688f7b

yao-matrix marked this pull request as draft May 13, 2025 23:11

yao-matrix commented May 14, 2025

View reviewed changes

yao-matrix marked this pull request as ready for review May 14, 2025 06:50

kashif self-assigned this May 14, 2025

kashif reviewed May 15, 2025

View reviewed changes

trl/models/activation_offloading.py Show resolved Hide resolved

Merge branch 'main' into activation-off-xpu

e3c5500

kashif reviewed May 16, 2025

View reviewed changes

trl/models/activation_offloading.py Outdated Show resolved Hide resolved

Update trl/models/activation_offloading.py

3c44ce8

kashif approved these changes May 19, 2025

View reviewed changes

Merge branch 'main' into activation-off-xpu

346e32d

kashif merged commit 64aa064 into huggingface:main May 19, 2025
10 checks passed

shirinyamani pushed a commit that referenced this pull request May 19, 2025

enable activation offloading on XPU (#3444)

cdb3f42

Signed-off-by: Matrix Yao <matrix.yao@intel.com> Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>

yao-matrix deleted the activation-off-xpu branch May 19, 2025 22:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

enable activation offloading on XPU #3444

enable activation offloading on XPU #3444

Uh oh!

yao-matrix commented May 13, 2025

Uh oh!

yao-matrix May 13, 2025

Uh oh!

gujinghui May 14, 2025

Uh oh!

yao-matrix commented May 14, 2025

Uh oh!

HuggingFaceDocBuilderDev commented May 15, 2025

Uh oh!

Uh oh!

Uh oh!

yao-matrix commented May 19, 2025

Uh oh!

Uh oh!

Uh oh!

enable activation offloading on XPU #3444

enable activation offloading on XPU #3444

Uh oh!

Conversation

yao-matrix commented May 13, 2025

Uh oh!

yao-matrix May 13, 2025

Choose a reason for hiding this comment

Uh oh!

gujinghui May 14, 2025

Choose a reason for hiding this comment

Uh oh!

yao-matrix commented May 14, 2025

Uh oh!

HuggingFaceDocBuilderDev commented May 15, 2025

Uh oh!

Uh oh!

Uh oh!

yao-matrix commented May 19, 2025

Uh oh!

Uh oh!

Uh oh!