Skip to content

Conversation

simonlui
Copy link
Contributor

@simonlui simonlui commented Aug 13, 2025

Last patch apparently introduced some regressions for iGPU users. According to pytorch/pytorch#152317, torch.xpu.get_device_capability(device)['has_bfloat16_conversions'] only checks if a device supports generating SPIRV BF16 code which is false on Lunar Lake iGPUs which slowed down for those devices. One should be using torch.xpu.is_bf16_supported() which was not documented which mirrors the Nvidia counterpart so use that instead for BF16 type checking which restores most of the speed according to a user who brought this up and did testing on the patch.

Also disable non-blocking by default on XPU due to it running slower overall on other operations not generation related at this time. But because of speedups on dGPUs, introduce a flag to force it on if needed. That in addition to above restores the speed completely for iGPU users.

I also redid the IPEX check to mirror all alternative backends to make it a lot more simple and eschew version juggling. I check first for IPEX import then check XPU availability using the standard available call since it now handles some of the checks the older check did. Documentation was updated to mostly match #7767 changes but also to keep allowing users to install nightly for better performance. Removed discussion thread as things now work out of the box for the most part without any real need to finagle with things using the standard installation procedure.

@comfyanonymous comfyanonymous merged commit c991a5d into comfyanonymous:master Aug 13, 2025
6 checks passed
@simonlui simonlui deleted the xpu-igpu branch August 14, 2025 04:23
zhangp365 pushed a commit to zhangp365/ComfyUI that referenced this pull request Aug 14, 2025
* Change bf16 check and switch non-blocking to off default with option to force to regain speed on certain classes of iGPUs and refactor xpu check.

* Turn non_blocking off by default for xpu.

* Update README.md for Intel GPUs.
Vander-Bilt pushed a commit to Vander-Bilt/ComfyUI that referenced this pull request Aug 26, 2025
* Change bf16 check and switch non-blocking to off default with option to force to regain speed on certain classes of iGPUs and refactor xpu check.

* Turn non_blocking off by default for xpu.

* Update README.md for Intel GPUs.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants