Initial AMD MI300X Support via AITER #10
Conversation
Thanks a lot for your PR! I have left some comments. LMK if they are unclear or don't make sense.
@sayakpaul thanks a lot for the review, and thanks to you and @jbschlosser for the hard work making this repo ^^
Thanks for your hard work. I will let @jbschlosser have his say here, too.
Meanwhile, feel free to update this part of the README (that we now support AMD integration, thanks to your PR).
Done!
Awesome, thanks for the contribution! I may have missed this, but how do the speedups look on the AMD hardware?
There were definitely some speedups when going from the baseline to torch.compile + fp8: I saw about a 66-70% speedup. torch.export gives an even bigger speedup, but it's not working correctly yet. Will have to debug that.
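For reference, a "66-70% speedup" of the kind quoted above is usually computed as the relative latency improvement. A tiny helper (illustrative only, not part of this PR; the example latencies are hypothetical, not measured numbers) makes the arithmetic explicit:

```python
def percent_speedup(baseline_s: float, optimized_s: float) -> float:
    """Percent speedup going from a baseline latency to an optimized latency.

    Under this definition, cutting per-image time from 10 s to 6 s
    is a ~66.7% speedup.
    """
    return (baseline_s / optimized_s - 1.0) * 100.0


# Hypothetical per-image latencies, purely to show the formula.
print(round(percent_speedup(10.0, 6.0), 1))  # -> 66.7
```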
Adds AITER support when running on AMD's MI300X to enable fp8 inference.
Command:

```shell
python gen_image.py --prompt "A cat playing with a ball of yarn" --output-file output.png --compile_export_mode compile
```
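The flag handling implied by that command might look something like the sketch below. This is a hypothetical reconstruction inferred from the command line, not the actual `gen_image.py`; the flag names and choices are assumptions.

```python
import argparse


def parse_args(argv=None):
    # Hypothetical sketch of gen_image.py's CLI, inferred from the command above.
    p = argparse.ArgumentParser(description="Generate an image (sketch)")
    p.add_argument("--prompt", required=True, help="text prompt for generation")
    p.add_argument("--output-file", default="output.png", help="where to save the image")
    p.add_argument(
        "--compile_export_mode",
        choices=["compile", "export"],
        default="compile",  # assumed default; 'export' is not yet working per the PR note
        help="which optimization path to use",
    )
    return p.parse_args(argv)


args = parse_args(
    ["--prompt", "A cat playing with a ball of yarn", "--compile_export_mode", "compile"]
)
print(args.compile_export_mode)  # -> compile
```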
NOTE: `torch.export` isn't working correctly. It needs to be debugged before it will run properly. For now, please use `--compile_export_mode compile` as shown in the above command.

Baseline (taken from README.md):

NVIDIA fully-optimized w/ quantization (taken from README.md):

AMD MI300X w/ torch.compile w/ quantization:
