[ROCm] Add additional block quant GEMM tuning configs for AMD GPUs. #3616

whchung · 2025-02-16T22:53:10Z

Modifications

Add additional block quant GEMM tuning configs for AMD GPUs.

Checklist

Format your code according to the Code Formatting with Pre-Commit.

HaiShaw

LG

yiakwy-xpu-ml-framework-team · 2025-02-17T04:00:43Z

Hi @whchung, do we have a profiling comparison? I am really interested in the choice of "BLOCK_SIZE_N" between 16 and 64.

Last year we had a paper that fully studied this parameter choice. The study shows that the best values are typically 16, 64, or 128, which is closely related to memory transaction bandwidth.
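
To make the question concrete, here is a rough sketch of how the width of a tile row relates to memory transactions. It assumes 1-byte (FP8) elements and a hypothetical 64-byte transaction size; neither value is stated in this thread, and the helper names are illustrative only, not part of the PR or of any library.

```python
# Illustrative arithmetic only. ELEMENT_BYTES and TRANSACTION_BYTES are
# assumptions, not values taken from this PR or from AMD documentation.
ELEMENT_BYTES = 1        # assumed: FP8 operands
TRANSACTION_BYTES = 64   # assumed: hypothetical memory transaction size


def tile_row_bytes(block_size_n: int, element_bytes: int = ELEMENT_BYTES) -> int:
    """Bytes covered by one contiguous row of a BLOCK_SIZE_N-wide tile."""
    return block_size_n * element_bytes


def transactions_per_row(block_size_n: int,
                         transaction_bytes: int = TRANSACTION_BYTES,
                         element_bytes: int = ELEMENT_BYTES) -> int:
    """Number of transactions needed to fetch one tile row (ceiling division)."""
    row = tile_row_bytes(block_size_n, element_bytes)
    return (row + transaction_bytes - 1) // transaction_bytes


def transaction_utilization(block_size_n: int,
                            transaction_bytes: int = TRANSACTION_BYTES,
                            element_bytes: int = ELEMENT_BYTES) -> float:
    """Fraction of the fetched bytes that the tile row actually uses."""
    row = tile_row_bytes(block_size_n, element_bytes)
    fetched = transactions_per_row(block_size_n, transaction_bytes, element_bytes) * transaction_bytes
    return row / fetched


if __name__ == "__main__":
    for n in (16, 64, 128):
        print(n, tile_row_bytes(n), transactions_per_row(n), transaction_utilization(n))
```

Under these assumptions a narrow tile uses only part of each transaction, while wider tiles fill it. Whether that translates into a real speedup on a given GPU still needs the profiling comparison asked for above.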

Add additional block quant GEMM configs for AMD GPUs.

6e4a33b

whchung requested review from merrymercy, Ying1123, zhyncs and ispobock as code owners February 16, 2025 22:53

HaiShaw approved these changes Feb 17, 2025


Merge branch 'main' into whchung/amd_additional_tuning

ef23a33

HaiShaw enabled auto-merge (squash) February 17, 2025 06:53

HaiShaw self-requested a review February 17, 2025 06:55

HaiShaw approved these changes Feb 17, 2025


HaiShaw disabled auto-merge February 17, 2025 06:56

HaiShaw enabled auto-merge (squash) February 17, 2025 06:57

HaiShaw added 4 commits February 16, 2025 23:38

Merge branch 'main' into whchung/amd_additional_tuning

1415b30

Merge branch 'main' into whchung/amd_additional_tuning

4a79781

Merge branch 'main' into whchung/amd_additional_tuning

754fa87

Merge branch 'main' into whchung/amd_additional_tuning

2a759ae

saienduri disabled auto-merge February 17, 2025 23:54

saienduri merged commit 2eab113 into sgl-project:main Feb 17, 2025
3 of 18 checks passed
