[ROCm] fix large tensor sort on MI350 #161054

dnikolaev-amd · 2025-08-20T13:48:43Z

Currently std::min -> ::min did not work as expected on ROCm when input values >= 2147483648

Replace std::min to ternary statement
Also std::min can be replaced by explicit typing std::min<int64_t>

fixes on ROCm:
test_sort_and_select.py::TestSortAndSelectCUDA::test_sort_large_cuda_float16
error:
RuntimeError: Cannot sort dimension of length 8192

Similar PR to fix large tensors on ROCm #130994

cc @jeffdaily @sunway513 @jithunnair-amd @pruthvistony @ROCmSupport @dllehr-amd @jataylo @hongxiayang @naromero77amd

pytorch-bot · 2025-08-20T13:48:46Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/161054

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 3ae51f6 with merge base 5ee464d ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Currently std::min -> ::min did not work as expected on ROCm when input values >= 2147483648

jeffdaily · 2025-08-20T17:21:13Z

@pytorchbot merge

pytorchmergebot · 2025-08-20T17:23:55Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

Currently std::min -> ::min did not work as expected on ROCm when input values >= 2147483648 Replace std::min to ternary statement Also std::min can be replaced by explicit typing std::min<int64_t> fixes on ROCm: test_sort_and_select.py::TestSortAndSelectCUDA::test_sort_large_cuda_float16 error: RuntimeError: Cannot sort dimension of length 8192 Combines upstream PRs: - pytorch#161054 to fix std::min on ROCm - pytorch#155546 fix python test - pytorch#159939 change test dtype from int8 to float16 Fixes: SWDEV-526432

pytorch-bot bot added module: rocm AMD GPU support for Pytorch release notes: cuda release notes category labels Aug 20, 2025

pytorchbot added the open source label Aug 20, 2025

dnikolaev-amd force-pushed the fix_large_tensor_sort_on_rocm branch from 0b9b311 to a07fa0c Compare August 20, 2025 14:09

dnikolaev-amd changed the title ~~[ROCm] fix large tensor sort~~ [ROCm] fix large tensor sort on MI350 Aug 20, 2025

fix large tensor sort on ROCm

3ae51f6

Currently std::min -> ::min did not work as expected on ROCm when input values >= 2147483648

dnikolaev-amd force-pushed the fix_large_tensor_sort_on_rocm branch from a07fa0c to 3ae51f6 Compare August 20, 2025 14:13

jeffdaily approved these changes Aug 20, 2025

View reviewed changes

jeffdaily marked this pull request as ready for review August 20, 2025 15:21

jeffdaily requested review from eqy and syed-ahmed as code owners August 20, 2025 15:21

jeffdaily added release notes: rocm mandatorylabel ciflow/rocm Trigger "default" config CI on ROCm ciflow/rocm-mi300 Trigger "default" config CI on ROCm MI300 and removed release notes: cuda release notes category labels Aug 20, 2025

pytorch-bot bot removed ciflow/rocm Trigger "default" config CI on ROCm ciflow/rocm-mi300 Trigger "default" config CI on ROCm MI300 labels Aug 20, 2025

pytorch deleted a comment from pytorch-bot bot Aug 20, 2025

jeffdaily added ciflow/rocm Trigger "default" config CI on ROCm ciflow/rocm-mi300 Trigger "default" config CI on ROCm MI300 labels Aug 20, 2025

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Aug 20, 2025

pytorchmergebot added the merging label Aug 20, 2025

pytorchmergebot closed this in 24e7f3c Aug 20, 2025

pytorchmergebot added Merged and removed merging labels Aug 20, 2025

dnikolaev-amd mentioned this pull request Aug 21, 2025

[rocm7.1_internal_testing] fix large tensor sort on ROCm ROCm/pytorch#2543

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ROCm] fix large tensor sort on MI350 #161054

[ROCm] fix large tensor sort on MI350 #161054

Uh oh!

dnikolaev-amd commented Aug 20, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Aug 20, 2025 •

edited

Loading

Uh oh!

jeffdaily commented Aug 20, 2025

Uh oh!

pytorchmergebot commented Aug 20, 2025

Uh oh!

Uh oh!

[ROCm] fix large tensor sort on MI350 #161054

[ROCm] fix large tensor sort on MI350 #161054

Uh oh!

Conversation

dnikolaev-amd commented Aug 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Aug 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/161054

✅ No Failures

Uh oh!

jeffdaily commented Aug 20, 2025

Uh oh!

pytorchmergebot commented Aug 20, 2025

Merge started

Uh oh!

Uh oh!

dnikolaev-amd commented Aug 20, 2025 •

edited

Loading

pytorch-bot bot commented Aug 20, 2025 •

edited

Loading