kkHuang-amd
Contributor

…improve performance

Changing the launch parameter on the ROCm platform improves the geomean of median E2E latency by 1-2%.
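
For readers unfamiliar with the metric, here is a minimal sketch of how a "geomean of median E2E latency" figure could be computed and compared. It is not the benchmark script used for this PR. The function names, workload names, and latency values are all made up for illustration.

```python
from statistics import geometric_mean, median


def geomean_of_medians(runs_by_workload):
    """Take the median latency of each workload, then the geometric mean of those medians."""
    medians = [median(runs) for runs in runs_by_workload.values()]
    return geometric_mean(medians)


def improvement_pct(baseline, candidate):
    """Relative latency reduction in percent (positive means the candidate is faster)."""
    return (baseline - candidate) / baseline * 100.0


# Hypothetical E2E latencies in ms; NOT measurements from this PR.
baseline_runs = {
    "workload_a": [100.0, 102.0, 101.0],
    "workload_b": [200.0, 198.0, 202.0],
}
candidate_runs = {
    "workload_a": [99.0, 100.0, 98.0],
    "workload_b": [196.0, 197.0, 195.0],
}

base = geomean_of_medians(baseline_runs)
cand = geomean_of_medians(candidate_runs)
pct = improvement_pct(base, cand)
print(f"geomean of medians: {base:.2f} ms -> {cand:.2f} ms ({pct:.2f}% improvement)")
```

The geometric mean keeps a single slow workload from dominating the summary, which is presumably why it is used to aggregate across workloads.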

Motivation

As described above.

Modifications

Checklist

  • [+] Format your code according to the Contributor Guide.
  • [+] Add unit tests as outlined in the Contributor Guide.
  • [+] Update documentation as needed, including docstrings or example tutorials.

@kkHuang-amd
Contributor Author

@HaiShaw

Please help review this PR.

Collaborator

@HaiShaw HaiShaw left a comment

LGTM.

@HaiShaw HaiShaw merged commit 70dc2fb into sgl-project:main Dec 27, 2024
15 checks passed
timethink pushed a commit to timethink/sglang that referenced this pull request Mar 9, 2025
sgl-project#2610)

Co-authored-by: wunhuang <wunhuang@amd.com>
Co-authored-by: HAI <hixiao@gmail.com>

2 participants