[feat] fix some numel and refactor torch binding #4867
Conversation
This PR does three things:
- Fix some `numel` computations.
- Refactor the torch bindings, from:
  `m.def("all_reduce(int fa, Tensor inp, Tensor! out) -> ()"); m.impl("all_reduce", torch::kCUDA, &all_reduce);`
  to:
  `m.def("all_reduce", all_reduce);`
- Simplify the kernel definition logic.
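The simplified registration leans on PyTorch's schema inference: when `m.def` receives a typed C++ function instead of a schema string, the dispatcher derives the schema from the function signature, so the separate schema string and per-backend `m.impl` call become unnecessary. A minimal sketch of the idea; the `all_reduce` body here is a placeholder, not the actual kernel from `csrc/allreduce`:

```cpp
#include <torch/library.h>

// Placeholder mirroring the schema "(int fa, Tensor inp, Tensor! out) -> ()".
// The real implementation launches a CUDA all-reduce kernel.
void all_reduce(int64_t fa, at::Tensor inp, at::Tensor out) {
  // ... kernel launch elided ...
}

TORCH_LIBRARY(sgl_kernel, m) {
  // Before: explicit schema string plus a per-backend registration:
  //   m.def("all_reduce(int fa, Tensor inp, Tensor! out) -> ()");
  //   m.impl("all_reduce", torch::kCUDA, &all_reduce);
  // After: one call; the schema is inferred from the C++ signature and the
  // function is registered as a catch-all kernel for every backend.
  m.def("all_reduce", all_reduce);
}
```

One trade-off of the catch-all form: the function is no longer bound to `torch::kCUDA` specifically, which is acceptable for kernels that already assume CUDA tensors internally.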
@@ -4,9 +4,9 @@
[submodule "sgl-kernel/3rdparty/cccl"]
We no longer need to update git modules because we have switched to CMake.
GIT_REPOSITORY https://github.com/flashinfer-ai/flashinfer
GIT_TAG 79fd1ae90d9b8098ca70dec6071da96f3f6da7b9
GIT_REPOSITORY https://github.com/sgl-project/flashinfer
GIT_TAG 2b9f16eb79bd344e31725e8d7a92fe7fe980ffdf
Please use a branch instead.
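The `GIT_REPOSITORY`/`GIT_TAG` pair in the diff is CMake dependency-fetching syntax; a hedged sketch of how such a pin might look, assuming the project uses the `FetchContent` module (the declaration name `flashinfer` is illustrative):

```cmake
include(FetchContent)

# Pin flashinfer to a specific commit; per the review comment,
# GIT_TAG can also name a branch instead of a raw SHA.
FetchContent_Declare(
  flashinfer
  GIT_REPOSITORY https://github.com/sgl-project/flashinfer
  GIT_TAG        2b9f16eb79bd344e31725e8d7a92fe7fe980ffdf
)
FetchContent_MakeAvailable(flashinfer)
```

Pinning to a SHA gives reproducible builds, while a branch tag picks up upstream fixes automatically; the reviewer here prefers the latter.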
@@ -22,181 +22,67 @@ TORCH_LIBRARY_EXPAND(sgl_kernel, m) {
/*
 * From csrc/allreduce
 */
m.def(
revert this first in #4871 to unblock the new release
Motivation
Fix some `numel` computations and refactor the torch bindings.
Modifications
Checklist