Skip to content

Conversation

merrymercy
Copy link
Contributor

No description provided.

@merrymercy merrymercy merged commit 70359bf into main Jan 16, 2024
@merrymercy merrymercy deleted the benchmark branch January 16, 2024 00:13
CatherineSue added a commit to CatherineSue/sglang that referenced this pull request Mar 4, 2025
…rver

Merge in GENAICORE/sglang from chang/request-id to features-based-on-v0.4.2.post1

Squashed commit of the following:

commit 9b3b1488feea6f5b310570a3060cae3e2786fc38
Author: Chang Su <chang.s.su@oracle.com>
Date:   Wed Feb 19 13:35:33 2025 -0800

    [Log] Support request_id in OpenAI API server

    - Add `request_id` to ChatCompletionRequest and EmbeddingRequest
    - Pass request_ids to GenerateReqInput and EmbeedingReqInput
timethink pushed a commit to timethink/sglang that referenced this pull request Mar 9, 2025
chunyuan-w added a commit to chunyuan-w/sglang that referenced this pull request Mar 12, 2025
* Use fused_experts_cpu and add weight packing

* add check on whether AMX is supported

* move utils to cpu_utils.py

* address comment

* no need to pass in is_vnni since it's True by default; change inplace to True

* refactor prepack_weight_if_needed

* Only import sgl_kernel.cpu once
chunyuan-w added a commit to chunyuan-w/sglang that referenced this pull request Mar 14, 2025
* Use fused_experts_cpu and add weight packing

* add check on whether AMX is supported

* move utils to cpu_utils.py

* address comment

* no need to pass in is_vnni since it's True by default; change inplace to True

* refactor prepack_weight_if_needed

* Only import sgl_kernel.cpu once
chunyuan-w added a commit to chunyuan-w/sglang that referenced this pull request Mar 14, 2025
* Use fused_experts_cpu and add weight packing

* add check on whether AMX is supported

* move utils to cpu_utils.py

* address comment

* no need to pass in is_vnni since it's True by default; change inplace to True

* refactor prepack_weight_if_needed

* Only import sgl_kernel.cpu once
chunyuan-w added a commit to chunyuan-w/sglang that referenced this pull request Mar 14, 2025
* Use fused_experts_cpu and add weight packing

* add check on whether AMX is supported

* move utils to cpu_utils.py

* address comment

* no need to pass in is_vnni since it's True by default; change inplace to True

* refactor prepack_weight_if_needed

* Only import sgl_kernel.cpu once
ch-wan pushed a commit to ch-wan/sglang that referenced this pull request Apr 25, 2025
NorthmanPKU pushed a commit to NorthmanPKU/sglang that referenced this pull request May 16, 2025
chunyuan-w added a commit to chunyuan-w/sglang that referenced this pull request May 28, 2025
* Use fused_experts_cpu and add weight packing

* add check on whether AMX is supported

* move utils to cpu_utils.py

* address comment

* no need to pass in is_vnni since it's True by default; change inplace to True

* refactor prepack_weight_if_needed

* Only import sgl_kernel.cpu once
chunyuan-w added a commit to chunyuan-w/sglang that referenced this pull request May 28, 2025
* Use fused_experts_cpu and add weight packing

* add check on whether AMX is supported

* move utils to cpu_utils.py

* address comment

* no need to pass in is_vnni since it's True by default; change inplace to True

* refactor prepack_weight_if_needed

* Only import sgl_kernel.cpu once
chunyuan-w added a commit to chunyuan-w/sglang that referenced this pull request Jun 3, 2025
* Use fused_experts_cpu and add weight packing

* add check on whether AMX is supported

* move utils to cpu_utils.py

* address comment

* no need to pass in is_vnni since it's True by default; change inplace to True

* refactor prepack_weight_if_needed

* Only import sgl_kernel.cpu once
chunyuan-w added a commit to chunyuan-w/sglang that referenced this pull request Jun 6, 2025
* Use fused_experts_cpu and add weight packing

* add check on whether AMX is supported

* move utils to cpu_utils.py

* address comment

* no need to pass in is_vnni since it's True by default; change inplace to True

* refactor prepack_weight_if_needed

* Only import sgl_kernel.cpu once
pengxin99 pushed a commit to pengxin99/sglang that referenced this pull request Jun 19, 2025
sleepcoo pushed a commit to shuaills/sglang that referenced this pull request Jun 24, 2025
yichiche pushed a commit to yichiche/sglang that referenced this pull request Jul 7, 2025
* align shapes

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>

* fix

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>

---------

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>
pi314ever pushed a commit to pi314ever/sglang that referenced this pull request Jul 10, 2025
…ject#8)

Signed-off-by: Rahul Vijayaraghavan <rahul.vijayaraghavan@intel.com>
siuhunh pushed a commit to xing-wenjin/sglang that referenced this pull request Jul 21, 2025
yichiche pushed a commit to yichiche/sglang that referenced this pull request Jul 23, 2025
* align shapes

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>

* fix

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>

---------

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>
yichiche pushed a commit to yichiche/sglang that referenced this pull request Jul 25, 2025
* align shapes

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>

* fix

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>

---------

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>
yichiche pushed a commit to yichiche/sglang that referenced this pull request Jul 30, 2025
* align shapes

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>

* fix

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>

---------

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>
yichiche pushed a commit to yichiche/sglang that referenced this pull request Aug 1, 2025
* align shapes

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>

* fix

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>

---------

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>
yichiche pushed a commit to yichiche/sglang that referenced this pull request Aug 6, 2025
* align shapes

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>

* fix

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>

---------

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>
yichiche pushed a commit to yichiche/sglang that referenced this pull request Aug 7, 2025
* align shapes

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>

* fix

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>

---------

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>
yichiche pushed a commit to yichiche/sglang that referenced this pull request Aug 11, 2025
* align shapes

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>

* fix

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>

---------

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>
yichiche pushed a commit to yichiche/sglang that referenced this pull request Aug 11, 2025
* align shapes

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>

* fix

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>

---------

Signed-off-by: Ivan Butygin <ivan.butygin@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant