Skip to content

Conversation

ch-wan
Copy link
Collaborator

@ch-wan ch-wan commented Jul 31, 2025

Motivation

Dependency:

Modifications

Accuracy Test

python3 -m sglang.launch_server --model-path /dev/shm/GLM-4.5-Air-FP8 --trust-remote-code --tp 4 --enable-ep-moe --base-gpu-id 4 --ep-size 2
python3 few_shot_gsm8k.py

Accuracy: 93.5%

Benchmark & Profiling

Checklist

@zhyncs zhyncs merged commit 7a1f7fc into sgl-project:main Jul 31, 2025
2 of 54 checks passed
@trevor-m
Copy link
Collaborator

This is also causing accuracy issues for FP4 moe path

huangzhilin-hzl pushed a commit to huangzhilin-hzl/sglang that referenced this pull request Aug 1, 2025
MahmoudAshraf97 pushed a commit to MahmoudAshraf97/sglang that referenced this pull request Aug 1, 2025
TianQiLin666666 pushed a commit to TianQiLin666666/sglang that referenced this pull request Aug 1, 2025
lifuhuang pushed a commit that referenced this pull request Aug 3, 2025
ShangmingCai pushed a commit that referenced this pull request Aug 5, 2025
ShangmingCai pushed a commit that referenced this pull request Aug 5, 2025
narutolhy pushed a commit to narutolhy/sglang that referenced this pull request Aug 17, 2025
narutolhy pushed a commit to narutolhy/sglang that referenced this pull request Aug 18, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants