-
Notifications
You must be signed in to change notification settings - Fork 2.8k
Closed
Labels
Description
Checklist
- 1. I have searched related issues but cannot get the expected help.
- 2. The bug has not been fixed in the latest version.
- 3. Please note that if the bug-related issue you submitted lacks corresponding environment info and a minimal reproducible demo, it will be challenging for us to reproduce and resolve the issue, reducing the likelihood of receiving feedback.
- 4. If the issue you raised is not a bug but a question, please raise a discussion at https://github.com/sgl-project/sglang/discussions/new/choose Otherwise, it will be closed.
- 5. Please use English, otherwise it will be closed.
Describe the bug
I tried to start sglang to serve a quantized model DeepSeek-R1-Distill-Qwen-32B-Int4-W4A16 ,using docker image lmsysorg/sglang:v0.4.5-cu124 . Then the error occured with log "NameError: name 'VLLM_AVAILABLE' is not defined" . After reading the code, I found that in the file compressed_tensors.py , there is no declaration of VLLM_AVAILABLE
, while in the file compressed_tensors_moe.py , there is the snippet
try:
import vllm
VLLM_AVAILABLE = True
except ImportError:
VLLM_AVAILABLE = False
And this error did not happen when using sglang:v0.4.5-cu124 sglang:v0.4.4-cu124 (fix typo)
Reproduction
model
https://modelscope.cn/models/okwinds/DeepSeek-R1-Distill-Qwen-32B-Int4-W4A16
docker image
lmsysorg/sglang:v0.4.5-cu124
Environment
Ubuntu 22.04
CUDA 12.4 with driver version 530
GPU A800