[Bug] v0.4.5 NameError: name 'VLLM_AVAILABLE' is not defined from compressed_tensors.py #5443

### Checklist

- [x] 1. I have searched related issues but cannot get the expected help.
- [x] 2. The bug has not been fixed in the latest version.
- [x] 3. Please note that if the bug-related issue you submitted lacks corresponding environment info and a minimal reproducible demo, it will be challenging for us to reproduce and resolve the issue, reducing the likelihood of receiving feedback.
- [x] 4. If the issue you raised is not a bug but a question, please raise a discussion at https://github.com/sgl-project/sglang/discussions/new/choose Otherwise, it will be closed.
- [x] 5. Please use English, otherwise it will be closed.

### Describe the bug

I tried to start sglang to serve the quantized model [DeepSeek-R1-Distill-Qwen-32B-Int4-W4A16](https://modelscope.cn/models/okwinds/DeepSeek-R1-Distill-Qwen-32B-Int4-W4A16) using the Docker image `lmsysorg/sglang:v0.4.5-cu124`. The error then occurred, with the log `NameError: name 'VLLM_AVAILABLE' is not defined`. After reading the code, I found that [compressed_tensors.py](https://github.com/sgl-project/sglang/blob/main/python/sglang/srt/layers/quantization/compressed_tensors/compressed_tensors.py) has no declaration of `VLLM_AVAILABLE`, while [compressed_tensors_moe.py](https://github.com/sgl-project/sglang/blob/main/python/sglang/srt/layers/quantization/compressed_tensors/compressed_tensors_moe.py) contains this snippet:
```python
try:
    import vllm

    VLLM_AVAILABLE = True
except ImportError:
    VLLM_AVAILABLE = False

```
This error does not occur with `sglang:v0.4.4-cu124`.
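
To illustrate the failure mode described above, here is a minimal sketch. It is not the actual sglang code or fix. It uses `importlib.util.find_spec` instead of the `try`/`import` form in the snippet, and the helper names `select_scheme_backend` and `read_missing_flag`, as well as `VLLM_AVAILABLE_NOT_DEFINED_HERE`, are hypothetical. The point is that a module-level flag is only visible in the module that defines (or imports) it, and reading an unbound name fails at call time rather than at import time.

```python
import importlib.util

# Defined at module level in the same file that reads it. A flag defined
# only in a sibling module is not visible here unless it is imported.
VLLM_AVAILABLE = importlib.util.find_spec("vllm") is not None


def select_scheme_backend() -> str:
    """Hypothetical helper: pick a code path based on the flag."""
    return "vllm" if VLLM_AVAILABLE else "fallback"


def read_missing_flag() -> bool:
    """Mirrors the reported failure: the name is never bound in this module."""
    # Hypothetical name; Python raises NameError only when this line runs.
    return VLLM_AVAILABLE_NOT_DEFINED_HERE


if __name__ == "__main__":
    print(select_scheme_backend())
    try:
        read_missing_flag()
    except NameError as exc:
        print(f"NameError: {exc}")
```

This may explain why the problem only shows up when a compressed-tensors model is loaded: the module can be imported without error, and the `NameError` is raised only once the code path that reads the flag is executed.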



### Reproduction

- Model: https://modelscope.cn/models/okwinds/DeepSeek-R1-Distill-Qwen-32B-Int4-W4A16
- Docker image: `lmsysorg/sglang:v0.4.5-cu124`

### Environment

- OS: Ubuntu 22.04
- CUDA: 12.4 with driver version 530
- GPU: A800
