Conversation

ByronHsu (Collaborator)

Motivation

This series of PRs attempts to decouple the model code from vLLM dependencies. The model code mainly uses four vLLM components:

from vllm.config import CacheConfig
from vllm.distributed import get_tensor_model_parallel_world_size
from vllm.model_executor.layers.rotary_embedding import get_rope
from vllm.model_executor.layers.vocab_parallel_embedding import (
    ParallelLMHead,
    VocabParallelEmbedding,
)

This PR removes the first one, CacheConfig. It is the easiest of the four: radix attention always sets the page size to 1, so the models never need a cache config.
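
For intuition, a minimal sketch of that reasoning. CacheConfigSketch below is an illustrative stand-in, not the real vllm.config.CacheConfig, and its block_size field is an assumption about what such a config carries:

from dataclasses import dataclass

@dataclass
class CacheConfigSketch:
    # Illustrative stand-in (not the real vllm.config.CacheConfig): in
    # vLLM the KV cache is paged, so models receive a config carrying
    # the page (block) size, among other cache settings.
    block_size: int = 16

# SGLang's radix attention manages the KV cache with a fixed page size
# of 1, so there is no paging parameter left for model code to read.
RADIX_ATTENTION_PAGE_SIZE = 1

def needs_cache_config(page_size: int) -> bool:
    # With the page size pinned to 1, nothing remains to configure.
    return page_size != 1

assert not needs_cache_config(RADIX_ATTENTION_PAGE_SIZE)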

Modifications

Remove from vllm.config import CacheConfig from all model files, as sketched below
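
A hedged sketch of what that change typically looks like in a single model file; the class and parameter names here are illustrative assumptions, not the actual diff:

from torch import nn

# Removed by this PR:
# from vllm.config import CacheConfig

class ToyModelForCausalLM(nn.Module):
    # Before: __init__ also accepted cache_config: Optional[CacheConfig] = None
    # and threaded it through to the attention layers. Radix attention never
    # reads it, so both the parameter and the import are dropped.
    def __init__(self, config, quant_config=None) -> None:
        super().__init__()
        self.config = config
        self.quant_config = quant_config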

Checklist

  • Format your code according to the Contributor Guide.
  • Add unit tests as outlined in the Contributor Guide.
  • Update documentation as needed, including docstrings or example tutorials.

@ByronHsu ByronHsu changed the title [de-vLLM 1/N] Remove CacheConfig import in all model files [WIP] [de-vLLM 1/N] Remove CacheConfig import in all model files Oct 13, 2024
@ByronHsu ByronHsu changed the title [WIP] [de-vLLM 1/N] Remove CacheConfig import in all model files [WIP] [1/N] Remove CacheConfig import in all model files Oct 13, 2024
@ByronHsu ByronHsu changed the title [WIP] [1/N] Remove CacheConfig import in all model files [1/N] Remove CacheConfig import in all model files Oct 13, 2024
@ByronHsu ByronHsu requested a review from Ying1123 October 13, 2024 22:39
@zhyncs zhyncs merged commit 56503d9 into sgl-project:main Oct 14, 2024
1 of 10 checks passed
timethink pushed a commit to timethink/sglang that referenced this pull request Mar 9, 2025