-
-
Notifications
You must be signed in to change notification settings - Fork 9.8k
[CI] add kvcache-connector dependency definition and add into CI build #18193
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can either: Add 🚀 |
4f33c86
to
5a4aeab
Compare
@YuhanLiu11 What's your suggestion ? |
@panpan0000 this is a great idea! Also, now LMCache has bdist package and it can be easily installed without extra compilation. |
Yeah agree --- and we can contribute some perf CIs for CPU offloading and KV cache sharing. @youkaichao @simon-mo @DarkLight1337 what's your thoughts on this? |
So how to push thing forward? |
I am pinging people for feedbacks. Could you please fix the merge conflict first so that we can start the full CI? |
5a4aeab
to
3bbaded
Compare
Thanks for reminder. done. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
I am running some test now to make sure compatibility issues and other things, will get back to you soon. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit
s/INSTALL_KV_CONNECTOR/INSTALL_KV_CONNECTORS
3bbaded
to
76d6761
Compare
done :-) Thanks @simon-mo |
I tested this PR and it is functional. |
@panpan0000 I enabled automerge, could you please fix the linting errors? Many thanks for your contribution! |
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
Head branch was pushed to by a user without write access
76d6761
to
5cee0d5
Compare
vllm-project#18193) Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
vllm-project#18193) Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
vllm-project#18193) Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
vllm-project#18193) Signed-off-by: Peter Pan <Peter.Pan@daocloud.io> Signed-off-by: Patrick von Platen <patrick.v.platen@gmail.com>
vllm-project#18193) Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
vllm-project#18193) Signed-off-by: Peter Pan <Peter.Pan@daocloud.io> Signed-off-by: avigny <47987522+avigny@users.noreply.github.com>
vllm-project#18193) Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
vllm-project#18193) Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
vllm-project#18193) Signed-off-by: Peter Pan <Peter.Pan@daocloud.io> Signed-off-by: Jinzhen Lin <linjinzhen@hotmail.com>
vllm-project#18193) Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
FIX #NA (link existing issues this PR will resolve)
As KVCache-Sharing/Offloading become more essential in production, but the current default image vllm/vllm-openai:v0.8.* doesn't include the kv-connector dependencies, for example , when run vllm image with
--kv-transfer-config '{"kv_connector":"LMCacheConnector","kv_role":"kv_both","kv_rank":0,"kv_parallel_size":2}'
So introducing a new requirements.txt for kv-connector(put
lmcache
for now, mooncake & nixl later)And I enable kv-connector into dockerfile in CI scripts, I'm not sure if it's proper or maybe we should do it in vllm-production-stack project? Please correct me.
since we don't have a up to date CI image for vllm+lmcache, the image
lmcache/vllm-openai:latest
seems still using old vllm(v0.7.0) and image not build from a trackable github CI. @KuntaiDu , any suggestion?