
Conversation

@chewong (Collaborator) commented Aug 7, 2025

Reason for Change:

Upgrade vLLM and Transformers to the latest stable versions as a prerequisite to #1354.

  • Transformers 4.55.0 conflicts with vLLM 0.9.0, so we are forced to upgrade vLLM to 0.10.0. We will need a follow-up PR to upgrade vLLM to 0.10.1 anyway, which starts supporting the models in #1354 (Add gpt-oss-20b and gpt-oss-120b as KAITO Presets).
  • Added "max_new_tokens": None to generate_kwargs in presets/workspace/inference/text-generation/tests/test_inference_api.py, since max_new_tokens (default 200) takes precedence over max_length, causing the test to generate more tokens than it is supposed to; see the sketch after this list.
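
A minimal sketch of that test change, assuming the test builds a `generate_kwargs` dictionary that is passed through to Hugging Face `generate()`; the surrounding test code and the `max_length` value here are hypothetical:

```python
# Hypothetical excerpt in the spirit of test_inference_api.py; the
# max_length value is illustrative, not the actual test configuration.
generate_kwargs = {
    "max_length": 128,       # the total-length cap the test wants enforced
    "max_new_tokens": None,  # explicitly unset so the default (200) cannot
                             # take precedence over max_length
}
```

In Hugging Face generation, a non-`None` `max_new_tokens` always wins over `max_length`, so passing `None` explicitly restores the `max_length` bound.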

TODO after merging this PR:

  • Deprecate T4 SKUs, which are not supported by vLLM V1

TODO when upgrading vLLM to 0.11.0 (the v0 engine will be completely removed):

  • Deprecate and remove phi-2, which is not supported by vLLM V1

Requirements

  • Added unit tests and e2e tests (if applicable).

Issue Fixed:

Notes for Reviewers:

@chewong (Collaborator, Author) commented Aug 7, 2025

This PR got flagged by https://github.com/kaito-project/kaito/actions/runs/16813076971/job/47623004035?pr=1373 for using torch==2.7.1 due to GHSA-887c-mr87-cxwp, but we can't upgrade torch to 2.8.0 until vLLM upgrades it as well.

@chewong (Collaborator, Author) commented Aug 7, 2025

Hold until we release 0.6.0

chewong added 2 commits on August 8, 2025, 09:38 (both Signed-off-by: Ernest Wong <chwong719@gmail.com>)
@chewong (Collaborator, Author) commented Aug 8, 2025

Closing in favor of #1378

@chewong closed this on Aug 8, 2025.