Add vLLM transformers backend to online methods #3773

Merged
merged 11 commits into huggingface:main on Jul 30, 2025

Conversation

merveenoyan
Contributor

@merveenoyan merveenoyan commented Jul 25, 2025

Add the vLLM transformers backend to the online methods GRPO and Online DPO.
This has two limitations:

  • The transformers backend for vLLM VLMs is only on vLLM main and requires a release; in the meantime, install vLLM from source with `VLLM_USE_PRECOMPILED=1 uv pip install -e .`
  • Server + eager works, but for some reason colocate doesn't. I could swear it was working before I merged some changes from main. Edit: colocate works on a single GPU; even though I merged the NCCL-related changes, there still seems to be an issue with the multi-GPU setup.

I will look into these issues. You can test with

CUDA_DEVICE_ORDER=PCI_BUS_ID CUDA_VISIBLE_DEVICES=0 python3 examples/scripts/grpo_vlm.py \
    --model_name_or_path Qwen/Qwen2.5-VL-3B-Instruct \
    --output_dir grpo-qwen25 \
    --learning_rate 1e-5 \
    --torch_dtype bfloat16 \
    --max_prompt_length 512 \
    --max_completion_length 512 \
    --per_device_train_batch_size 2 \
    --gradient_accumulation_steps 2 \
    --num_generations 2 \
    --bf16 True \
    --lora_target_modules q_proj v_proj \
    --log_completions \
    --use_vllm \
    --vllm_mode colocate \
    --vllm_model_impl transformers

while serving with

CUDA_DEVICE_ORDER=PCI_BUS_ID CUDA_VISIBLE_DEVICES=1 trl vllm-serve --model Qwen/Qwen2.5-VL-3B-Instruct --tensor-parallel-size 1 --port 8000 --enforce_eager --vllm_model_impl transformers
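For reference, here is a minimal Python sketch of the colocate setup above using `GRPOConfig`/`GRPOTrainer` directly. It is not the PR's example script: the dataset and reward function below are placeholders (the actual `grpo_vlm.py` uses an image-text dataset), and `vllm_model_impl` is the option added in this PR.

```python
# Minimal sketch mirroring the CLI flags above; dataset and reward are
# placeholders for illustration only.
from datasets import load_dataset
from trl import GRPOConfig, GRPOTrainer

def reward_len(completions, **kwargs):
    # Toy reward: prefer completions close to 100 characters.
    return [-abs(100 - len(c)) for c in completions]

dataset = load_dataset("trl-lib/tldr", split="train")

training_args = GRPOConfig(
    output_dir="grpo-qwen25",
    learning_rate=1e-5,
    bf16=True,
    max_prompt_length=512,
    max_completion_length=512,
    per_device_train_batch_size=2,
    gradient_accumulation_steps=2,
    num_generations=2,
    use_vllm=True,
    vllm_mode="colocate",            # or "server" when generating through `trl vllm-serve`
    vllm_model_impl="transformers",  # the option added in this PR
)

trainer = GRPOTrainer(
    model="Qwen/Qwen2.5-VL-3B-Instruct",
    reward_funcs=reward_len,
    args=training_args,
    train_dataset=dataset,
)
trainer.train()
```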

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@merveenoyan
Contributor Author

@qgallouedec @kashif this works; apparently my compiled dev build of vLLM was causing all the issues, and updating to v1 solved it!

@@ -393,6 +393,14 @@ class GRPOConfig(TrainingArguments):
"contention with training."
},
)
vllm_model_impl: str = field(
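The hunk is truncated here. For readers, a plausible reconstruction of the new field, following the `field(default=..., metadata={"help": ...})` pattern visible in the surrounding lines; the default value and help text are assumptions, not the merged code.

```python
# Sketch only: default and help text are assumptions, shown on a stand-in
# dataclass rather than the real GRPOConfig.
from dataclasses import dataclass, field

@dataclass
class GRPOConfigSketch:
    vllm_model_impl: str = field(
        default="vllm",
        metadata={
            "help": "Model implementation to use for vLLM generation: 'transformers' to use the "
            "transformers backend, or 'vllm' to use vLLM's native model implementation."
        },
    )
```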
Collaborator

Can you also add these docs to the docstrings further up in the file?

@@ -292,6 +292,14 @@ class ScriptArguments:
"'trace'."
},
)
vllm_model_impl: str = field(
Member

It's missing in the docstring above

@@ -164,6 +164,14 @@ class may differ from those in [`~transformers.TrainingArguments`].
"(`pip install vllm`)."
},
)
vllm_model_impl: str = field(
Member

same


## vLLM with Transformers Backend

vLLM now supports a transformers backend for model implementations. Passing `transformers` as `vllm_model_impl` in the configuration or through the argument parser selects the transformers backend. See the example below.
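An example along these lines (a minimal sketch; the exact snippet in the docs may differ, and the surrounding GRPO arguments are omitted):

```python
from trl import GRPOConfig

training_args = GRPOConfig(
    output_dir="my-grpo-run",        # plus your usual GRPO arguments
    use_vllm=True,
    vllm_model_impl="transformers",  # use the transformers backend instead of vLLM's native one
)
```

When generating through a separate server, the same option can be passed to `trl vllm-serve` via `--vllm_model_impl transformers`.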
Member

Does it support VLMs? Any limitations?
Additionally, maybe link the blog post: https://blog.vllm.ai/2025/04/11/transformers-backend.html
(in case these points can be added in 1-2 sentences)

@merveenoyan
Contributor Author

@kashif can you merge 🙏🏻

@kashif kashif merged commit 90c7876 into huggingface:main Jul 30, 2025
10 checks passed
4 participants