
Conversation

adarshxs
Contributor

@adarshxs adarshxs commented Mar 1, 2025

Motivation

Fixes: #3935

Modifications

Add partial rotary embedding support and upgrade to transformers==4.50.0 (a short sketch of the partial-rotary idea follows below)
Also fix Qwen2.5-VL, which breaks when upgrading from transformers==4.48.3 to transformers==4.50.0
Also apply minor fixes to the reference_hf.py script
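For context, here is a minimal sketch of what a partial rotary factor means (illustrative only, not the actual sglang implementation; the function names and the 0.75 factor are assumptions): only the first rotary_dim = head_dim * partial_rotary_factor channels of each query/key head are rotated by RoPE, and the remaining channels pass through unchanged.

import torch

def rotate_half(x: torch.Tensor) -> torch.Tensor:
    # Standard RoPE helper: split the last dim in half and swap with a sign flip.
    x1, x2 = x.chunk(2, dim=-1)
    return torch.cat([-x2, x1], dim=-1)

def apply_partial_rotary(q, k, cos, sin, head_dim=128, partial_rotary_factor=0.75):
    # Only the first `rotary_dim` channels of each head receive rotary embeddings;
    # cos/sin are assumed to be precomputed with last dimension equal to rotary_dim.
    rotary_dim = int(head_dim * partial_rotary_factor)
    q_rot, q_pass = q[..., :rotary_dim], q[..., rotary_dim:]
    k_rot, k_pass = k[..., :rotary_dim], k[..., rotary_dim:]
    q_rot = q_rot * cos + rotate_half(q_rot) * sin
    k_rot = k_rot * cos + rotate_half(k_rot) * sin
    # The untouched channels are concatenated back after the rotated ones.
    return torch.cat([q_rot, q_pass], dim=-1), torch.cat([k_rot, k_pass], dim=-1)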

Checklist

@adarshxs adarshxs changed the title [Bug Fix] Add partial rotary factor support for Phi-4 and support qwen2.5vl with transformers==4.49.0 [Bug Fix] Add partial rotary factor support for Phi-4 and support qwen2.5vl with transformers v4.49.0 Mar 1, 2025
@adarshxs adarshxs changed the title [Bug Fix] Add partial rotary factor support for Phi-4 and support qwen2.5vl with transformers v4.49.0 [Bug Fix] Add partial rotary factor support for Phi-4 and upgrade to transformers v4.49.0 Mar 2, 2025
@adarshxs adarshxs requested a review from HaiShaw as a code owner March 2, 2025 06:15
@adarshxs adarshxs marked this pull request as draft March 4, 2025 10:35
@adarshxs
Contributor Author

@zhaochenyang20 Ready to be reviewed. There are some inconsistencies in the CI accuracy numbers, but it should be good.

Cc @yizhang2077 @mickqian

@zhaochenyang20
Collaborator

@adarshxs Thanks. Yi and I can help rerun the CI. @yizhang2077 could you help review this?

@adarshxs
Contributor Author

adarshxs commented Mar 20, 2025

@zhaochenyang20 @yizhang2077 any update on this?

Collaborator

@yizhang2077 yizhang2077 left a comment

@adarshxs Sorry I am late. Thanks for your work; I've left some comments here~

@yizhang2077
Collaborator

yizhang2077 commented Mar 21, 2025

@adarshxs LGTM. It would be better if you could run the MMMU benchmark and paste the result here: #4456. One of the failed CI tests may be related to Gemma.

@zhaochenyang20 zhaochenyang20 changed the title [Bug Fix] Add partial rotary factor support for Phi-4 and upgrade to transformers v4.49.0 [Bug Fix] Add partial rotary factor support for Phi-4 and upgrade to transformers v4.50.0 Mar 21, 2025
@zhaochenyang20
Collaborator

@adarshxs Great work!!! Do not rebase with main; let me rerun the CI for you.

@zhyncs
Member

zhyncs commented Mar 22, 2025

@adarshxs @zhaochenyang20 @yizhang2077 @mickqian You are great!!

@zhyncs zhyncs merged commit f8f9244 into sgl-project:main Mar 22, 2025
1 of 18 checks passed
@adarshxs adarshxs deleted the phi_4_bug_fix branch March 23, 2025 05:46
@yizhang2077 yizhang2077 mentioned this pull request Mar 25, 2025
# fix: for Qwen2-VL model, inject default 'size' if not provided.
if config.model_type in {"qwen2_vl"}:
    if "size" not in kwargs:
        kwargs["size"] = {"shortest_edge": 3136, "longest_edge": 1003520}
Collaborator

I would like to ask about the intention of injecting the default ‘size’ here for the Qwen2-VL model. I noticed that after Transformers version 4.54.0, this injection no longer works. I’m not sure whether I need to adjust it in order to make it work again.

Contributor Author

@adarshxs adarshxs Aug 14, 2025

As far as I remember, prior to transformers v4.50.0 the Qwen2-VL preprocessor_config.json only contained min_pixels/max_pixels, with no explicit shortest_edge or longest_edge. As a result, loading those models under 4.50.0 would immediately throw: ValueError: size must contain 'shortest_edge' and 'longest_edge' keys.
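For illustration, a hedged sketch of the translation this injection performs (the helper name below is hypothetical, not actual sglang code): the injected defaults correspond to Qwen2-VL's documented pixel limits, 3136 = 56 × 56 for min_pixels and 1003520 = 28 × 28 × 1280 for max_pixels, so a legacy preprocessor_config.json can be mapped onto the size dict that newer transformers versions validate.

def build_size_kwarg(preprocessor_cfg: dict) -> dict:
    # Hypothetical helper: map legacy min_pixels/max_pixels entries onto the
    # shortest_edge/longest_edge keys the newer image processor requires.
    return {
        "shortest_edge": preprocessor_cfg.get("min_pixels", 56 * 56),        # 3136
        "longest_edge": preprocessor_cfg.get("max_pixels", 28 * 28 * 1280),  # 1003520
    }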

Development

Successfully merging this pull request may close these issues.

[Bug] loading phi4-mini-instruct with sglang