[feat] Add math eval to CI #2652

XiaotongJiang · 2024-12-30T06:42:38Z

Compared SGL and vLLM's math eval score to address concerns on SGL being constant low on this eval (res linked below)
Add math eval to CI

Setup:

Prompt: simple eval from openai
Grading Model: gemini-1.5-flash
Temperature 0.0

inference engine	Model	# Examples	score	model official score
vllm	llama 3.2 1b	500	0.248	0.306
sgl	llama 3.2 1b	500	0.248	0.306
vllm	llama 3.2 3b	500	0.476	0.48
sgl	llama 3.2 3b	500	0.484	0.48
vllm	llama 3.1 8b	500	0.50	0.519
sgl	llama 3.1 8b	500	0.496	0.519
sgl	llama 3.1 8b	65	0.492	0.519

zhyncs · 2024-12-30T06:49:49Z

Thanks!!

zhaochenyang20 · 2024-12-30T06:59:26Z

@XiaotongJiang @zhyncs @merrymercy @shuaills I am talking with XIaotong and will take a look on this. Also, thanks so much. this having been suspended for so long 😂

This reverts commit a11f8d5.

XiaotongJiang · 2024-12-30T07:11:10Z

sorry for the disruption

AssertionError: 0.505 not greater than or equal to 0.509

need to loose this assertion a bit

Add math eval to CI

9fc7014

XiaotongJiang requested review from merrymercy, Ying1123 and zhyncs as code owners December 30, 2024 06:42

XiaotongJiang changed the title ~~Add math eval to CI~~ [Issue 2504] Add math eval to CI Dec 30, 2024

zhyncs changed the title ~~[Issue 2504] Add math eval to CI~~ [feat] Add math eval to CI Dec 30, 2024

zhyncs approved these changes Dec 30, 2024

View reviewed changes

zhyncs merged commit a11f8d5 into sgl-project:main Dec 30, 2024

merrymercy added a commit that referenced this pull request Dec 30, 2024

Revert "[feat] Add math eval to CI (#2652)"

1deafae

This reverts commit a11f8d5.

merrymercy mentioned this pull request Dec 30, 2024

Revert "[feat] Add math eval to CI" #2656

Merged

XiaotongJiang mentioned this pull request Dec 30, 2024

[feat] Add math eval to CI nightly run #2663

Merged

XiaotongJiang added a commit to XiaotongJiang/sglang that referenced this pull request Jan 3, 2025

[feat] Add math eval to CI (sgl-project#2652)

15fdc6e

timethink pushed a commit to timethink/sglang that referenced this pull request Mar 9, 2025

[feat] Add math eval to CI (sgl-project#2652)

1d1d202

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[feat] Add math eval to CI #2652

[feat] Add math eval to CI #2652

Uh oh!

XiaotongJiang commented Dec 30, 2024 •

edited

Loading

Uh oh!

zhyncs commented Dec 30, 2024

Uh oh!

zhaochenyang20 commented Dec 30, 2024

Uh oh!

XiaotongJiang commented Dec 30, 2024 •

edited

Loading

Uh oh!

Uh oh!

[feat] Add math eval to CI #2652

[feat] Add math eval to CI #2652

Uh oh!

Conversation

XiaotongJiang commented Dec 30, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

zhyncs commented Dec 30, 2024

Uh oh!

zhaochenyang20 commented Dec 30, 2024

Uh oh!

XiaotongJiang commented Dec 30, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

XiaotongJiang commented Dec 30, 2024 •

edited

Loading

XiaotongJiang commented Dec 30, 2024 •

edited

Loading