-
Notifications
You must be signed in to change notification settings - Fork 2.8k
[eagle2] fix end check when target model verify #2723
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
@yukavio can you help review this? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice catch! Can you address the comments?
- How is the performance slowdown due to this change?
- Can you provide a unit test to prevent this from happening in the future?
- Can you fix the lint error? https://sgl-project.github.io/references/contribution_guide.html#code-formatting-with-pre-commit
9676e35
to
7eae79f
Compare
|
The newly added test fails. Please fix it and make sure the test can correctly cover the fixed bug. |
7eae79f
to
93b698f
Compare
93b698f
to
cb0d7d0
Compare
The current test is too specific. Can you try to compare against the output of non speculative decoding version? You can also set this to sglang/docs/references/sampling_params.md Lines 56 to 57 in 6fb5768
|
cb0d7d0
to
dfc0e5e
Compare
I added the |
Great! Then please use this in the test case and we can merge this soon! |
Done |
@jjjjohnson It is merged. Thanks! |
Motivation
After eagel2 target model verify, the accepted tokens might be EOS token, need to check each appended new token. Then free kv cache of the unaccepted tokens.
Modifications
Checklist