Skip to content

Conversation

zhyncs
Copy link
Member

@zhyncs zhyncs commented Dec 8, 2024

Motivation

fix #2313 cc @yzh119

v0.1.6 and nightly

Modifications

Checklist

  • Format your code according to the Contributor Guide.
  • Add unit tests as outlined in the Contributor Guide.
  • Update documentation as needed, including docstrings or example tutorials.

@zhyncs
Copy link
Member Author

zhyncs commented Dec 8, 2024

@james-p-xu after this PR, you can use the nightly flashinfer for testing
also cc @BBuf

@zhyncs zhyncs merged commit 6128f7c into main Dec 8, 2024
3 of 16 checks passed
@zhyncs zhyncs deleted the zhyncs/fi branch December 8, 2024 12:07
@zhyncs
Copy link
Member Author

zhyncs commented Dec 8, 2024

@zhyncs
Copy link
Member Author

zhyncs commented Dec 8, 2024

v0.1.6 https://github.com/sgl-project/sglang/actions/runs/12221450009/job/34090501035
nightly https://github.com/sgl-project/sglang/actions/runs/12221709641/job/34091112429

Based on the results of the performance test, nightly's performance is worse than v0.1.6. @yzh119

timethink pushed a commit to timethink/sglang that referenced this pull request Mar 9, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[Feature] Specify dtype at begin_forward for FlashInfer > 0.1.6
1 participant