Skip to content

Conversation

yanbing-j
Copy link
Contributor

@yanbing-j yanbing-j commented Aug 1, 2025

Motivation

Modifications

This PR is to add support of FP8 block quantize when N or K is not multiples of 128.

Accuracy Test

Benchmark & Profiling

Checklist

Copy link
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@mingfeima mingfeima added the ready-to-merge The PR is ready to merge after the CI is green. label Aug 1, 2025
@mingfeima mingfeima marked this pull request as ready for review August 1, 2025 06:41
@zhyncs zhyncs merged commit 1fe691a into sgl-project:main Aug 1, 2025
37 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ready-to-merge The PR is ready to merge after the CI is green.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants