Skip to content

[Feature] support W8A8(FP8) and KV Cache FP8 for DeepSeek V2 #1156

@zhyncs

Description

@zhyncs

Checklist

Motivation

As titled. Make DeepSeek V2 MLA Faster!

Related resources

No response

Metadata

Metadata

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions