I may be slow to respond.
Researcher & Engineering.
Currently working on AIGC.
-
Vivo Mobile Communication Co. Ltd
- Hangzhou, China
Pinned Loading
-
Image-Local-Attention
Image-Local-Attention PublicA better PyTorch implementation of image local attention which reduces the GPU memory by an order of magnitude.
-
FlashWindowAttention
FlashWindowAttention PublicSpeedup the attention computation of Swin Transformer
-
MEDUSA-Plus
MEDUSA-Plus PublicMEDUSA+: Acceleration Multiple Heads Decoding for LLM via Dynamic Tree Attention
Jupyter Notebook
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.