Pinned Loading
-
djl
djl PublicForked from deepjavalibrary/djl
An Engine-Agnostic Deep Learning Framework in Java
Java 1
-
-
EAGLE
EAGLE PublicForked from SafeAILab/EAGLE
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty
Python
-
flash-attention
flash-attention PublicForked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Python
-
-
vllm-project/vllm
vllm-project/vllm PublicA high-throughput and memory-efficient inference and serving engine for LLMs
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.