yuxianq

Follow

Yuxian Qiu yuxianq

Follow

2 followers · 0 following

NVIDIA
Shanghai
23:35 (UTC +08:00)

Achievements

Achievements

Popular repositories Loading

flashinfer flashinfer Public

Forked from flashinfer-ai/flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda
TensorRT-LLM TensorRT-LLM Public

Forked from NVIDIA/TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++
DeepGEMM DeepGEMM Public

Forked from deepseek-ai/DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

C++