Skip to content
View l1cacheDell's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Block or report l1cacheDell

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
l1cacheDell/README.md

Profile

HPC/CUDA C++ system programmer.

  • [2024.11-2025.05] High Performance Computing Intern at @PaddlePaddle (Baidu).
  • [2023.05-2024.02] Research Intern at THU-AIR

Education

  • [2025.8-] National University of Singapore (NUS), Computer Engineering, Msc.
  • [2021.9-2025.6] Beijing University of Posts and Telecommunications (BUPT), Artificial Intelligence. BEng.

Pinned Loading

  1. PaddlePaddle/Paddle PaddlePaddle/Paddle Public

    PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

    C++ 23.2k 5.8k

  2. thu-ml/SageAttention thu-ml/SageAttention Public

    Quantized Attention achieves speedup of 2-5x and 3-11x compared to FlashAttention and xformers, without lossing end-to-end metrics across language, image, and video models.

    Cuda 2.3k 208

  3. PaddlePaddle/PaddleNLP PaddlePaddle/PaddleNLP Public

    Easy-to-use and powerful LLM and SLM library with awesome model zoo.

    Python 12.8k 3.1k

  4. triton-inference-server/vllm_backend triton-inference-server/vllm_backend Public

    Python 293 32