Hongbosherlock/README.md

Hi, I'm Hongbosherlock.

  • 📖 I graduated from the University of Chinese Academy of Sciences.
  • 🔭 I’m currently focusing on LLM inference and compression (quantization, pruning).
  • 🌱 I’m currently learning CUDA and C++.
  • 👯 I’m looking to collaborate on LLM infra.
  • 💬 Ask me about LLM quantization and inference.
  • 📫 How to reach me: hongbosherlock@gmail.com
  • ⚡ Fun fact: I am an amateur photographer 📷. My work can be found at https://photo.leoneo.top

Pinned

  1. sgl-project/sglang Public

    SGLang is a fast serving framework for large language models and vision language models.

    Python 17.4k 2.8k

  2. vllm-project/vllm Public

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 56.6k 9.8k

  3. Infrasys-AI/AISystem Public

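    AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.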

    Jupyter Notebook 14.9k 2.2k

  4. QuantLLM Public

    Quantization Kernel Library for LLM Inference

    C++