Skip to content
@infinigence

Infinigence

Popular repositories Loading

  1. Infini-Megrez Infini-Megrez Public

    326 20

  2. Infini-Megrez-Omni Infini-Megrez-Omni Public

    Python 237 10

  3. FlashOverlap FlashOverlap Public

    A lightweight design for computation-communication overlap.

    Cuda 155 6

  4. Semi-PD Semi-PD Public

    A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.

    Python 104 10

  5. LVEval LVEval Public

    Repository of LV-Eval Benchmark

    Python 69 9

  6. SpecEE SpecEE Public

    Repo for SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting (ISCA25)

    C++ 46 3

Repositories

Showing 10 of 12 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…