Focus on model inference optimization, such as inference engine and model compression.
- Shanghai
Chengxiang Qi
KuangjuX
MLSys| Deep Learning Compiler|System at MSRA
UCAS Beijing / Tianjin / Hangzhou, China
Zhuohan Li
zhuohan123
MTS @ @openai |
🎓 cs phd @ 🌁 uc berkeley |
building @vllm-project |
machine learning system |
the real agi is the friends we made along the way
OpenAI San Francisco Bay Area