maksimstw

Taiwei Shi maksimstw

Achievements

volcengine/verl volcengine/verl Public

verl: Volcano Engine Reinforcement Learning for LLMs

Python 13.3k 2.3k
hiyouga/LLaMA-Factory hiyouga/LLaMA-Factory Public

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 58k 7.1k
BytedTsinghua-SIA/MemAgent BytedTsinghua-SIA/MemAgent Public

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 658 50
lmarena/arena-hard-auto lmarena/arena-hard-auto Public

Arena-Hard-Auto: An automatic LLM benchmark.

Python 924 125
zeno-ml/zeno-build zeno-ml/zeno-build Public

Build, evaluate, understand, and fix LLM-based apps

Jupyter Notebook 491 32
limenlp/safer-instruct limenlp/safer-instruct Public

This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"

17 1