Highlights
- Pro
Pinned Loading
-
volcengine/verl
volcengine/verl Publicverl: Volcano Engine Reinforcement Learning for LLMs
-
hiyouga/LLaMA-Factory
hiyouga/LLaMA-Factory PublicUnified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
-
BytedTsinghua-SIA/MemAgent
BytedTsinghua-SIA/MemAgent PublicA MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.
-
lmarena/arena-hard-auto
lmarena/arena-hard-auto PublicArena-Hard-Auto: An automatic LLM benchmark.
-
zeno-ml/zeno-build
zeno-ml/zeno-build PublicBuild, evaluate, understand, and fix LLM-based apps
-
limenlp/safer-instruct
limenlp/safer-instruct PublicThis is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.