Fighting
-
Renmin University of China
- Beijing, China
-
09:54
(UTC +08:00)
Pinned Loading
-
RUCAIBox/TextBox
RUCAIBox/TextBox PublicTextBox 2.0 is a text generation library with pre-trained language models
-
RUCAIBox/Slow_Thinking_with_LLMs
RUCAIBox/Slow_Thinking_with_LLMs PublicA series of technical report on Slow Thinking with LLM
-
RUCAIBox/Passk_Training
RUCAIBox/Passk_Training PublicThe official repository of paper "Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models''
-
RUCAIBox/ChatCoT
RUCAIBox/ChatCoT PublicThe official repository of "ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models"
-
RUCAIBox/RLMEC
RUCAIBox/RLMEC PublicForked from Timothy023/RLMEC
The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"
-
RUCAIBox/ALLO
RUCAIBox/ALLO PublicThe official repository of "Low-Redundant Optimization for Large Language Model Alignment''
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.