Lumeng Wu dirtyDan0

Innovation can be bug-driven

3 followers · 0 following

THE UNIVERSITY OF HONG KONG
Hong Kong
16:10 (UTC +08:00)

dirtyDan0/README.md

Hi there 👋

contact: lumeng.wu@connect.hku.hk

Pinned

volcengine/verl (Public)

verl: Volcano Engine Reinforcement Learning for LLMs

Python · 12.7k stars · 2.2k forks

VerboseLengthReduction (Public)

An empirical study on how verbosity in LLM responses decreases during reinforcement learning training.

Python