A Python library for training VLMs using GRPO with Unsloth acceleration.
git clone https://github.com/agentsea/uvg.git
cd uvg
uv sync && uv pip install flash-attn --no-build-isolation && uv pip install -e .
To add Qwen-specific deps:
uv pip install -e ".[qwen]"
Check the examples/
folder.