OpenRLHF / OpenRLHF Public

Fork 764
Star 7.8k

Code
Issues 272
Pull requests 20
Discussions
Actions
Projects
Security
Insights

Pull requests: OpenRLHF/OpenRLHF

Labels 13 Milestones 0


20 Open 393 Closed

Pull requests list

Star Attention topology support with model integration and --attn_topology flag

#1122 opened Aug 26, 2025 by MagellaX


Add FSDP backend and --dist_backend flag across CLIs; introduce FSDPStrategy

#1115 opened Aug 23, 2025 by MagellaX


Add GSPO: Sequence-level policy optimization with group advantages

#1111 opened Aug 21, 2025 by MagellaX


CLI support for top_k

#1104 opened Aug 13, 2025 by JoNeedsSleep


Update utils.py

#1101 opened Aug 6, 2025 by LiyuanLucasLiu


Enable logits extraction from vLLM for training

#1075 opened Jul 9, 2025 by MooMoo-Yang


Reward model outputting one reward per rollout.

#1037 opened May 28, 2025 by NotTheStallion


Support specifying ds tp size separately for each model

#1002 opened Apr 28, 2025 by HollowMan6


Merge lmm-r1 for Multimodal PPO

#989 opened Apr 23, 2025 by TideDra


Fix Tokenizer Behavior for Special Placeholder Token

#894 opened Mar 20, 2025 by YuchenFan48


Support unbiased off-policy GRPO

#840 opened Mar 7, 2025 by LYMDLUT


added LoRA adapter disabling for computing KL divergence in single no…

#836 opened Mar 6, 2025 by wilkincr


Add support for max time per run

#711 opened Feb 5, 2025 by titu1994


Support SFT and DPO training for Qwen2VL

#665 opened Jan 10, 2025 by LiuXTao


Integrate SGLang into OpenRLHF. Non-Hybrid Engine Only

#661 opened Jan 9, 2025 by zhaochenyang20


Support rl logging board

#658 opened Jan 9, 2025 by HarderThenHarder


Ensure train datasets do not contain eval datasets

#594 opened Dec 17, 2024 by dingyuan-shi


Support broadcast vllm params by chunks

#593 opened Dec 17, 2024 by zhuzilin


Make sure there is always _some_ eval data

#582 opened Dec 13, 2024 by frrad


Support TRL's RLOO

#553 opened Dec 4, 2024 by songxxzp


