Skip to content

Conversation

wa008
Copy link
Contributor

@wa008 wa008 commented May 16, 2025

What does this PR do?

Fix bugs when use custom reward function in Command Line Interfaces (CLIs) GRPO, like command below cli grpo --reward_funcs my_lib.my_reward

Current path doesn't exist in sys.path, as a result, custom reward function in current path cannot be imported correctly. This PR is to fix it.

Fixes #3448

Before submitting

Who can review?

@qgallouedec

@qgallouedec
Copy link
Member

Thanks!

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@qgallouedec qgallouedec changed the title Update grpo.py to fix bugs for cli grpo --reward_funcs my_lib.my_reward 👇 Update grpo.py to fix bugs for cli grpo --reward_funcs my_lib.my_reward May 27, 2025
@qgallouedec qgallouedec merged commit cd83841 into huggingface:main May 27, 2025
10 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Rewards functions for Command Line Interfaces GRPO trainer
3 participants