HollowMan6
Contributor

Checklist Before Starting

  • Searched for similar PR(s).
  • Checked PR Title format
    • In format of: [modules] type: Title
    • modules are in fsdp, megatron, sglang, vllm, rollout, trainer, ci, training_utils, recipe, hardware, deployment, ray, worker, single_controller, misc, perf, model, algo, env, tool, ckpt, doc, data
    • type is in feat, fix, refactor, chore, test
    • can involve multiple modules, separated by , or space, like [megatron, fsdp, doc] feat: xxx

What does this PR do?

Fix the error `Error in reward_fn: reward_extra_info`, which occurs because some reward function implementations include only `reward_tensor` in the returned dictionary.
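
To illustrate the failure mode, here is a minimal sketch, not the actual diff of this PR. It assumes the caller indexes the returned dictionary directly, so a missing `reward_extra_info` key raises a `KeyError`. The names `reward_fn_tensor_only`, `reward_fn_with_extra`, and `compute_reward` are hypothetical, and plain lists stand in for tensors so the example has no dependencies.

```python
# Illustrative sketch only: hypothetical names, plain lists instead of tensors.

def reward_fn_tensor_only(batch):
    """Hypothetical reward function that returns only `reward_tensor`."""
    return {"reward_tensor": [0.0 for _ in batch]}


def reward_fn_with_extra(batch):
    """Hypothetical reward function that also returns `reward_extra_info`."""
    return {
        "reward_tensor": [1.0 for _ in batch],
        "reward_extra_info": {"acc": [1.0 for _ in batch]},
    }


def compute_reward(batch, reward_fn):
    """Call the reward function and tolerate a missing `reward_extra_info`."""
    result = reward_fn(batch)
    reward_tensor = result["reward_tensor"]
    # result["reward_extra_info"] would raise KeyError('reward_extra_info')
    # for reward_fn_tensor_only; .get() with a default avoids that.
    reward_extra_info = result.get("reward_extra_info", {})
    return reward_tensor, reward_extra_info
```

With this pattern, a reward function that omits `reward_extra_info` yields an empty dictionary of extra info instead of an error, and one that provides it is passed through unchanged.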

Checklist Before Submitting

  • Read the Contribute Guide.
  • Apply pre-commit checks.
  • Add [BREAKING] to the PR title description if it breaks any API.
  • Update the documentation about your changes in the docs.
  • New CI unit test(s) are added to cover the code path.
  • Rely on existing unit tests on CI that cover the code path.

Fix the error `Error in reward_fn: reward_extra_info`, which occurs
because some reward function implementations include only
`reward_tensor` in the returned dictionary.

- https://github.com/volcengine/verl/blob/b401382405304436292ca19870d1917a0174d09c/verl/workers/reward_manager/prime.py#L176
- https://github.com/volcengine/verl/blob/b401382405304436292ca19870d1917a0174d09c/examples/split_placement/main_ppo_split.py#L88

Signed-off-by: Hollow Man <hollowman@opensuse.org>
Collaborator

@tongyx361 tongyx361 left a comment

LGTM!

@tongyx361 tongyx361 merged commit 18c2825 into volcengine:main Jun 19, 2025
30 of 31 checks passed
@HollowMan6 HollowMan6 deleted the reward_extra_info branch June 20, 2025 02:44
yellowbee686 pushed a commit to yellowbee686/verl that referenced this pull request Jun 23, 2025
…olcengine#2109)

Tyizhanshen pushed a commit to HyperdriveHustle/verl that referenced this pull request Jul 1, 2025
…olcengine#2109)
