@kaisopos (Contributor) commented Jun 13, 2025

Description

Judge API V2: Implement the CLI

CLI

oumi judge_v2 \
  --judge-config judge_config.yaml \
  --inference-config inference_config.yaml \
  --input-file input.jsonl \
  --output-file output.jsonl
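
For scripted runs, the invocation above can be assembled programmatically. The following is a minimal sketch that only builds and prints the argument list. The helper name build_judge_v2_command is hypothetical and not part of Oumi; the flags are exactly the four shown in the description, and nothing is assumed about the contents of the config or JSONL files.

```python
import shlex


def build_judge_v2_command(judge_config, inference_config, input_file, output_file):
    """Return the argv list for the `oumi judge_v2` invocation shown above.

    Hypothetical helper for illustration; it does not call into Oumi itself.
    """
    return [
        "oumi", "judge_v2",
        "--judge-config", judge_config,
        "--inference-config", inference_config,
        "--input-file", input_file,
        "--output-file", output_file,
    ]


cmd = build_judge_v2_command(
    "judge_config.yaml", "inference_config.yaml", "input.jsonl", "output.jsonl"
)
# Print a shell-safe rendering of the command for inspection.
print(shlex.join(cmd))
```

To actually launch the command, the list could be passed to subprocess.run(cmd, check=True); passing a list rather than a single string avoids shell quoting issues with file paths.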

Terminal Output
[Screenshot of the terminal output; image not preserved]

Related issues

Towards OPE-1327

Before submitting

  • This PR only changes documentation. (You can ignore the following checks in that case)
  • Did you read the contributor guidelines (Pull Request guidelines)?
  • Did you link the issue(s) related to this PR in the section above?
  • Did you add / update tests where needed?

Reviewers

At least one review from a member of oumi-ai/oumi-staff is required.

@wizeng23 (Contributor) left a comment

I might be missing context, but to me it feels weird to add an oumi verb called judge_v2.

  • Is there a reason we can't just replace the old judge verb logic with this one?
  • If we still want some of the functionality of the old judge verb, could we either add this new logic under the same verb, controlled by a flag, or give this verb a more descriptive name than just v2?

@kaisopos (Contributor, Author)

> I might be missing context, but to me it feels weird to add an oumi verb called judge_v2.
>
>   • Is there a reason we can't just replace the old judge verb logic with this one?
>   • If we still want some of the functionality of the old judge verb, could we either add this new logic under the same verb, controlled by a flag, or give this verb a more descriptive name than just v2?

Judge V2 is now restricted behind the environment flag OUMI_EXPERIMENTAL_JUDGE_V2, as discussed offline.
The plan is to add a deprecation notice for V1 and fully replace it with V2 within 1-2 versions.
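
The gating and deprecation plan described here can be sketched roughly as follows. This is an illustration under stated assumptions, not the code merged in this PR: the set of values that count as "enabled", and the helper names judge_v2_enabled, available_verbs, and warn_v1_deprecated, are all hypothetical. Only the flag name OUMI_EXPERIMENTAL_JUDGE_V2 comes from the comment.

```python
import os
import warnings

# Flag name taken from the comment above.
EXPERIMENTAL_FLAG = "OUMI_EXPERIMENTAL_JUDGE_V2"


def judge_v2_enabled(environ=None):
    """Return True if the experimental flag is set to a truthy value.

    Assumption: '1', 'true', or 'yes' (case-insensitive) enables the flag.
    """
    env = os.environ if environ is None else environ
    return env.get(EXPERIMENTAL_FLAG, "").strip().lower() in {"1", "true", "yes"}


def available_verbs(environ=None):
    """List the judge verbs to expose; judge_v2 appears only when the flag is on."""
    verbs = ["judge"]
    if judge_v2_enabled(environ):
        verbs.append("judge_v2")
    return verbs


def warn_v1_deprecated():
    """Emit the kind of deprecation notice planned for the V1 judge."""
    warnings.warn(
        "The V1 judge is deprecated and is planned to be replaced by V2.",
        DeprecationWarning,
        stacklevel=2,
    )


print(available_verbs({}))
print(available_verbs({EXPERIMENTAL_FLAG: "1"}))
```

Keeping the flag check in one small function means that the later switch to V2 by default would touch a single place.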

@kaisopos kaisopos merged commit 1016b6b into main Jun 14, 2025
5 checks passed
@kaisopos kaisopos deleted the kostas/judge_v2_cli branch June 14, 2025 09:33
penfever pushed a commit that referenced this pull request Aug 27, 2025