Skip to content

[Feature] Add MultiPL-E & Code Evaluator #1963

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 4 commits into from
Mar 21, 2025

Conversation

Zhudongsheng75
Copy link
Collaborator

  1. Developed a client/server-based code evaluator;
  2. Based on this code evaluator, the MultiPL-E dataset has been added. Currently, OC can support evaluation tasks in multiple programming languages.

@tonysy
Copy link
Collaborator

tonysy commented Mar 21, 2025

multiple_infer_cfg = dict(
prompt_template=dict(type=PromptTemplate, template='Based on the provided {language} code snippet, complete the subsequent content. The initial part of the completed code must match the provided code snippet exactly:\n{prompt}'),
retriever=dict(type=ZeroRetriever),
inferencer=dict(type=GenInferencer, max_out_len=2048),
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please consider removing this max_out_len to avoid the truncation for the long cot decoding scenario.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done

change num_repeats>1, otherwise the number in
`.cache/dataset_size.json` might be inconsistent.

Args:
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Doc string is missing

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done

Copy link
Collaborator

@tonysy tonysy left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@tonysy tonysy merged commit 8a5029b into open-compass:main Mar 21, 2025
8 checks passed
@tonysy tonysy requested a review from MaiziXiao April 28, 2025 02:49
stephen-nju pushed a commit to stephen-nju/opencompass that referenced this pull request May 14, 2025
* multiple_code develop

* multiple_code update

* comments upadate

* index upadate
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants