MattiaSangermano (Contributor) commented Jul 6, 2025

This PR includes the results from mteb PR 2859. It adds results for the models intfloat__multilingual-e5-small and sentence-transformers__paraphrase-multilingual-MiniLM-L12-v2 on DadoEvalCoarseClassification, DisCoTexPairClassification, EmitClassification, SardiStanceClassification and XGlueWPRReranking.

Checklist

  • My model has a model sheet, report or similar
  • My model has a reference implementation in mteb/models/; this can be an API. Instructions on how to add a model can be found here
    • No, but there is an existing PR ___
  • The results submitted were obtained using the reference implementation
  • My model is available, either as a publicly accessible API or publicly on e.g., Huggingface
  • I solemnly swear that for all results submitted I have not trained on the evaluation dataset, including training splits. If I have, I have disclosed it clearly.

…-transformers__paraphrase-multilingual-MiniLM-L12-v2 on DadoEvalCoarseClassification, DisCoTexPairClassification, EmitClassification, SardiStanceClassification and XGlueWPRReranking
…-transformers__paraphrase-multilingual-MiniLM-L12-v2 on MKQARetrieval task
isaac-chung (Contributor) left a comment

Thanks!
@Samoed any idea if this error already has a fix planned/in place?

Samoed (Member) commented Jul 7, 2025

The error occurs because I'm using MTEB from the main branch, but this PR was created before the tasks were merged. I don't have any ideas on how to fix it (without manually triggering the CI with a different branch), except to rerun it manually after the tasks are merged.

github-actions bot commented Jul 7, 2025

Model Results Comparison

Reference models: intfloat/multilingual-e5-large, google/gemini-embedding-001
New models evaluated: intfloat/multilingual-e5-small, sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
Tasks: DadoEvalCoarseClassification, DisCoTexPairClassification, EmitClassification, MKQARetrieval, SardiStanceClassification, XGlueWPRReranking

Results for intfloat/multilingual-e5-small

| task_name | intfloat/multilingual-e5-small | Max result |
|---|---|---|
| DadoEvalCoarseClassification | 0.51 | 0.51 |
| DisCoTexPairClassification | 0.69 | 0.69 |
| EmitClassification | 0.04 | 0.04 |
| MKQARetrieval | 0.06 | 0.06 |
| SardiStanceClassification | 0.42 | 0.42 |
| XGlueWPRReranking | 0.70 | 0.70 |
| Average | 0.40 | 0.40 |

Results for sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2

| task_name | sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 | Max result |
|---|---|---|
| DadoEvalCoarseClassification | 0.38 | 0.51 |
| DisCoTexPairClassification | 0.67 | 0.69 |
| EmitClassification | 0.03 | 0.04 |
| MKQARetrieval | 0.05 | 0.06 |
| SardiStanceClassification | 0.40 | 0.42 |
| XGlueWPRReranking | 0.68 | 0.70 |
| Average | 0.37 | 0.40 |
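
As a sanity check on the two tables above, the Average rows and the Max result column can be reproduced from the rounded per-task scores. This is only a minimal sketch: it assumes the average is a plain mean over the six tasks and that the maximum is taken over the two new models only (the tables show no scores for the reference models). The `scores` dictionary and the `average` helper are illustrative names, not part of the comparison bot.

```python
# Per-task scores copied from the two tables above (already rounded to two decimals).
scores = {
    "intfloat/multilingual-e5-small": {
        "DadoEvalCoarseClassification": 0.51,
        "DisCoTexPairClassification": 0.69,
        "EmitClassification": 0.04,
        "MKQARetrieval": 0.06,
        "SardiStanceClassification": 0.42,
        "XGlueWPRReranking": 0.70,
    },
    "sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2": {
        "DadoEvalCoarseClassification": 0.38,
        "DisCoTexPairClassification": 0.67,
        "EmitClassification": 0.03,
        "MKQARetrieval": 0.05,
        "SardiStanceClassification": 0.40,
        "XGlueWPRReranking": 0.68,
    },
}


def average(task_scores):
    """Plain mean over tasks, rounded to two decimals (assumed, not confirmed)."""
    return round(sum(task_scores.values()) / len(task_scores), 2)


# Average row for each model.
averages = {model: average(task_scores) for model, task_scores in scores.items()}

# Max result column: best score per task across the models listed here.
tasks = next(iter(scores.values())).keys()
max_per_task = {
    task: max(task_scores[task] for task_scores in scores.values())
    for task in tasks
}

for model, avg in averages.items():
    print(f"{model}: average = {avg:.2f}")
for task, best in max_per_task.items():
    print(f"{task}: max = {best:.2f}")
```

Under these assumptions the computed averages and per-task maxima agree with the values reported in the tables.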

isaac-chung (Contributor) commented

> The error occurs because I'm using MTEB from the main branch, but this PR was created before the tasks were merged. I don't have any ideas on how to fix it (without manually triggering the CI with a different branch), except to rerun it manually after the tasks are merged.

I see, thanks. We might want to look into that separately. Let's create an issue for that.

@isaac-chung isaac-chung merged commit 0353d7b into embeddings-benchmark:main Jul 7, 2025
3 of 4 checks passed