
Benchmark tasks are incorrectly selected #1757

@Samoed

Description


Currently, when loading MTEB_ENG_CLASSIC, the benchmark includes cross-lingual tasks with all of their subsets:

from mteb import MTEB_ENG_CLASSIC

# The benchmark's task list includes cross-lingual tasks:
MTEB_ENG_CLASSIC.tasks
# MTEBTasks(..., STS17Crosslingual(name='STS17', languages=['ara', 'deu', 'eng', '...']), 

MTEB_ENG_CLASSIC.tasks[-17]
# STS22CrosslingualSTS(name='STS22', languages=['cmn', 'deu', 'eng', '...'])

# All five hf_subsets are selected, not only the English one:
MTEB_ENG_CLASSIC.tasks[-17].hf_subsets
# ['en', 'de-en', 'es-en', 'pl-en', 'zh-en']

However, in the old leaderboard, only the en split is selected: https://github.com/embeddings-benchmark/leaderboard/blob/80dd9a2b2d4a0abc368a6cea5f79a517b753951b/config.yaml#L124-L125.
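For comparison, the English-only selection can be approximated with the task-loading API. A minimal sketch, assuming mteb.get_tasks accepts a languages filter (the exact subset-filtering semantics, i.e. whether only en or every eng-involving subset is kept, are an assumption here):

import mteb

# Load STS22 filtered to English and inspect which hf_subsets remain.
tasks = mteb.get_tasks(tasks=["STS22"], languages=["eng"])
print(tasks[0].hf_subsets)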

This could be an issue, because the main score for a task is averaged over its subsets:

return val_sum / n_val
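A minimal sketch of the consequence, with placeholder numbers (not real results), assuming the main score is a plain mean over per-subset scores:

# Hypothetical per-subset scores, purely illustrative.
scores_per_subset = {"en": 0.80, "de-en": 0.60, "es-en": 0.65, "pl-en": 0.70, "zh-en": 0.55}

def mean_main_score(scores: dict[str, float]) -> float:
    # Mirrors the val_sum / n_val line above: every hf_subset counts
    # equally, so cross-lingual subsets dilute the en-only score.
    val_sum = sum(scores.values())
    n_val = len(scores)
    return val_sum / n_val

print(mean_main_score(scores_per_subset))                # 0.66 over all five subsets
print(mean_main_score({"en": scores_per_subset["en"]}))  # 0.80 with only en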

Or is it the old leaderboard version that is incorrect?

CC @Muennighoff
