[v2] Merge main #2485

Samoed · 2025-04-04T12:33:00Z

Code Quality

Code Formatted: Format the code using make lint to maintain consistent style.

Documentation

Updated Documentation: Add or update documentation to reflect the changes introduced in this PR.

Testing

New Tests Added: Write tests to cover new functionality. Validate with make test-with-coverage.
Tests Passed: Run tests locally using make test or make test-with-coverage to ensure no existing functionality is broken.

Adding datasets checklist

Reason for dataset addition: ...

I have run the following models on the task (adding the results to the pr). These can be run using the mteb -m {model_name} -t {task_name} command.
- sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
- intfloat/multilingual-e5-small
I have checked that the performance is neither trivial (both models gain close to perfect scores) nor random (both models gain close to random scores).
If the dataset is too big (e.g. >2048 examples), considering using self.stratified_subsampling() under dataset_transform()
I have filled out the metadata object in the dataset file (find documentation on it here).
Run tests locally to make sure nothing is broken using make test.
Run the formatter to format the code using make lint.

Adding a model checklist

I have filled out the ModelMeta object to the extent possible
I have ensured that my model can be loaded using
- mteb.get_model(model_name, revision) and
- mteb.get_model_meta(model_name, revision)
I have tested the implementation works on a representative set of tasks.

…mplement CV-Bench (#2414) * refactor CV-Bench * reimplement CV Bench * remove abstask/evaluator/tests for Any2TextMultipleChoice * rerun descriptive stats

fix: Add option to remove leaderboard from leaderboard fixes #2413 This only removed the benchmark from the leaderboard but keep it in MTEB.

Automatically generated by python-semantic-release

* Added VDR Multilingual Dataset * address comments * make lint * Formated Dataset for retrieval * Update mteb/tasks/Retrieval/multilingual/VdrMultilingualRetrieval.py Co-authored-by: Roman Solomatin <samoed.roman@gmail.com> * Update mteb/tasks/Retrieval/multilingual/VdrMultilingualRetrieval.py Co-authored-by: Roman Solomatin <samoed.roman@gmail.com> * make lint * corrected date * fix dataset building * move to image folder --------- Co-authored-by: Roman Solomatin <samoed.roman@gmail.com> Co-authored-by: Isaac Chung <chungisaac1217@gmail.com>

Automatically generated by python-semantic-release

* pin setuptools * pin setuptools * pin setuptools in makefile * try ci * fix ci * remove speed from installs

…tering folder (#2422) * add PatentFnBClustering.py * do make lint and revise * rollback Makefile * Update mteb/tasks/Clustering/kor/PatentFnBClustering.py Co-authored-by: Roman Solomatin <samoed.roman@gmail.com> * klue_mrc_domain * make lint * klue_modified_clustering_dataset * clustering & kor folder add __init.py * clustering & kor folder add __init__.py * task.py roll-back * correct text_creation to sample_creation & delete form in MetaData * correct task_subtype in TaskMetaData * delete space * edit metadata * edit task_subtypes --------- Co-authored-by: Roman Solomatin <samoed.roman@gmail.com>

* add richinfoai models add richinfoai models * format codes by linter format codes by linter

* Fix typos; add chrono order * Fix spacing

* Add model specific dependencies in pyproject.toml * Update documentation

Automatically generated by python-semantic-release

…mplement r-Oxford and r-Paris (#2442) * MutipleChoiceEvaluationMixin; reimplement r-Oxford and r-Paris; rerun stats * modify benchmark list * fix citation

…de' (#2445) Fixes #2444

* Added meta information about SearchMap_Preview model to the model_dir * Added meta information about SearchMap_Preview model to the model_dir * updated revision name * Device loading and cuda cache cleaning step left out * removed task instructions since it's not necessary * changed sentence transformer loader to mteb default loader and passed instructions s model prompts * Included searchmap to the models overview page * Included searchmap to the models overview page * added meta data information about where model was adpated from * Update mteb/models/searchmap_models.py * fix lint * lint --------- Co-authored-by: Roman Solomatin <samoed.roman@gmail.com> Co-authored-by: Roman Solomatin <36135455+Samoed@users.noreply.github.com>

* Add Background Gradients in Summary and Task Table * Remove warnings and add light green cmap * Address comments * Separate styling function * address comments * added comments

* add ops_moa_models * add custom implementations * Simplify custom implementation and format the code * support SentenceTransformers * add training datasets * Update mteb/models/ops_moa_models.py Co-authored-by: Roman Solomatin <samoed.roman@gmail.com> * update training_datasets --------- Co-authored-by: kunka.xgw <kunka.xgw@taobao.com> Co-authored-by: Roman Solomatin <samoed.roman@gmail.com>

ci: cache ~/.cache/huggingface Co-authored-by: sam021313 <40773225+sam021313@users.noreply.github.com>

…mplement ImageCoDe (#2468) * reimplement ImageCoDe with ImageTextPairClassification * add missing stats file

* feat: added pubmedbert model2vec models * fix: attribute model_name * fix: fixed commit hash for pubmed_bert model2vec models * fix: changes requested in PR 2443

* add_nb_sbert_model * Update nb_sbert.py added n_parameters and release_date * Update mteb/models/nb_sbert.py Co-authored-by: Roman Solomatin <samoed.roman@gmail.com> * Update nb_sbert.py fix: make lint * added nb_sbert to overview.py + ran make lint * Update nb_sbert.py Fix error: Input should be a valid date or datetime, month value is outside expected range of 1-12 --------- Co-authored-by: Roman Solomatin <samoed.roman@gmail.com>

Automatically generated by python-semantic-release

* supress logging warnings * remove loggers * return blocks * rename function * fix gme models * add server name * update after merge * fix ruff

Fixes #1442

Automatically generated by python-semantic-release

rename VisionCentric to VisionCentricQA

Update dataset_loading.yml

gowitheflow-1998 and others added 30 commits March 23, 2025 03:34

[MIEB] "capability measured"-Abstask 1-1 matching refactor [1/3]: rei…

2833138

…mplement CV-Bench (#2414) * refactor CV-Bench * reimplement CV Bench * remove abstask/evaluator/tests for Any2TextMultipleChoice * rerun descriptive stats

Update tasks table

065159d

fix: Add option to remove benchmark from leaderboard (#2417)

e8faf3f

fix: Add option to remove leaderboard from leaderboard fixes #2413 This only removed the benchmark from the leaderboard but keep it in MTEB.

1.36.31

a25dadb

Automatically generated by python-semantic-release

Update tasks table

34edcd5

1.36.32

0cdf2e0

Automatically generated by python-semantic-release

HOTFIX: pin setuptools (#2423)

071741d

* pin setuptools * pin setuptools * pin setuptools in makefile * try ci * fix ci * remove speed from installs

Update tasks table

55c542b

Update speed dependencies with new setuptools release (#2429)

731c4fc

add richinfoai models (#2427)

98ab0ef

* add richinfoai models add richinfoai models * format codes by linter format codes by linter

Added Memory Usage column on leaderboard (#2428)

d3eab6f

docs: typos; Standardize spacing; Chronological order (#2436)

0db0a20

* Fix typos; add chrono order * Fix spacing

fix: Add model specific dependencies in pyproject.toml (#2424)

8a024be

* Add model specific dependencies in pyproject.toml * Update documentation

1.36.33

6ae420d

Automatically generated by python-semantic-release

[MIEB] "capability measured"-Abstask 1-1 matching refactor [2/3]: rei…

65446e5

…mplement r-Oxford and r-Paris (#2442) * MutipleChoiceEvaluationMixin; reimplement r-Oxford and r-Paris; rerun stats * modify benchmark list * fix citation

Update tasks table

19dc625

Error while evaluating MIRACLRetrievalHardNegatives: 'trust_remote_co…

dadafbe

…de' (#2445) Fixes #2444

Add Background Gradients in Summary and Task Table (#2392)

5af5547

* Add Background Gradients in Summary and Task Table * Remove warnings and add light green cmap * Address comments * Separate styling function * address comments * added comments

leaderboard fix (#2456)

35a8a5b

ci: cache ~/.cache/huggingface (#2464)

d11934f

ci: cache ~/.cache/huggingface Co-authored-by: sam021313 <40773225+sam021313@users.noreply.github.com>

[MIEB] "capability measured"-Abstask 1-1 matching refactor [3/3]: rei…

8799126

…mplement ImageCoDe (#2468) * reimplement ImageCoDe with ImageTextPairClassification * add missing stats file

Update tasks table

5b567bf

fix: Adds family of NeuML/pubmedbert-base-embedding models (#2443)

f293d8b

* feat: added pubmedbert model2vec models * fix: attribute model_name * fix: fixed commit hash for pubmed_bert model2vec models * fix: changes requested in PR 2443

1.36.34

42068c6

Automatically generated by python-semantic-release

suppress logging warnings on leaderboard (#2406)

e837b09

* supress logging warnings * remove loggers * return blocks * rename function * fix gme models * add server name * update after merge * fix ruff

KennethEnevoldsen and others added 8 commits April 2, 2025 11:41

fix: E5 instruct now listed as sbert compatible (#2475)

6c8c8d2

Fixes #1442

1.36.35

eef52be

Automatically generated by python-semantic-release

[MIEB] rename VisionCentric to VisionCentricQA (#2479)

295ad0a

rename VisionCentric to VisionCentricQA

ci: Run dataset loading only when pushing to main (#2480)

17b53b4

Update dataset_loading.yml

fix table in tasks.md (#2483)

f5881b0

Update tasks table

9117c2f

merge

ed5f7d2

fix imports

aff9a3c

Samoed requested a review from isaac-chung April 4, 2025 12:33

Samoed changed the base branch from main to v2.0.0 April 4, 2025 12:33

Samoed added 6 commits April 4, 2025 15:35

update model loader

a763df7

remove unused imports

a5a6cdb

fix clip name

573614f

fix moco models

a458ca3

fix tests

1d808d0

fix tests

1011b62

isaac-chung approved these changes Apr 4, 2025

View reviewed changes

Samoed merged commit 36cf009 into v2.0.0 Apr 4, 2025
9 checks passed

Samoed deleted the merge_main branch April 4, 2025 18:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[v2] Merge main #2485

[v2] Merge main #2485

Uh oh!

Samoed commented Apr 4, 2025

Uh oh!

Uh oh!

Uh oh!

[v2] Merge main #2485

[v2] Merge main #2485

Uh oh!

Conversation

Samoed commented Apr 4, 2025

Code Quality

Documentation

Testing

Adding datasets checklist

Adding a model checklist

Uh oh!

Uh oh!

Uh oh!