Skip to content

Conversation

Samoed
Copy link
Member

@Samoed Samoed commented Mar 17, 2025

Code Quality

  • Code Formatted: Format the code using make lint to maintain consistent style.

Documentation

  • Updated Documentation: Add or update documentation to reflect the changes introduced in this PR.

Testing

  • New Tests Added: Write tests to cover new functionality. Validate with make test-with-coverage.
  • Tests Passed: Run tests locally using make test or make test-with-coverage to ensure no existing functionality is broken.

Adding datasets checklist

Reason for dataset addition: ...

  • I have run the following models on the task (adding the results to the pr). These can be run using the mteb -m {model_name} -t {task_name} command.
    • sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
    • intfloat/multilingual-e5-small
  • I have checked that the performance is neither trivial (both models gain close to perfect scores) nor random (both models gain close to random scores).
  • If the dataset is too big (e.g. >2048 examples), considering using self.stratified_subsampling() under dataset_transform()
  • I have filled out the metadata object in the dataset file (find documentation on it here).
  • Run tests locally to make sure nothing is broken using make test.
  • Run the formatter to format the code using make lint.

Adding a model checklist

  • I have filled out the ModelMeta object to the extent possible
  • I have ensured that my model can be loaded using
    • mteb.get_model(model_name, revision) and
    • mteb.get_model_meta(model_name, revision)
  • I have tested the implementation works on a representative set of tasks.

KennethEnevoldsen and others added 30 commits March 11, 2025 13:19
These errors where discovered when trying to install the package using `uv`.

We have a problem with salesforce-lavis, which is not compatible with the current set of dependencies.
Automatically generated by python-semantic-release
* fix: Resolve conflicting dependencies

These errors where discovered when trying to install the package using `uv`.

We have a problem with salesforce-lavis, which is not compatible with the current set of dependencies.

* fix: Remove syntax warnings occuring in python 3.12

```
Python 3.12.0 (main, Oct  2 2023, 20:56:14) [Clang 16.0.3 ] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> import mteb # no syntax warnings
>>>
```
Automatically generated by python-semantic-release
* fix: add annotation models for stella zh

Additionally fixed a few annotation errors

* format

* Update mteb/models/stella_models.py

Co-authored-by: Isaac Chung <chungisaac1217@gmail.com>

---------

Co-authored-by: Isaac Chung <chungisaac1217@gmail.com>
Automatically generated by python-semantic-release
* Add rubert-mini-frida model meta

* Add BERTA model meta
Automatically generated by python-semantic-release
* Add WebFAQ bitext mining tasks

Signed-off-by: Michael Dinzinger <michael.dinzinger@uni-passau.de>

* Lower number of language pairs in WebFAQBitextMining

Signed-off-by: Michael Dinzinger <michael.dinzinger@uni-passau.de>

---------

Signed-off-by: Michael Dinzinger <michael.dinzinger@uni-passau.de>
Automatically generated by python-semantic-release
Automatically generated by python-semantic-release
* fix: Add `trust_remote_code` to MIRACLRetrieval

* fix: Correctly pass trust remote code to Miracl

* fix
Automatically generated by python-semantic-release
* add PatentFnBClustering.py

* do make lint and revise

* rollback Makefile

* Update mteb/tasks/Clustering/kor/PatentFnBClustering.py

Co-authored-by: Roman Solomatin <samoed.roman@gmail.com>

* klue_mrc_domain

* make lint

* klue_modified_clustering_dataset

---------

Co-authored-by: Roman Solomatin <samoed.roman@gmail.com>
* Rename dunzhang and Jasper models to NovaResearch

* rename model in tests

* correct reference link
* correct stats

* update Any2AnyMultiChoice qrels stats compute logic

* final correction
* Reduce logging and Warnings

* make lint

* format license to lowercase

* Address all comments

* Update mteb/leaderboard/app.py

Co-authored-by: Roman Solomatin <samoed.roman@gmail.com>

---------

Co-authored-by: Roman Solomatin <samoed.roman@gmail.com>
Automatically generated by python-semantic-release
* fix: added b1ade_models.py (#2340)

* added b1ade_models.py

* changing based on requested

* Update mteb/models/b1ade_models.py

Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsen@gmail.com>

---------

Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsen@gmail.com>

* fix: missing import and formatting

---------

Co-authored-by: Shreyas Subramanian <shreyas.f117@gmail.com>
Automatically generated by python-semantic-release
@Samoed Samoed merged commit f9f51c7 into v2.0.0 Mar 17, 2025
5 of 10 checks passed
@Samoed Samoed deleted the merge_main branch March 17, 2025 20:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

8 participants