abdurrahmanbutler
Contributor

Hi,
I’m submitting this pull request to push the Californian and US splits of the BillSum dataset to MTEB.

BillSum is a dataset created by FiscalNote for the purposes of training and evaluating models capable of summarizing federal and state legislation.

We have reframed the problem in terms of the retrieval of bills based on their summaries, making our reformatted datasets suitable for the evaluation of legal information retrieval models.
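As a rough illustration of this reframing (not the actual conversion script used for this PR), the sketch below treats each summary as a query and its bill as the single relevant document. The `text` and `summary` field names follow the original BillSum dataset; the `q{i}`/`d{i}` ID scheme, the function name, and the toy records are hypothetical.

```python
from typing import Dict, Iterable, Tuple


def to_retrieval_format(
    records: Iterable[dict],
) -> Tuple[Dict[str, str], Dict[str, str], Dict[str, Dict[str, int]]]:
    """Turn (bill text, summary) pairs into queries, a corpus, and relevance judgments."""
    queries: Dict[str, str] = {}
    corpus: Dict[str, str] = {}
    relevant_docs: Dict[str, Dict[str, int]] = {}
    for i, record in enumerate(records):
        query_id, doc_id = f"q{i}", f"d{i}"    # hypothetical ID scheme
        queries[query_id] = record["summary"]  # the summary acts as the query
        corpus[doc_id] = record["text"]        # the full bill is the document to retrieve
        relevant_docs[query_id] = {doc_id: 1}  # exactly one relevant bill per summary
    return queries, corpus, relevant_docs


# Toy records for illustration only; these are not taken from BillSum.
example = [
    {"text": "An act to amend the Water Code.", "summary": "Amends the Water Code."},
    {"text": "A bill to establish a grant program.", "summary": "Creates a grant program."},
]
queries, corpus, relevant_docs = to_retrieval_format(example)
```

Each query has exactly one positive document, so a model is scored on how highly it ranks the matching bill among all bills in the split.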

We want to improve the coverage of legal domain tasks on MTEB and we believe this dataset will contribute to increasing the diversity and difficulty of MTEB.

This pull request is being submitted courtesy of Isaacus, a legal AI research company.

You may find the original dataset here:
https://huggingface.co/datasets/FiscalNote/billsum

Note that the original dataset contained a large number of examples in both the federal US and Californian test splits, so we have reduced each split to 500 randomly selected examples.
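A minimal sketch of how such a reduction could be made reproducible is shown below. The fixed seed value and the `subsample` helper are assumptions for illustration; the PR does not state how the 500 examples were drawn.

```python
import random
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def subsample(examples: Sequence[T], k: int = 500, seed: int = 42) -> List[T]:
    """Return k examples drawn without replacement, using a fixed seed (assumed value)."""
    if len(examples) <= k:
        # Nothing to reduce: keep the split as it is.
        return list(examples)
    rng = random.Random(seed)  # local RNG so the global random state is untouched
    return rng.sample(examples, k)


# Stand-in for a large test split; integers replace real examples here.
full_split = list(range(3000))
reduced_split = subsample(full_split)
```

Using a dedicated `random.Random` instance with a fixed seed means the same subset is selected on every run, which keeps benchmark scores comparable.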

Checklist

  • I have outlined why this dataset is filling an existing gap in mteb
  • I have tested that the dataset runs with the mteb package.
  • I have run the following models on the task (adding the results to the PR). These can be run using the mteb run -m {model_name} -t {task_name} command.
  • I have checked that the performance is neither trivial (both models gain close to perfect scores) nor random (both models gain close to random scores).
  • I have considered the size of the dataset and reduced it if it is too big (2048 examples is typically large enough for most tasks)

@umarbutler umarbutler left a comment

I can confirm that I have reviewed and approve this PR on behalf of Isaacus.

Contributor

@KennethEnevoldsen KennethEnevoldsen left a comment

Hi, thanks for the PR! Generally looks good. However, the tasks are not imported, which means that they will not be fetchable with mteb.get_task("{taskname}").

abdurrahmanbutler and others added 2 commits July 28, 2025 10:10
Co-authored-by: Kenneth Enevoldsen <kenevoldsen@pm.me>
Co-authored-by: Kenneth Enevoldsen <kenevoldsen@pm.me>
@abdurrahmanbutler
Contributor Author

Thanks for the feedback. I've approved the changes @KennethEnevoldsen

@isaac-chung
Collaborator

@abdurrahmanbutler you needed to add imports to the __init__.py file, so that the task shows up when you run:

mteb available_tasks | grep BillSum

@isaac-chung isaac-chung enabled auto-merge (squash) July 30, 2025 07:06
@isaac-chung isaac-chung merged commit 007d19f into embeddings-benchmark:main Jul 30, 2025
9 checks passed
4 participants