dataset: add BillSum datasets #2943

abdurrahmanbutler · 2025-07-23T04:03:57Z

Hi,
I’m submitting this pull request to push the Californian and US splits of the BillSum dataset to MTEB.

BillSum is a dataset created by FiscalNote for the purposes of training and evaluating models capable of summarizing federal and state legislation.

We have reframed the problem in terms of the retrieval of bills based on their summaries, making our reformatted datasets suitable for the evaluation of legal information retrieval models.

We want to improve the coverage of legal domain tasks on MTEB and we believe this dataset will contribute to increasing the diversity and difficulty of MTEB.

This pull request is being submitted courtesy of Isaacus, a legal AI research company.

You may find the original dataset here:
https://huggingface.co/datasets/FiscalNote/billsum

Note that the original dataset contained a large number of examples in both the federal US and Californian test splits and so, we have reduced both splits to 500 randomly selected examples.

Checklist

I have outlined why this dataset is filling an existing gap in mteb
I have tested that the dataset runs with the mteb package.
I have run the following models on the task (adding the results to the pr). These can be run using the mteb run -m {model_name} -t {task_name} command.
I have checked that the performance is neither trivial (both models gain close to perfect scores) nor random (both models gain close to random scores).
I have considered the size of the dataset and reduced it if it is too big (2048 examples is typically large enough for most tasks)

mteb/tasks/Retrieval/eng/BillSumCA.py

mteb/tasks/Retrieval/eng/BillSumUS.py

umarbutler

I can confirm that I have reviewed and approve this PR on behalf of Isaacus.

KennethEnevoldsen

Hi, thanks for the PR! Generally looks good. However, the tasks are not imported, which means that it will not be fetchable with mteb.get_task("{taskname}")

mteb/tasks/Retrieval/eng/BillSumCA.py

mteb/tasks/Retrieval/eng/BillSumUS.py

Co-authored-by: Kenneth Enevoldsen <kenevoldsen@pm.me>

abdurrahmanbutler · 2025-07-28T00:12:35Z

Thanks for the feedback. I've approved the changes @KennethEnevoldsen

isaac-chung · 2025-07-30T07:06:32Z

@abdurrahmanbutler you needed to add imports to the __init__.py file, so that the task shows up when you run

mteb available_tasks | grep BillSum

abdurrahmanbutler added 2 commits July 23, 2025 13:17

Added BillSum datasets

36e0fa4

fixed billsumca

89c5983

abdurrahmanbutler mentioned this pull request Jul 23, 2025

Results for BillSum datasets embeddings-benchmark/results#243

Open

6 tasks

umarbutler reviewed Jul 23, 2025

View reviewed changes

mteb/tasks/Retrieval/eng/BillSumCA.py Outdated Show resolved Hide resolved

mteb/tasks/Retrieval/eng/BillSumUS.py Outdated Show resolved Hide resolved

abdurrahmanbutler added 2 commits July 23, 2025 14:42

Updated BillSumCA description

f7ac7c3

Updated BillSumUS description

c0fa7ae

umarbutler approved these changes Jul 23, 2025

View reviewed changes

KennethEnevoldsen reviewed Jul 24, 2025

View reviewed changes

mteb/tasks/Retrieval/eng/BillSumCA.py Outdated Show resolved Hide resolved

mteb/tasks/Retrieval/eng/BillSumUS.py Outdated Show resolved Hide resolved

abdurrahmanbutler and others added 2 commits July 28, 2025 10:10

Update mteb/tasks/Retrieval/eng/BillSumCA.py

a5e295a

Co-authored-by: Kenneth Enevoldsen <kenevoldsen@pm.me>

Update mteb/tasks/Retrieval/eng/BillSumUS.py

0eb5471

Co-authored-by: Kenneth Enevoldsen <kenevoldsen@pm.me>

isaac-chung added 2 commits July 30, 2025 10:02

lint

f1c54a6

lint

db003ff

isaac-chung enabled auto-merge (squash) July 30, 2025 07:06

format citations

e614a73

isaac-chung merged commit 007d19f into embeddings-benchmark:main Jul 30, 2025
9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

dataset: add BillSum datasets #2943

dataset: add BillSum datasets #2943

Uh oh!

abdurrahmanbutler commented Jul 23, 2025

Uh oh!

Uh oh!

Uh oh!

umarbutler left a comment

Uh oh!

KennethEnevoldsen left a comment •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

abdurrahmanbutler commented Jul 28, 2025

Uh oh!

isaac-chung commented Jul 30, 2025

Uh oh!

Uh oh!

Uh oh!

dataset: add BillSum datasets #2943

dataset: add BillSum datasets #2943

Uh oh!

Conversation

abdurrahmanbutler commented Jul 23, 2025

Checklist

Uh oh!

Uh oh!

Uh oh!

umarbutler left a comment

Choose a reason for hiding this comment

Uh oh!

KennethEnevoldsen left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

abdurrahmanbutler commented Jul 28, 2025

Uh oh!

isaac-chung commented Jul 30, 2025

Uh oh!

Uh oh!

Uh oh!

KennethEnevoldsen left a comment •

edited

Loading