Conversation

JojiiOfficial (Contributor) commented on Jul 17, 2025

Depends on #6891

Adds configuration for BM25 vectorizer "models" in inference. An example model named bm25 is also added to the development config.
Highlights:

  • Supports multiple models with different configs
  • Reuses our tokenizers, with full customization support (see the sketch below)
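
A hedged sketch of what such a model definition could look like. Only `custom_models` (named in the discussion below) and the example model name `bm25` come from this PR; `type`, `k`, `b`, and the tokenizer fields are illustrative stand-ins based on the standard BM25 parameters and Qdrant's existing full-text tokenizer options:

```yaml
inference:
  # Sketch only: the field names under custom_models are assumptions, not the PR's schema.
  custom_models:
    bm25:                  # example model shipped in the development config
      type: bm25
      k: 1.2               # BM25 term-frequency saturation (common default)
      b: 0.75               # BM25 length normalization (common default)
      tokenizer:           # reuses Qdrant's tokenizers, fully customizable
        type: word
        lowercase: true
    bm25-paragraphs:       # a second model with a different config
      type: bm25
      tokenizer:
        type: multilingual
```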

@JojiiOfficial JojiiOfficial marked this pull request as ready for review July 18, 2025 08:49
@JojiiOfficial JojiiOfficial changed the title from "[WIP] Add BM25 options to inference config" to "Add BM25 options to inference config" on Jul 18, 2025
```diff
@@ -7,6 +7,36 @@ inference:
   address: "http://localhost:2114/api/v1/infer"
   timeout: 10
   token: "98eb568c-dea5-4347-a3a5-583478983bb9"
+  # Define custom models/vectorizers, which are handled directly in Qdrant. These do not require an `address` to be configured.
```
Contributor:

Maybe they can, but it is up to the specific model params. In the case of BM25 it is not needed because it is computed in Qdrant directly.
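
To illustrate the distinction, a minimal sketch contrasting the two cases (the remote model and all of its fields are hypothetical; only `address` appears in the config above):

```yaml
custom_models:
  # Hypothetical remote model: would need an address for the external service.
  my-remote-embedder:
    address: "http://localhost:9000/api/v1/infer"
  # BM25 is computed inside Qdrant itself, so no address is required.
  bm25:
    type: bm25
```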

JojiiOfficial (Author):

What do you mean? custom_models is supposed to handle only "models" that are handled directly in Qdrant, like BM25.

Contributor:

I was thinking that maybe, in the future, users might be able to plug in their own embedding services via a custom config like this. In that case another default namespace might be more fitting, like bm25/bm25-for-paragraphs or native/bm25 (sketched below).
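
The two suggested namespaces, sketched against the same assumed config shape as above:

```yaml
custom_models:
  # Option A: namespace by model family
  bm25/bm25-for-paragraphs:
    type: bm25
  # Option B: a "native" namespace for models computed inside Qdrant
  native/bm25:
    type: bm25
```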

@JojiiOfficial JojiiOfficial requested a review from coszio July 23, 2025 07:52
generall (Member) left a comment:

I am not convinced the inference config is the right place for these settings. Inference settings are global, while we would want this to be configurable per collection.

I propose to postpone this change and make it part of custom collection parameters instead.
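
For contrast, a rough sketch of the per-collection alternative being proposed; every field name here is an assumption, and the actual design was picked up in #6939 (see below):

```yaml
# Hypothetical custom collection parameters, set when creating a collection
# rather than in the global inference config.
collections:
  my-collection:
    custom_params:
      vectorizers:
        bm25:
          type: bm25
```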

JojiiOfficial (Author):

Closing in favor of #6939.
