Conversation

@JojiiOfficial (Contributor) commented Jul 25, 2025

Supersedes #6893 and #6904
Depends on #6891
Implements BM25 by parsing the inference object's options.

Request example

A possible request could look like this:

{
  "query": {
    "model": "qdrant/bm25",  // Must be set but value can be arbitrary.
    "text": "Some input to embedd with bm25",
    "options": {
      "use_bm25": true,        // Required to parse this options as BM25 configuration.
      "k": 1.0,                // Optional hyperparameters. Same for `b` and `avg_len`.
      "tokenizer": "word",     // Optional 
      "lowercase": true       // Optional
    }
  },
  "using": "sparse"
}
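Internally, these options are deserialized into a BM25 configuration. A minimal sketch of that idea, assuming serde derives; the type, field, and variant names below are illustrative and mirror the request example, not the PR's actual definitions:

use serde::Deserialize;
use serde_json::Value;

// Illustrative only: the variants match the tokenizers named in the
// error message under "Error handling" below.
#[derive(Debug, Deserialize)]
#[serde(rename_all = "lowercase")]
enum Tokenizer {
    Prefix,
    Whitespace,
    Word,
    Multilingual,
}

#[derive(Debug, Deserialize)]
struct Bm25Config {
    use_bm25: bool,
    k: Option<f32>,
    b: Option<f32>,
    avg_len: Option<f32>,
    tokenizer: Option<Tokenizer>,
    lowercase: Option<bool>,
}

fn parse_bm25(options: &Value) -> Result<Bm25Config, serde_json::Error> {
    // An unknown tokenizer such as "ord" fails deserialization with an
    // "unknown variant" error like the one shown below.
    serde_json::from_value(options.clone())
}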

Error handling

If the option "use_bm25" is enabled but the configuration is invalid, a detailed error is returned:

{
  "error": "Wrong input: Invalid BM25 config: Error(\"unknown variant `ord`, expected one of `prefix`, `whitespace`, `word`, `multilingual`\", line: 0, column: 0)"
}

Test coverage

There are also tests for the new BM25 and remote-inference logic that cover almost all of infer() and >97% of bm25.rs. "Almost", because error-returning paths are not covered.
These tests mock an inference server and ensure that we correctly merge BM25 and remote items together.

.collect();

// Create an HTTP mock
let mock = server
Contributor Author (@JojiiOfficial):

Mocking an inference server here to fully end-to-end test the logic of this PR.
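A hedged sketch of that pattern, assuming the httpmock crate (plus reqwest with the blocking and json features for the client side); the endpoint path and response shape are illustrative, not the PR's actual test code:

use httpmock::prelude::*;
use serde_json::json;

#[test]
fn inference_server_can_be_mocked() {
    // Stand-in for the remote inference service.
    let server = MockServer::start();

    // Any POST to this (illustrative) path returns a fixed embedding.
    let embed_mock = server.mock(|when, then| {
        when.method(POST).path("/embed");
        then.status(200)
            .json_body(json!({ "embeddings": [[0.1, 0.2, 0.3]] }));
    });

    // A real test would point the inference client at server.base_url(),
    // run infer(), and assert that locally computed BM25 vectors and the
    // mocked remote results are merged in the right order.
    let response = reqwest::blocking::Client::new()
        .post(server.url("/embed"))
        .json(&json!({ "text": "some input" }))
        .send()
        .unwrap();

    assert_eq!(response.status(), 200);
    embed_mock.assert(); // the mocked endpoint was hit exactly once
}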

return Ok(None);
};

if options.get("use_bm25") != Some(&Value::Bool(true))
Member:

Why is this here, and why does it have hard-coded strings?

Contributor Author (@JojiiOfficial, Jul 30, 2025):

You're right, it should be a constant instead of being hard-coded inline.

I think it's better to explicitly state that we want to use BM25, instead of "guessing" and always trying to parse the options into Bm25Config. That's why we have this additional check when reading the BM25 configuration from the input's options.

Alternatively, we could rename it to something like "custom_model" or "local_model" and check the value against "bm25". That would make it more extensible for other 'local' model implementations in the future.
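A minimal sketch of the constant-based variant discussed here; the constant and function names are illustrative:

use serde_json::{Map, Value};

// Illustrative constant replacing the hard-coded string.
const USE_BM25_KEY: &str = "use_bm25";

/// Returns true only if the options explicitly opt in to BM25.
fn is_bm25_requested(options: &Map<String, Value>) -> bool {
    options.get(USE_BM25_KEY) == Some(&Value::Bool(true))
}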

return VectorPersisted::empty_sparse();
}

let indices: Vec<u32> = tokens
Member:

DimId?

Contributor Author (@JojiiOfficial):

We don't have access to DimId and DimWeight in src/common, and I don't think we should add a dependency just for a type alias.
If you prefer having some sort of alias here, I can add a new one, but I think the semantics are already clear from the variable names and the function's context.

The same applies to the other place in line 103.
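For illustration, a purely local alias as discussed would be a two-liner; this assumes the conventional definitions rather than importing them:

// Hypothetical local aliases, mirroring the segment-side types.
type DimId = u32;
type DimWeight = f32;

// The quoted snippets would then read:
//   let indices: Vec<DimId> = ...;
//   let values: Vec<DimWeight> = vec![1.0; indices.len()];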

.unique()
.collect();

let values: Vec<f32> = vec![1.0; indices.len()];
Member:

DimWeight ?

return Ok(None);
};

let Some(local_model_field) = options.get(LOCAL_MODEL_KEY) else {
Member:

I still don't understand why this is needed

Contributor Author (@JojiiOfficial, Jul 30, 2025):

If we add a different local 'model' next to BM25 that also has tokenizer configs, and a user only specifies those (so no BM25-specific parameters), we can't distinguish which model should be used.
This requires the user to specify which local model to use.

We could use a dedicated model name, e.g. 'local/bm25', to specify which local model to use. However, I thought hard-coding a specific model name could be confusing, especially if something like 'local/bm25' works but 'bm25' throws an error about some 'address' not being found. If you prefer that approach though, I'm totally fine with changing it.
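The commit list below notes that an enum eventually replaced multiple constants ("use enum instead of multiple constants"); a minimal sketch of that shape, with illustrative names:

use serde_json::{Map, Value};

const LOCAL_MODEL_KEY: &str = "local_model";

// Illustrative enum for dispatching between local model implementations;
// new local models become additional variants.
#[derive(Debug, PartialEq)]
enum LocalModel {
    Bm25,
}

/// Reads which local model, if any, the options explicitly request.
fn parse_local_model(options: &Map<String, Value>) -> Option<LocalModel> {
    match options.get(LOCAL_MODEL_KEY)?.as_str()? {
        "bm25" => Some(LocalModel::Bm25),
        _ => None,
    }
}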

@JojiiOfficial force-pushed the bm25_execution_impl_2 branch from 0689e42 to 1bcb324 on July 30, 2025 15:56
@JojiiOfficial requested a review from generall on July 30, 2025 15:57
@JojiiOfficial force-pushed the bm25_execution_impl_2 branch from 0c38fa9 to 82a78e5 on August 1, 2025 07:12
@generall merged commit c9f3626 into uncouple_tokenizer_and_textindex_params on Aug 3, 2025
15 checks passed
@generall deleted the bm25_execution_impl_2 branch on August 3, 2025 16:41
generall pushed a commit that referenced this pull request Aug 3, 2025
* Add Bm25

* Execute BM25 if config available

* cargo format

* Add mocked tests for inference and bm25

* Properly apply InferenceType

* Fix tests

* Review remarks

* Review remarks

* Add overwritten model name fix again

* use enum instead of multiple constants

* ensure handling all fields of InferenceInput in infer_local

---------

Co-authored-by: Luis Cossío <luis.cossio@outlook.com>
generall added a commit that referenced this pull request Aug 4, 2025
* Uncouple tokenizer and TextIndexParams

* Refactor token processing in `TokenizerConfig` (#6912)

* Refactor token processing in TokenizerConfig

* fix max length checking

* [Bm25] Execution implementation (#6939)

* Add Bm25

* Execute BM25 if config available

* cargo format

* Add mocked tests for inference and bm25

* Properly apply InferenceType

* Fix tests

* Review remarks

* Review remarks

* Add overwritten model name fix again

* use enum instead of multiple constants

* ensure handling all fields of InferenceInput in infer_local

---------

Co-authored-by: Luis Cossío <luis.cossio@outlook.com>

* review fixes

* fmt

* spell-check

* deduplicate code

---------

Co-authored-by: Luis Cossío <luis.cossio@qdrant.com>
Co-authored-by: Luis Cossío <luis.cossio@outlook.com>
Co-authored-by: Andrey Vasnetsov <andrey@vasnetsov.com>
timvisee pushed a commit that referenced this pull request Aug 11, 2025 (same commit message as the Aug 4 commit above)