Fix enormous memory usage in Distance Matrix API on high sample size #6640

JojiiOfficial · 2025-06-04T16:30:34Z

Fixes extremely high memory usage in Distance Matrix API on certain datasets with a large sample rate (>= ~50k).

Dev:

PR:

KShivendu

Nice! Can you please share script to repro the bug/results? :)

agourlay · 2025-06-05T08:10:32Z

lib/segment/src/types.rs

@@ -2608,7 +2609,7 @@ impl From<JsonPath> for IsEmptyCondition {
 #[derive(Debug, Deserialize, Serialize, JsonSchema, Clone, PartialEq, Eq)]
 pub struct HasIdCondition {
    #[schemars(schema_with = "HashSet::<PointIdType>::json_schema")]
-    pub has_id: AHashSet<PointIdType>,


Why not always have an Arc?

I think the answer lies here:

/// Threshold determining when to use an `Arc` in `HasIdCondition` if the condition includes many points. /// Since we're cloning filters quite a lot, using an Arc for larger conditions reduces risk of memory leaks /// and potentially improves performance in some places. const HAS_ID_CONDITION_ARC_THRESHOLD: usize = 1_000;

meaning, a balance between performance and memory usage.

Though, I'm also curious how always using an Arc behaves. @JojiiOfficial Would it be possible to benchmark it?

Here is a comparison between cloning raw+arc and creating an arc using Arc::new() of the same hashset used in each iteration.

I suggest that we set the HAS_ID_CONDITION_ARC_THRESHOLD to something around 60 (probably 64 due to power of 2 allocation logic of rust) to not introduce overhead in places where we might create a lot of small filters without much cloning. Otherwise we might introduce an unexpected performance regression.

Wdyt?

coszio · 2025-06-05T15:27:59Z

lib/segment/src/utils/maybe_arc.rs

+    Arc(Arc<T>),
+}
+
+impl<T> MaybeArc<T> {


A nice to have would be to choose arc/no_arc automatically based on the size_of_value() of the inner value as a constructor function of this type.

Based on the provided chart, and considering ExtendedPointId is 24 bytes, the threshold could be around 1536 bytes (24 * 64)

size_of_value doesn't work with HashSet though. When using MaybeArc wrapped around HashMaps or Vec, size_of_value would only return the stack size of the type, which is always constant regardless of the amount of items.
Therefore we can only apply this heuristic at each usage of MaybeArc instead inside a constructor, sadly.

…6640) * Fix memory leak in distance API * Add tests

Fix memory leak in distance API

fc0cc59

JojiiOfficial force-pushed the fix_memory_leak_in_distance_matrix branch from c30253d to df24f78 Compare June 4, 2025 16:32

qdrant deleted a comment from coderabbitai bot Jun 4, 2025

JojiiOfficial requested review from timvisee, coszio, generall and agourlay June 4, 2025 16:33

This comment was marked as resolved.

Sign in to view

JojiiOfficial force-pushed the fix_memory_leak_in_distance_matrix branch from df24f78 to 2beb70e Compare June 4, 2025 16:36

Add tests

8e0cbf0

JojiiOfficial force-pushed the fix_memory_leak_in_distance_matrix branch from 2beb70e to 8e0cbf0 Compare June 4, 2025 16:37

This comment was marked as resolved.

Sign in to view

KShivendu reviewed Jun 4, 2025

View reviewed changes

github-actions bot mentioned this pull request Jun 4, 2025

Flaky test tests::snapshot_test::test_snapshot_collection_listener #5218

Open

JojiiOfficial changed the title ~~Fix memory leak in distance matrix~~ Fix enormous memory usage in Distance Matrix API on high sample size Jun 5, 2025

agourlay reviewed Jun 5, 2025

View reviewed changes

generall approved these changes Jun 5, 2025

View reviewed changes

coszio approved these changes Jun 5, 2025

View reviewed changes

coszio reviewed Jun 5, 2025

View reviewed changes

JojiiOfficial merged commit 64364f3 into dev Jun 5, 2025
17 checks passed

JojiiOfficial deleted the fix_memory_leak_in_distance_matrix branch June 5, 2025 15:46

generall pushed a commit that referenced this pull request Jul 17, 2025

Fix enormous memory usage in Distance Matrix API on high sample size (#…

b5fe0e7

…6640) * Fix memory leak in distance API * Add tests

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix enormous memory usage in Distance Matrix API on high sample size #6640

Fix enormous memory usage in Distance Matrix API on high sample size #6640

Uh oh!

JojiiOfficial commented Jun 4, 2025 •

edited

Loading

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

KShivendu left a comment •

edited

Loading

Uh oh!

agourlay Jun 5, 2025

Uh oh!

timvisee Jun 5, 2025

Uh oh!

JojiiOfficial Jun 5, 2025 •

edited

Loading

Uh oh!

coszio Jun 5, 2025

Uh oh!

JojiiOfficial Jun 5, 2025

Uh oh!

Uh oh!

Uh oh!

Fix enormous memory usage in Distance Matrix API on high sample size #6640

Fix enormous memory usage in Distance Matrix API on high sample size #6640

Uh oh!

Conversation

JojiiOfficial commented Jun 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

KShivendu left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

agourlay Jun 5, 2025

Choose a reason for hiding this comment

Uh oh!

timvisee Jun 5, 2025

Choose a reason for hiding this comment

Uh oh!

JojiiOfficial Jun 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

coszio Jun 5, 2025

Choose a reason for hiding this comment

Uh oh!

JojiiOfficial Jun 5, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

JojiiOfficial commented Jun 4, 2025 •

edited

Loading

KShivendu left a comment •

edited

Loading

JojiiOfficial Jun 5, 2025 •

edited

Loading