@ffuugoo ffuugoo commented Jul 7, 2025

Follow-up to #6778.

After #6778, we initialize custom shard replicas in the Initializing state, so when Qdrant returns a response to the client, it does not yet guarantee that the custom shard is ready to accept write requests. There is a small delay while shards switch from Initializing to Active and become fully available.

This PR adds an explicit wait_for_state check, so that we ensure all replicas are Active before returning a response to the client. I've also slightly tweaked the test_shard_consistency test, so that it covers this check as part of the existing test.
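The idea of the check can be sketched as a small polling helper (a hypothetical, synchronous stand-in for the real async `wait_for_state`; the function name and shape here are illustrative, not Qdrant's actual API):

```rust
use std::time::{Duration, Instant};

// Hypothetical polling helper: succeed once `all_active` reports true,
// or fail once `timeout` has elapsed.
fn wait_for_state(all_active: impl Fn() -> bool, timeout: Duration) -> Result<(), String> {
    let start = Instant::now();
    loop {
        if all_active() {
            return Ok(());
        }
        if start.elapsed() >= timeout {
            return Err("timed out waiting for replicas to become Active".to_string());
        }
        // Back off briefly between checks instead of spinning.
        std::thread::sleep(Duration::from_millis(10));
    }
}
```

In the PR, `create_shard_key` only returns to the client after such a wait over all replicas of the new shard key succeeds.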

All Submissions:

  • Contributions should target the dev branch. Did you create your branch from dev?
  • Have you followed the guidelines in our Contributing document?
  • Have you checked to ensure there aren't other open Pull Requests for the same update/change?

New Feature Submissions:

  1. Does your submission pass tests?
  2. Have you formatted your code locally using the cargo +nightly fmt --all command prior to submission?
  3. Have you checked your code using the cargo clippy --all --all-features command?

Changes to Core Features:

  • Have you added an explanation of what your changes do and why you'd like us to include them?
  • Have you written new tests for your core changes, as applicable?
  • Have you successfully run tests with your changes locally?

@ffuugoo ffuugoo force-pushed the custom-shard-await-active branch from 8db9214 to c105990 Compare July 7, 2025 13:43
ffuugoo added 2 commits July 7, 2025 15:47
- so that it creates custom shards in `Initializing` state instead of
  `Active`
- and in doing so, also covers the check that `create_shard_key` request
  waits for replica activation
@ffuugoo ffuugoo marked this pull request as ready for review July 7, 2025 14:01
@ffuugoo ffuugoo requested review from timvisee and generall July 7, 2025 14:01


```rust
    &self,
    check: F,
    timeout: Duration,
) -> impl Future<Output = CollectionResult<()>> + 'static
```
Member
What was the reason for this change?

Contributor Author

Without this change, wait_for was lifetime-bound to ReplicaSet, so you had to hold a lock on ReplicaSet while awaiting wait_for.

With this change, wait_for creates a future that is completely independent of ReplicaSet, so you can call wait_for, release the lock on ReplicaSet, and then await the wait_for future.
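The ownership trick can be shown with a synchronous sketch (hypothetical types; the real method returns an `impl Future + 'static`, but the idea is the same): clone the shared state up front so the returned value no longer borrows `&self`.

```rust
use std::sync::{Arc, Mutex};

// Hypothetical stand-ins for the replica set and its state.
#[derive(Clone, Copy, PartialEq, Debug)]
enum ReplicaState {
    Initializing,
    Active,
}

struct ReplicaSet {
    state: Arc<Mutex<ReplicaState>>,
}

impl ReplicaSet {
    // Clones the Arc up front, so the returned check owns everything it
    // needs and is 'static: the caller can drop its lock/reference to
    // ReplicaSet before running (or awaiting) the check.
    fn wait_for(&self) -> impl Fn() -> bool + 'static {
        let state = Arc::clone(&self.state);
        move || *state.lock().unwrap() == ReplicaState::Active
    }
}
```

In the async version the `move` closure becomes an `async move` block; either way, nothing in the returned value borrows from `&self`.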

```rust
if status {
    Ok(())
} else {
    Err(CollectionError::service_error(
```
Member

I checked the usages; it should be safe to convert this error to a timeout. Both service error and timeout are transient, and all the actual call sites either don't care or force-convert the error into a service error anyway.
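A sketch of what the suggested conversion could look like (the error type and function here are hypothetical simplifications, not Qdrant's actual `CollectionError`):

```rust
use std::time::Duration;

// Hypothetical error type mirroring the two transient variants discussed.
#[derive(Debug, PartialEq)]
enum CollectionError {
    ServiceError(String),
    Timeout(Duration),
}

// Report a timeout (rather than a generic service error) when the
// awaited condition never became true within the allotted time.
fn finish_wait(status: bool, waited: Duration) -> Result<(), CollectionError> {
    if status {
        Ok(())
    } else {
        Err(CollectionError::Timeout(waited))
    }
}
```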

Comment on lines +303 to +306
```rust
for replica_set in shard_holder.all_shards() {
    if replica_set.shard_key() != Some(&shard_key) {
        continue;
    }
```
Member

Shall we create a getter in the shard holder to which we pass the key? That way we keep the shard key selection logic inside the shard holder.

We have some methods related to shard keys there, to get the shard IDs related to a shard key for example. But for this we'd probably require a new one.
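Such a getter could look roughly like this (types and the method name are hypothetical, shaped only to illustrate keeping the key-selection logic inside the shard holder):

```rust
#[derive(Clone, PartialEq, Eq, Debug)]
struct ShardKey(String);

struct ReplicaSet {
    shard_key: Option<ShardKey>,
}

struct ShardHolder {
    shards: Vec<ReplicaSet>,
}

impl ShardHolder {
    // Hypothetical getter: callers pass the key and iterate matching
    // replica sets, instead of filtering all_shards() at the call site.
    fn replica_sets_by_key<'a>(
        &'a self,
        key: &'a ShardKey,
    ) -> impl Iterator<Item = &'a ReplicaSet> {
        self.shards
            .iter()
            .filter(move |rs| rs.shard_key.as_ref() == Some(key))
    }
}
```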

@ffuugoo ffuugoo merged commit 68a6536 into dev Jul 8, 2025
18 checks passed
@ffuugoo ffuugoo deleted the custom-shard-await-active branch July 8, 2025 10:18
generall added a commit that referenced this pull request Jul 17, 2025
Co-authored-by: generall <andrey@vasnetsov.com>
Co-authored-by: timvisee <tim@visee.me>