Skip to content

Conversation

JojiiOfficial
Copy link
Contributor

@JojiiOfficial JojiiOfficial commented May 8, 2025

Depends on #6487

Improves optimization when copying points from different segments into a new one.

Benchmark:

On the same machine with storage on a Hetzner Network Disk (5M points, 20=>10 segments, vectors on disk)
Dev: 200s
PR: 3s

SSH-FS mounted Hetzner Network Disk (100k points, 10=>5 segments, vectors on disk)
Dev: ~450s
PR: 240s

@generall
Copy link
Member

generall commented May 9, 2025

Confirming improvement on my HDD:

Dev

2025-05-09T09:46:08.203869Z
-
2025-05-09T09:45:23.945362Z
=
00:00:44.258507

PR

2025-05-09T09:57:06.075739Z
-
2025-05-09T09:56:36.425863Z
=
00:00:29.649876

@generall generall merged commit 19e5758 into batch-reader-for-segment-construction May 9, 2025
13 checks passed
@generall generall deleted the sequential-batch-reader-seg-construction branch May 9, 2025 11:02
generall added a commit that referenced this pull request May 9, 2025
* naive implementation of batched reader for segment construction

* Make reads sequential in BatchedVectorReader (#6508)

* Add sequential reads and improve BatchedVectorReader

* implement get_many_sequential

---------

Co-authored-by: generall <andrey@vasnetsov.com>

* implement get_many_sequential

---------

Co-authored-by: Jojii <15957865+JojiiOfficial@users.noreply.github.com>
generall added a commit that referenced this pull request May 22, 2025
* naive implementation of batched reader for segment construction

* Make reads sequential in BatchedVectorReader (#6508)

* Add sequential reads and improve BatchedVectorReader

* implement get_many_sequential

---------

Co-authored-by: generall <andrey@vasnetsov.com>

* implement get_many_sequential

---------

Co-authored-by: Jojii <15957865+JojiiOfficial@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants