-
-
Notifications
You must be signed in to change notification settings - Fork 855
Open
Labels
feature requestNew feature or requestNew feature or requestgood first issueGood for newcomersGood for newcomerspri/lowLow priority issueLow priority issuestatus/approvedThis issue is ready to be implementedThis issue is ready to be implemented
Description
Describe the feature you'd like
Hi,
binaryQuantized allows to keep only 1 and 0s in the embedding process, making the db way smaller and the search way faster. Depending on the model and dimensions the loss can actually be negligible.
In this example using binary embeddings uses 3% of the space (x32 compression) and retains 96% of the performance with a x25 speedup.
I'm not saying it should default to true but exposing the setting would be a good start!
Here's the doc
Describe the benefits this would bring to existing Karakeep users
Way faster search, way lower db size, usable on cheaper hardware.
Can the goal of this request already be achieved via other means?
No
Have you searched for an existing open/closed issue?
- I have searched for existing issues and none cover my fundamental request
Additional context
No response
Metadata
Metadata
Assignees
Labels
feature requestNew feature or requestNew feature or requestgood first issueGood for newcomersGood for newcomerspri/lowLow priority issueLow priority issuestatus/approvedThis issue is ready to be implementedThis issue is ready to be implemented