-
Notifications
You must be signed in to change notification settings - Fork 335
Description
Hello!
I'm hoping to use the compress package for packing bytes in our archive format for Dolt (https://github.com/dolthub/dolt). Our data is very amenable to custom dictionaries for small sets (5-50) of binary blobs which average about 4K in size.
We built our POC using the gozstd package which has Dictionary support: https://pkg.go.dev/github.com/valyala/gozstd
Now that I'm are convinced this is a good approach for us, I'm curious how we can move the dict.BuildZstdDict
function out of the experimental phase, as it's stated in the dict/README.md
. In particular, the README says it doesn't work well with small sets, and indeed I got a panic when using a set of 32 samples.
How far from complete do you think it is? How can I help get this tested and get the bugs ironed out? I have one local test case I can share with you if that's helpful.
FWIW, using the dictionary produced by valyala/gozstd, I can compress/decompress successfully with the compress/zstd package. So I think it's just a matter of generating a correct dict and we'll be able to move forward.
Thanks!