Skip to content

Conversation

lnkuiper
Copy link
Contributor

@lnkuiper lnkuiper commented Oct 7, 2024

As the title explains, this PR implements DELTA_BINARY_PACKED compression for the Parquet writer. I've reused our own bitpacking code to achieve this. I've reworked the reader to also use our own bitpacking code. The compression ratio for integral columns should be much better now.

I've added this on a branch where I was enabling ARM Neon SIMD instructions for Snappy on AArch64. I found out that Neon is always available/enabled on AArch64! This means we can enable it in Snappy to enjoy faster (de-)compression on this CPU architecture. For x86 it's much harder to enable SIMD, as it is non-portable to older CPUs. ARM is relatively young in that sense, and we can safely enable it.

@lnkuiper
Copy link
Contributor Author

lnkuiper commented Oct 8, 2024

Regression test failures are unrelated

@Mytherin Mytherin merged commit 5dd4deb into duckdb:feature Oct 21, 2024
41 of 42 checks passed
@Mytherin
Copy link
Collaborator

Thanks! Looks great

@lnkuiper lnkuiper deleted the snappy_stuff branch April 14, 2025 09:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants