Reduce usage of variable cpy on decompression #1226

Nicoshev · 2023-04-10T05:27:52Z

Hello, I hope you are doing well

I wanted to propose an additional code optimization for the decompression function:

The variable cpy is no longer used in the code paths that copy literals within the fast decompression loop.
This reduces pressure on CPU registers and shrinks some code branches.
On average, I measured slightly over 1% improvement in decompression speed on the Silesia corpus using an x64 CPU.

This optimization combines well with the one proposed in the PR "Optimize LZ4_memcpy_using_offset".
When both are applied together, the variable cpy is removed entirely from the fast decompression loop.
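
For readers without the diff at hand, here is a minimal sketch of the kind of change described above. It is not the actual patch: the function names are hypothetical, and plain memcpy stands in for the library's internal wild-copy routines. The "before" shape keeps the end-of-copy pointer in cpy, so it stays live across the copy; the "after" shape advances op directly. In functions this small a compiler will likely emit the same code for both, so any benefit would only show up inside the much larger decompression loop, where one fewer live variable can ease register allocation.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

typedef unsigned char BYTE;

/* Hypothetical "before" shape: the end of the copy is stored in cpy,
 * which remains live across the copy and is then assigned to op. */
static BYTE* copy_literals_with_cpy(BYTE* op, const BYTE** ipPtr, size_t length)
{
    BYTE* cpy = op + length;
    memcpy(op, *ipPtr, length);   /* stand-in for the real copy routine */
    *ipPtr += length;
    op = cpy;
    return op;
}

/* Hypothetical "after" shape: no cpy, op is advanced in place. */
static BYTE* copy_literals_without_cpy(BYTE* op, const BYTE** ipPtr, size_t length)
{
    memcpy(op, *ipPtr, length);   /* stand-in for the real copy routine */
    *ipPtr += length;
    op += length;
    return op;
}

/* Returns 1 when both variants produce identical output and cursors. */
static int literal_copy_variants_agree(void)
{
    const BYTE src[] = "literal run";
    size_t const length = sizeof(src) - 1;   /* exclude the trailing NUL */
    BYTE dstA[32] = {0};
    BYTE dstB[32] = {0};
    const BYTE* ipA = src;
    const BYTE* ipB = src;

    assert(length <= sizeof(dstA));

    BYTE* const opA = copy_literals_with_cpy(dstA, &ipA, length);
    BYTE* const opB = copy_literals_without_cpy(dstB, &ipB, length);

    return memcmp(dstA, dstB, sizeof(dstA)) == 0
        && (size_t)(opA - dstA) == length
        && (size_t)(opB - dstB) == length
        && ipA == ipB;
}
```

Calling literal_copy_variants_agree() checks that the two shapes are behaviorally equivalent, which is the property that makes the refactor safe.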

Regards,
Nicolas

Cyan4973 · 2023-04-25T00:59:05Z

I've been able to observe very small (<1%) gains on some platforms (M1 + clang, 9700k + gcc-11).
On x64 "Skylake-era" CPUs, performance fluctuations are a mess, sometimes increasing (clang) or decreasing (gcc-9) by a lot, depending on compiler version. This effect seems unrelated to this PR and is more a symptom of instruction alignment, an uncontrollable side effect to which this CPU generation is particularly sensitive.

In the end, I mostly like that this PR tends to make the code more readable, and it's a good enough reason.

Reduce usage of variable cpy on decompression

42aef71

Cyan4973 approved these changes Apr 25, 2023

Cyan4973 merged commit 76106cb into lz4:dev Aug 14, 2023

Nicoshev mentioned this pull request Sep 6, 2023

Optimize reading of length variables #1223

Open

Nicoshev deleted the reduce_cpy_variable_usage branch January 14, 2024 18:51

moonfruit mentioned this pull request Jul 22, 2024

lz4 1.10.0 Homebrew/homebrew-core#178056

Merged
