Improve rolling bloom filter performance and benchmark #7934

sipa · 2016-04-24T17:01:24Z

Added a benchmark for the rolling bloom filter (with parameters corresponding to the tx rejection cache), which showed that on average, adding+checking one item takes around 2.1us, but the refresh action that wipes all old generation items every 60000 iterations takes 65ms, which is very significant.

Thus, this patch also changes the implementation from one that stores 16 2-bit integers in uint32_t's, to one that stores the first bit of 64 2-bit integers in one uint64_t and the second bit in another. This allows for 450x faster refreshing (0.14ms) and 2.2x faster average adding+checking (0.93us).
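The bitsliced layout described above can be illustrated with a minimal sketch. This is not the code from the patch: `BitslicedCells`, `set`, `get` and `wipe` are hypothetical names chosen for illustration, and the sketch assumes generation numbers 1 to 3 with 0 meaning "empty". It shows why the refresh becomes cheap: one pass of word-wide XOR/OR/AND operations clears 64 cells at a time.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical container (illustration only): 64 two-bit cells per word pair.
// data[2k] holds the first (low) bit of each cell, data[2k+1] the second (high) bit.
struct BitslicedCells {
    std::vector<uint64_t> data;

    explicit BitslicedCells(std::size_t pairs) : data(2 * pairs, 0) {}

    // Store a 2-bit generation number (0..3) in the given cell.
    void set(std::size_t cell, unsigned gen) {
        const std::size_t lo = 2 * (cell / 64);
        const std::size_t hi = lo + 1;
        const unsigned bit = cell % 64;
        const uint64_t one = uint64_t{1} << bit;
        data[lo] = (data[lo] & ~one) | (uint64_t(gen & 1) << bit);
        data[hi] = (data[hi] & ~one) | (uint64_t((gen >> 1) & 1) << bit);
    }

    // Read the 2-bit generation number back.
    unsigned get(std::size_t cell) const {
        const std::size_t lo = 2 * (cell / 64);
        const unsigned bit = cell % 64;
        return unsigned((data[lo] >> bit) & 1) |
               (unsigned((data[lo + 1] >> bit) & 1) << 1);
    }

    // Reset every cell whose value equals gen to 0, 64 cells per iteration.
    void wipe(unsigned gen) {
        // All-ones if the corresponding bit of gen is set, all-zeros otherwise.
        const uint64_t m1 = uint64_t{0} - uint64_t(gen & 1);
        const uint64_t m2 = uint64_t{0} - uint64_t((gen >> 1) & 1);
        for (std::size_t p = 0; p < data.size(); p += 2) {
            // A keep bit is 0 only where both stored bits match gen.
            const uint64_t keep = (data[p] ^ m1) | (data[p + 1] ^ m2);
            data[p] &= keep;
            data[p + 1] &= keep;
        }
    }
};
```

With the packed layout (16 two-bit fields per uint32_t), a refresh has to compare and clear each field separately; in the sliced layout the comparison against the generation number is done for a whole word pair at once.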

All benchmarks done on an Intel Core i7-4800MQ CPU, running at 2.6 GHz, with binaries compiled with GCC 5.3.1.

dcousens · 2016-04-26T01:32:47Z

src/bloom.cpp

+        uint32_t h = RollingBloomHash(n, nTweak, vKey);
+        int bit = h & 0x3F;
+        uint32_t pos = (h >> 6) % data.size();
+        /* The lowest bit of pos is ignored, and set to zero first the first bit, and to one for the second. */


set to zero for ~~first~~ the first bit

dcousens · 2016-04-26T03:26:05Z

light utACK 67d44f8, didn't verify masking operations

This patch changes the implementation from one that stores 16 2-bit integers in one uint32_t, to one that stores the first bit of 64 2-bit integers in one uint64_t and the second bit in another. This allows for 450x faster refreshing and 2.2x faster average speed.

sipa · 2016-04-28T12:56:53Z

Addressed @dcousens's nits.

gmaxwell · 2016-04-28T16:02:11Z

utACK.

dcousens · 2016-04-30T04:40:16Z

utACK 1953c40

gmaxwell · 2016-05-05T11:17:46Z

ACK. (appears to work.)

1953c40 More efficient bitsliced rolling Bloom filter (Pieter Wuille)
aa62b68 Benchmark rolling bloom filter (Pieter Wuille)

laanwj · 2016-05-09T06:53:33Z

utACK 1953c40

rebroad · 2016-12-21T02:03:50Z

I'd like to understand this code - where do I start?

dcousens · 2016-12-21T02:09:35Z

src/bloom.cpp

-        uint32_t h = Hash(n, vKey);
-        put(h, nGeneration);
+        uint32_t h = RollingBloomHash(n, nTweak, vKey);
+        int bit = h & 0x3F;


I wonder if it is confusing alternating between 0x3f and 63 a few times in this code...
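For readers following the diff, here is a small sketch of how a 32-bit hash value is split into a bit offset and a word pair. `CellLocation` and `Locate` are hypothetical names, not part of the patch, and the sketch assumes the word array has an even, nonzero length. It also makes the point behind the remark above explicit: `h & 0x3F`, `h & 63` and `h % 64` all select the same low six bits.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical helper type (illustration only).
struct CellLocation {
    std::size_t first;   // even index: word holding the first bit
    std::size_t second;  // odd index: word holding the second bit
    int bit;             // bit offset used in both words, 0..63
};

// Assumes num_words is even and nonzero.
inline CellLocation Locate(uint32_t h, std::size_t num_words) {
    const int bit = h & 0x3F;  // low six bits; identical to h & 63 or h % 64
    const std::size_t pos = (h >> 6) % num_words;
    // The lowest bit of pos is ignored: cleared for the first word, set for the second.
    return {pos & ~std::size_t{1}, pos | 1, bit};
}
```

Because the lowest bit of `pos` is discarded, two adjacent values of `pos` map to the same word pair, so the first and second bit of a cell always sit next to each other in memory.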

laanwj · 2016-12-21T08:49:27Z

> I'd like to understand this code - where do I start?

In this case it is the theory that is important to understand. With that, the code is pretty straightforward. Google "Bloom filters". Most notably, the Wikipedia page about Bloom filters has a lot of references to the CS literature about various kinds of Bloom filters, and the article itself may give a basic understanding.
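As a starting point for the theory, a plain (non-rolling) Bloom filter can be sketched in a few lines. This is only an illustration and not the code in src/bloom.cpp: `SimpleBloom` is a hypothetical name, and `std::hash` with a salted second hash stands in for the hash functions the real implementation uses. The number of bits is assumed to be nonzero.

```cpp
#include <cstddef>
#include <functional>
#include <string>
#include <vector>

// Hypothetical minimal Bloom filter (illustration only).
class SimpleBloom {
public:
    // num_bits must be nonzero.
    SimpleBloom(std::size_t num_bits, unsigned num_hashes)
        : bits_(num_bits, false), num_hashes_(num_hashes) {}

    // Set one bit per hash function.
    void insert(const std::string& key) {
        for (unsigned i = 0; i < num_hashes_; ++i) bits_[index(key, i)] = true;
    }

    // True if the key may have been inserted, false if it definitely was not.
    bool contains(const std::string& key) const {
        for (unsigned i = 0; i < num_hashes_; ++i) {
            if (!bits_[index(key, i)]) return false;
        }
        return true;
    }

private:
    // Derive the i-th position from two base hashes (double hashing).
    std::size_t index(const std::string& key, unsigned i) const {
        const std::size_t h1 = std::hash<std::string>{}(key);
        const std::size_t h2 = std::hash<std::string>{}(key + '#');
        return (h1 + i * h2) % bits_.size();
    }

    std::vector<bool> bits_;
    unsigned num_hashes_;
};
```

A filter like this never reports a false negative, but it can report false positives and it cannot forget items. The rolling variant discussed in this PR replaces each bit with a small generation number so that old entries can be expired in bulk.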


Micro-benchmarking framework part 1

Cherry-picked from the following upstream PRs:
- bitcoin/bitcoin#6733
- bitcoin/bitcoin#6770
- bitcoin/bitcoin#6892
  - Excluding changes to `src/policy/policy.h` which we don't have yet.
- bitcoin/bitcoin#7934
  - Just the benchmark, not the performance improvements.
- bitcoin/bitcoin#8039
- bitcoin/bitcoin#8107
- bitcoin/bitcoin#8115
- bitcoin/bitcoin#8914
  - Required resolving several merge conflicts in code that had been refactored upstream. The changes were simple enough that I decided it was okay to impose merge conflicts on pulling in those refactors later.
- bitcoin/bitcoin#9200
- bitcoin/bitcoin#9202
  - Adds support for measuring CPU cycles, which is later removed in an upstream PR after the refactor. I am including it to reduce future merge conflicts.
- bitcoin/bitcoin#9281
  - Only changes to `src/bench/bench.cpp`
- bitcoin/bitcoin#9498
- bitcoin/bitcoin#9712
- bitcoin/bitcoin#9547
- bitcoin/bitcoin#9505
  - Just the benchmark, not the performance improvements.
- bitcoin/bitcoin#9792
  - Just the benchmark, not the performance improvements.
- bitcoin/bitcoin#10272
- bitcoin/bitcoin#10395
  - Only changes to `src/bench/`
- bitcoin/bitcoin#10735
  - Only changes to `src/bench/base58.cpp`
- bitcoin/bitcoin#10963
- bitcoin/bitcoin#11303
  - Only the benchmark backend change.
- bitcoin/bitcoin#11562
- bitcoin/bitcoin#11646
- bitcoin/bitcoin#11654

This pulls in all changes to the micro-benchmark framework prior to December 2017, when it was rewritten. The rewrite depends on other upstream PRs we have not pulled in yet.

This does not pull in all benchmarks prior to December 2017. It leaves out benchmarks that either test code we do not have yet (except for the `FastRandomContext` refactor, which I decided to pull in), or would require rewrites to work with our changes to the codebase.

Backport bloom filter improvements

Cherry-picked from the following upstream PRs:
- bitcoin/bitcoin#7113
- bitcoin/bitcoin#7818
  - Only the second commit (to resolve conflicts).
- bitcoin/bitcoin#7934
- bitcoin/bitcoin#8655
  - Partial backport to help resolve conflicts.
- bitcoin/bitcoin#9060
- bitcoin/bitcoin#9223
- bitcoin/bitcoin#9644
  - Partial backport to help resolve conflicts.
- bitcoin/bitcoin#9916
- bitcoin/bitcoin#9750
- bitcoin/bitcoin#13176
- bitcoin/bitcoin#13948
- bitcoin/bitcoin#16073
- bitcoin/bitcoin#18670
- bitcoin/bitcoin#18806
  - Reveals upstream's covert fix for CVE-2013-5700.
- bitcoin/bitcoin#19968

sipa force-pushed the benchrollingbloom branch 2 times, most recently from 84d71ac to 67d44f8 Compare April 24, 2016 22:04

laanwj added the Resource usage label Apr 25, 2016

dcousens reviewed Apr 26, 2016

laanwj mentioned this pull request Apr 26, 2016

Add benchmarks to bench_bitcoin #7883

Closed

8 tasks

sipa added 2 commits April 28, 2016 14:56

Benchmark rolling bloom filter

aa62b68

sipa force-pushed the benchrollingbloom branch from 67d44f8 to 1953c40 Compare April 28, 2016 12:56

laanwj merged commit 1953c40 into bitcoin:master May 9, 2016

laanwj added a commit that referenced this pull request May 9, 2016

Merge #7934: Improve rolling bloom filter performance and benchmark

f17032f

1953c40 More efficient bitsliced rolling Bloom filter (Pieter Wuille)
aa62b68 Benchmark rolling bloom filter (Pieter Wuille)

dcousens reviewed Dec 21, 2016


dagurval mentioned this pull request Jan 15, 2018

Bloom filter updates bitcoinxt/bitcoinxt#297

Merged

sickpig mentioned this pull request May 31, 2018

[PORT] Rolling Bloom Filter new implementation. BitcoinUnlimited/BitcoinUnlimited#1112

Merged

str4d mentioned this pull request Feb 22, 2019

Micro-benchmarking framework part 1 zcash/zcash#3858

Merged

str4d mentioned this pull request Mar 5, 2021

Backport bloom filter improvements zcash/zcash#5026

Merged

bitcoin locked as resolved and limited conversation to collaborators Sep 8, 2021
