Replace cluster linearization algorithm with SFL #32545

sipa · 2025-05-17T23:24:56Z

Part of cluster mempool: #30289. Based on #30605.

This replaces the cluster linearization algorithm introduced in #30126 and #30286 (a combination of LIMO with candidate-set search), with a completely different algorithm: spanning-forest linearization, which appears to have much better performance for hard clusters. See this post for a comparison between various linearization algorithms, and this post for benchmarks comparing them. Replaying historical mempool data on it shows that it can effectively linearize every observed cluster up to 64 transactions optimally within tens of microseconds, though pathological examples can be created which take longer.

The algorithm is effectively a very specialized version of the simplex algorithm to the problem of finding high-feerate topological subsets of clusters, but modified to find all consecutive such subsets concurrently rather than just the first one. See the post above for how it is related.

It represents the cluster as partitioned into a set of chunks, each with a spanning tree of its internal dependencies connecting the transactions. Randomized improvements are made by selecting dependencies to add and remove to these spanning trees, merging and splitting chunks, until no more improvements are possible. Like simplex, it does not necessarily make progress in every step, and thus has no upper bound on its runtime, but randomization makes long runtimes very unlikely, and additionally makes it hard to adversarially construct clusters in which the algorithm reliably makes bad choices.

DrahtBot · 2025-05-17T23:24:59Z

The following sections might be updated with supplementary metadata relevant to reviewers and maintainers.

Code Coverage & Benchmarks

For details see: https://corecheck.dev/bitcoin/bitcoin/pulls/32545.

Reviews

See the guideline for information on the review process.

Type	Reviewers
Concept ACK	jonatack

If your review is incorrectly listed, please react with 👎 to this comment and the bot will ignore it on the next update.

Conflicts

No conflicts as of last run.

DrahtBot · 2025-05-17T23:40:01Z

🚧 At least one of the CI tasks failed.
_{Task ARM, unit tests, no functional tests: https://github.com/bitcoin/bitcoin/runs/42417371062}
_{LLM reason (✨ experimental): The CI failure is due to a build error during the compilation of txgraph.cpp.o.}

Hints

Try to run the tests locally, according to the documentation. However, a CI failure may still
happen due to a number of reasons, for example:

Possibly due to a silent merge conflict (the changes in this pull request being
incompatible with the current code in the target branch). If so, make sure to rebase on the latest
commit of the target branch.
A sanitizer issue, which can only be found by compiling with the sanitizer and running the
affected test.
An intermittent issue.

Leave a comment here, if you need help tracking down a confusing failure.

DrahtBot · 2025-05-19T03:39:17Z

🚧 At least one of the CI tasks failed.
_{Task ARM, unit tests, no functional tests: https://github.com/bitcoin/bitcoin/runs/42447412610}
_{LLM reason (✨ experimental): The CI failure is due to a failed CTest test: cluster_linearize_tests.}

Hints

Try to run the tests locally, according to the documentation. However, a CI failure may still
happen due to a number of reasons, for example:

Possibly due to a silent merge conflict (the changes in this pull request being
incompatible with the current code in the target branch). If so, make sure to rebase on the latest
commit of the target branch.
A sanitizer issue, which can only be found by compiling with the sanitizer and running the
affected test.
An intermittent issue.

Leave a comment here, if you need help tracking down a confusing failure.

jonatack

Concept ACK

jonatack · 2025-05-22T16:09:08Z

src/cluster_linearize.h

@@ -539,492 +555,651 @@ class LinearizationChunking
    }
 };

-/** Class encapsulating the state needed to find the best remaining ancestor set.
+/** Class to represent the internal state of the spanning-forest linearization algorithm.


Appreciate the excellent doxygen documentation here.

DrahtBot · 2025-05-25T16:44:44Z

🚧 At least one of the CI tasks failed.
_{Task CentOS, depends, gui: https://github.com/bitcoin/bitcoin/runs/42858330826}
_{LLM reason (✨ experimental): The CI failure is due to assertion failures within the cluster_linearize_tests and bench_sanity_check tests.}

Hints

Try to run the tests locally, according to the documentation. However, a CI failure may still
happen due to a number of reasons, for example:

Possibly due to a silent merge conflict (the changes in this pull request being
incompatible with the current code in the target branch). If so, make sure to rebase on the latest
commit of the target branch.
A sanitizer issue, which can only be found by compiling with the sanitizer and running the
affected test.
An intermittent issue.

Leave a comment here, if you need help tracking down a confusing failure.

l0rinc · 2025-06-04T13:22:05Z

Also, is this with 32-bit or 64-bit userspace?

64-bit userspace (AArch64)

$ file bitcoind
bitcoind: ELF 64-bit LSB pie executable, ARM aarch64, version 1 (GNU/Linux), dynamically linked, interpreter /lib/ld-linux-aarch64.so.1, BuildID[sha1]=7e059ec01f7460042910ca4ed15270382269c9d5, for GNU/Linux 3.7.0, with debug_info, not stripped

additional details

$ getconf LONG_BIT
64
$ dpkg --print-architecture
arm64
$ uname -m
aarch64

sipa · 2025-06-12T17:28:34Z

Rebased on top of #30605.

DrahtBot · 2025-08-09T04:45:26Z

🚧 At least one of the CI tasks failed.
_{Task no wallet, libbitcoinkernel: https://github.com/bitcoin/bitcoin/runs/47723469045}
_{LLM reason (✨ experimental): The build failed due to compilation errors in cluster_linearize.cpp caused by incorrect member access of a tuple, leading to a build error.}

Hints

Try to run the tests locally, according to the documentation. However, a CI failure may still
happen due to a number of reasons, for example:

Possibly due to a silent merge conflict (the changes in this pull request being
incompatible with the current code in the target branch). If so, make sure to rebase on the latest
commit of the target branch.
A sanitizer issue, which can only be found by compiling with the sanitizer and running the
affected test.
An intermittent issue.

Leave a comment here, if you need help tracking down a confusing failure.

…ature) This replaces the existing LIMO linearization algorithm (which internally uses ancestor set finding and candidate set finding) with the much more performant spanning-forest linearization algorithm. See https://delvingbitcoin.org/t/spanning-forest-cluster-linearization/1419

…nup) This removes the candidate set finding classes, as well as related tests and benchmarks for them.

…imization) This avoids the need for a loop over all parents of a transaction while walking a chunk, and removes the need to store the set of parent dependencies explicitly.

This introduces the notion of gain to the SFL algorithm. Given a chunk c, an active dependency d in it, and the chunks (t, b) that c would split into if d were deactivated, the gain is defined as either (they are equivalent): (feerate(t) - feerate(b)) * size(t) * size(b) fee(t) * size(b) - fee(b) * size(t) It happens to also be equal to these: (feerate(t) - feerate(c)) * size(t) * size(c) fee(t) * size(c) - fee(c) * size(t) Its relevance is that this metric is proportional to a lower bound on the area under the fee-size diagram which would be gained IF a deactivation of d does not result in a self-merge of t and b again. This commit adds logic to find, within each chunk, the dependency with the highest gain. In benchmarks, this appears to be a very good heuristic for deciding which splits are worth making.

…r (optimization) This reduces the number of allocations required inside the SFL algorithm, and works because the number of dependencies per transaction is at most n-1. To minimize the memory usage from this pre-allocation (which might impact memory locality), change the data type of DepIdx from uint32_t to uint8_t or uint16_t when possible.

…ptimization) Within the per-transaction child dependency list, keep the active ones before all inactive ones. This improves the complexity over iterating over active dependencies from O(m) to O(n), as at most n-1 dependencies can be active within any given chunk at any given time.

This distributes the work over the various chunks fairly, and simultaneously avoids retrying chunks over and over again which are already known to be optimal.

Out of an abundance of caution that adversarially-constructed clusters might reliably result in bad chunk split decisions with the maximum-gain strategy, make every third consecutive attempt to split the same chunk use a random strategy instead.

We do not need to actually keep track of whether a dependency is active or not; it is implied by whether or not it appears within the active prefix of its parent's child_deps, and its child's parent_deps. Just remove setting and checking it.

This adds a rough estimate of algorithm runtime, so it can be interrupted if no solution is found in time. Due to inherent differences between platforms, this will not be extremely accurate, but it is preferable over directly measuring time for consistency.

After the normal optimization process finishes, and finds an optimal spanning forest, run a second process (while computation budget remains) to split chunks into minimal equal-feerate chunks. As a side-effect, this also guarantees that the optimal chunk order is deterministic.

sipa mentioned this pull request May 17, 2025

Cluster mempool tracking issue #30289

Open

22 tasks

sipa force-pushed the 202505_sfl branch from 7693795 to 2fb6a0e Compare May 17, 2025 23:38

DrahtBot added the CI failed label May 17, 2025

sipa force-pushed the 202505_sfl branch 2 times, most recently from 3b7477b to b920e76 Compare May 18, 2025 02:20

DrahtBot mentioned this pull request May 18, 2025

cluster mempool: add TxGraph work controls #32263

Merged

DrahtBot removed the CI failed label May 18, 2025

DrahtBot mentioned this pull request May 18, 2025

Cluster linearization: separate tests from tests-of-tests #30605

Merged

sipa force-pushed the 202505_sfl branch 2 times, most recently from 1c6bb72 to df589da Compare May 19, 2025 02:54

DrahtBot added the CI failed label May 19, 2025

sipa force-pushed the 202505_sfl branch 2 times, most recently from 7d5e4dc to 23072f2 Compare May 20, 2025 02:30

DrahtBot removed the CI failed label May 20, 2025

sipa added the Mempool label May 20, 2025

jonatack reviewed May 22, 2025

View reviewed changes

sipa force-pushed the 202505_sfl branch 5 times, most recently from 9ee20ca to ba7464a Compare May 25, 2025 16:43

DrahtBot added the CI failed label May 25, 2025

DrahtBot removed the CI failed label May 25, 2025

sipa force-pushed the 202505_sfl branch 3 times, most recently from 55931c3 to 47bdf8f Compare May 28, 2025 14:43

sipa force-pushed the 202505_sfl branch from de45866 to 58b6fc7 Compare June 12, 2025 16:32

sipa force-pushed the 202505_sfl branch 2 times, most recently from 8b3968f to 505bc96 Compare June 14, 2025 22:56

sipa mentioned this pull request Jun 27, 2025

refactor: CFeeRate encapsulates FeeFrac internally #32750

Merged

sipa force-pushed the 202505_sfl branch from 505bc96 to 545c11d Compare July 11, 2025 03:19

DrahtBot added the Needs rebase label Jul 29, 2025

sipa mentioned this pull request Aug 8, 2025

cluster mempool: control/optimize TxGraph memory usage #33157

Open

sipa force-pushed the 202505_sfl branch from 545c11d to 6a7377d Compare August 9, 2025 03:53

DrahtBot added the CI failed label Aug 9, 2025

DrahtBot removed the Needs rebase label Aug 9, 2025

sipa added 13 commits August 9, 2025 09:32

clusterlin: add known-correct optimal linearization tests (tests)

d5e322a

clusterlin: replace benchmarks with SFL-hard ones (bench)

149731f

clusterlin: remove unused {Ancestor,Search}CandidateFinder code (clea…

ff164c1

…nup) This removes the candidate set finding classes, as well as related tests and benchmarks for them.

clusterlin: keep track of the active parents of each transaction (opt…

46f5435

…imization) This avoids the need for a loop over all parents of a transaction while walking a chunk, and removes the need to store the set of parent dependencies explicitly.

clusterlin: keep FIFO queue of improvable chunks (optimization)

dde2081

This distributes the work over the various chunks fairly, and simultaneously avoids retrying chunks over and over again which are already known to be optimal.

clusterlin: randomize merges/splits in SFL (feature)

b1b72d1

sipa force-pushed the 202505_sfl branch from 6a7377d to 8bca6bb Compare August 9, 2025 13:43

sipa force-pushed the 202505_sfl branch from 8bca6bb to 3d2b9b4 Compare August 9, 2025 15:16

DrahtBot removed the CI failed label Aug 9, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Replace cluster linearization algorithm with SFL #32545

Replace cluster linearization algorithm with SFL #32545

sipa commented May 17, 2025 •

edited

Loading

Uh oh!

DrahtBot commented May 17, 2025 •

edited

Loading

Uh oh!

DrahtBot commented May 17, 2025

Uh oh!

DrahtBot commented May 19, 2025

Uh oh!

jonatack left a comment

Uh oh!

jonatack May 22, 2025

Uh oh!

DrahtBot commented May 25, 2025

Uh oh!

l0rinc commented Jun 4, 2025

Uh oh!

sipa commented Jun 12, 2025

Uh oh!

DrahtBot commented Aug 9, 2025

Uh oh!

Uh oh!

Replace cluster linearization algorithm with SFL #32545

Are you sure you want to change the base?

Replace cluster linearization algorithm with SFL #32545

Conversation

sipa commented May 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

DrahtBot commented May 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code Coverage & Benchmarks

Reviews

Conflicts

Uh oh!

DrahtBot commented May 17, 2025

Uh oh!

DrahtBot commented May 19, 2025

Uh oh!

jonatack left a comment

Choose a reason for hiding this comment

Uh oh!

jonatack May 22, 2025

Choose a reason for hiding this comment

Uh oh!

DrahtBot commented May 25, 2025

Uh oh!

l0rinc commented Jun 4, 2025

Uh oh!

sipa commented Jun 12, 2025

Uh oh!

DrahtBot commented Aug 9, 2025

Uh oh!

Uh oh!

sipa commented May 17, 2025 •

edited

Loading

DrahtBot commented May 17, 2025 •

edited

Loading