[IBD] coins: increase default UTXO flush batch size to 32 MiB #31645
Conversation
Code Coverage & Benchmarks: for details see https://corecheck.dev/bitcoin/bitcoin/pulls/31645. Reviews: see the guideline for information on the review process. Conflicts: no conflicts as of last run.
FWIW, the reason the batch size behavior exists (as opposed to just writing everything at once) is that writing everything at once causes a memory usage spike at flush time. If that spike exceeds the memory the process can allocate, it causes a crash at a particularly bad time (it may require a replay to fix, which may be slower than just reprocessing the blocks). Given that changing this appears to improve performance it's worth considering, of course, but it is essentially a trade-off between speed and a memory usage spike.
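To illustrate the trade-off described above, here is a minimal standalone sketch (hypothetical types and names, not the actual Bitcoin Core code; the real loop lives in `CCoinsViewDB::BatchWrite`, quoted in the instrumentation diff further down): the pending batch only grows to roughly the configured threshold before it is written out and cleared, so the flush-time memory spike scales with the batch size rather than with the full UTXO set.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for the database handle; not Bitcoin Core's API.
struct Db {
    void WriteBatch(const std::string& batch) { (void)batch; /* spike peaks while batch is alive */ }
};

void FlushCoins(Db& db, const std::vector<std::string>& entries, std::size_t batch_write_bytes)
{
    std::string batch; // serialized key/value data pending write
    for (const auto& entry : entries) {
        batch += entry;
        if (batch.size() > batch_write_bytes) {
            db.WriteBatch(batch); // extra memory here is roughly batch_write_bytes
            batch.clear();        // buffer is reused for the next partial batch
        }
    }
    db.WriteBatch(batch); // final (possibly partial) batch
}
```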
There is a config option. This is about changing the default.
Thanks for the context, @sipa.

Can we predict the memory usage spike size? Presumably as we flush, that releases memory, which allows for a larger and larger batch size?
Since profilers may not catch these short-lived spikes, I've instrumented the code, loaded the UTXO set (as described in the PR), parsed the logged flushing times and memory usages, and plotted them against each other to see the effect of the batch size increase.

txdb.cpp flush time and memory instrumentation:

```diff
diff --git a/src/txdb.cpp b/src/txdb.cpp
--- a/src/txdb.cpp (revision d249a353be58868d41d2a7c57357038ffd779eba)
+++ b/src/txdb.cpp (revision bae884969d35469320ed9967736eb15b5d87edff)
@@ -90,7 +90,81 @@
return vhashHeadBlocks;
}
+/*
+ * Author: David Robert Nadeau
+ * Site: http://NadeauSoftware.com/
+ * License: Creative Commons Attribution 3.0 Unported License
+ * http://creativecommons.org/licenses/by/3.0/deed.en_US
+ */
+#if defined(_WIN32)
+#include <windows.h>
+#include <psapi.h>
+
+#elif defined(__unix__) || defined(__unix) || defined(unix) || (defined(__APPLE__) && defined(__MACH__))
+#include <unistd.h>
+#include <sys/resource.h>
+
+#if defined(__APPLE__) && defined(__MACH__)
+#include <mach/mach.h>
+
+#elif (defined(_AIX) || defined(__TOS__AIX__)) || (defined(__sun__) || defined(__sun) || defined(sun) && (defined(__SVR4) || defined(__svr4__)))
+#include <fcntl.h>
+#include <procfs.h>
+
+#elif defined(__linux__) || defined(__linux) || defined(linux) || defined(__gnu_linux__)
+#include <stdio.h>
+
+#endif
+
+#else
+#error "Cannot define getCurrentRSS( ) for an unknown OS."
+#endif
+
+/**
+ * Returns the current resident set size (physical memory use) measured
+ * in bytes, or zero if the value cannot be determined on this OS.
+ */
+size_t getCurrentRSS( )
+{
+#if defined(_WIN32)
+ /* Windows -------------------------------------------------- */
+ PROCESS_MEMORY_COUNTERS info;
+ GetProcessMemoryInfo( GetCurrentProcess( ), &info, sizeof(info) );
+ return (size_t)info.WorkingSetSize;
+
+#elif defined(__APPLE__) && defined(__MACH__)
+ /* OSX ------------------------------------------------------ */
+ struct mach_task_basic_info info;
+ mach_msg_type_number_t infoCount = MACH_TASK_BASIC_INFO_COUNT;
+ if ( task_info( mach_task_self( ), MACH_TASK_BASIC_INFO,
+ (task_info_t)&info, &infoCount ) != KERN_SUCCESS )
+ return (size_t)0L; /* Can't access? */
+ return (size_t)info.resident_size;
+
+#elif defined(__linux__) || defined(__linux) || defined(linux) || defined(__gnu_linux__)
+ /* Linux ---------------------------------------------------- */
+ long rss = 0L;
+ FILE* fp = NULL;
+ if ( (fp = fopen( "/proc/self/statm", "r" )) == NULL )
+ return (size_t)0L; /* Can't open? */
+ if ( fscanf( fp, "%*s%ld", &rss ) != 1 )
+ {
+ fclose( fp );
+ return (size_t)0L; /* Can't read? */
+ }
+ fclose( fp );
+ return (size_t)rss * (size_t)sysconf( _SC_PAGESIZE);
+
+#else
+ /* AIX, BSD, Solaris, and Unknown OS ------------------------ */
+ return (size_t)0L; /* Unsupported. */
+#endif
+}
+
bool CCoinsViewDB::BatchWrite(CoinsViewCacheCursor& cursor, const uint256 &hashBlock) {
+ const auto start = std::chrono::steady_clock::now();
+ size_t max_mem{getCurrentRSS()};
+
CDBBatch batch(*m_db);
size_t count = 0;
size_t changed = 0;
@@ -129,7 +203,11 @@
it = cursor.NextAndMaybeErase(*it);
if (batch.SizeEstimate() > m_options.batch_write_bytes) {
LogDebug(BCLog::COINDB, "Writing partial batch of %.2f MiB\n", batch.SizeEstimate() * (1.0 / 1048576.0));
+
+ max_mem = std::max(max_mem, getCurrentRSS());
m_db->WriteBatch(batch);
+ max_mem = std::max(max_mem, getCurrentRSS());
+
batch.Clear();
if (m_options.simulate_crash_ratio) {
static FastRandomContext rng;
@@ -146,8 +224,16 @@
batch.Write(DB_BEST_BLOCK, hashBlock);
LogDebug(BCLog::COINDB, "Writing final batch of %.2f MiB\n", batch.SizeEstimate() * (1.0 / 1048576.0));
+
+ max_mem = std::max(max_mem, getCurrentRSS());
bool ret = m_db->WriteBatch(batch);
+ max_mem = std::max(max_mem, getCurrentRSS());
+
LogDebug(BCLog::COINDB, "Committed %u changed transaction outputs (out of %u) to coin database...\n", (unsigned int)changed, (unsigned int)count);
+ if (changed > 0) {
+ const auto end{std::chrono::steady_clock::now()};
+ LogInfo("BatchWrite took=%dms, maxMem=%dMiB", duration_cast<std::chrono::milliseconds>(end - start).count(), max_mem >> 20);
+ }
return ret;
}
```

Python script to load the UTXO set, parse the logs, and create the flush and memory plots:

```python
import os
import re
import shutil
import statistics
import subprocess
import time
import datetime
import argparse
import matplotlib.pyplot as plt # python3.12 -m pip install matplotlib --break-system-packages
# Regex to parse logs
BATCHWRITE_REGEX = re.compile(r"^(\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}Z) BatchWrite took=(\d+)ms, maxMem=(\d+)MiB")
def parse_log(archive):
"""Parse the log file to extract elapsed times, flush times, and memory usage."""
start_time = None
elapsed, batchwrite_times, usage_snapshots = [], [], []
with open(archive, "r") as f:
for line in f:
if m := BATCHWRITE_REGEX.search(line):
dt = datetime.datetime.strptime(m.group(1), "%Y-%m-%dT%H:%M:%SZ")
if start_time is None:
start_time = dt
elapsed.append((dt - start_time).total_seconds())
batchwrite_times.append(int(m.group(2)))
usage_snapshots.append(int(m.group(3)))
return elapsed, batchwrite_times, usage_snapshots
def plot_results(results, output_dir):
"""Create separate plots for flush times and memory usage."""
if len(results) != 2:
print("plot_results() requires exactly 2 runs for comparison.")
return
(dbbatch0, elapsed0, flush0, mem0) = results[0]
(dbbatch1, elapsed1, flush1, mem1) = results[1]
# Compute percentage differences
avg_flush0, avg_flush1 = statistics.mean(flush0), statistics.mean(flush1)
max_mem0, max_mem1 = max(mem0), max(mem1)
flush_improvement = round(((avg_flush0 - avg_flush1) / avg_flush0) * 100, 1)
mem_increase = round(((max_mem1 - max_mem0) / max_mem0) * 100, 1)
# Plot flush times
plt.figure(figsize=(16, 8))
plt.plot(elapsed0, flush0, color="red", linestyle="-", label=f"Flush Times (dbbatch={dbbatch0})")
plt.axhline(y=avg_flush0, color="red", linestyle="--", alpha=0.5, label=f"Mean ({dbbatch0})={avg_flush0:.1f}ms")
plt.plot(elapsed1, flush1, color="orange", linestyle="-", label=f"Flush Times (dbbatch={dbbatch1})")
plt.axhline(y=avg_flush1, color="orange", linestyle="--", alpha=0.5, label=f"Mean ({dbbatch1})={avg_flush1:.1f}ms")
plt.title(f"Flush Times (dbbatch {dbbatch0} vs {dbbatch1}) — {abs(flush_improvement)}% {'faster' if flush_improvement > 0 else 'slower'}")
plt.xlabel("Elapsed Time (seconds)")
plt.ylabel("Flush Times (ms)")
plt.legend()
plt.grid(True)
plt.tight_layout()
flush_out_file = os.path.join(output_dir, "plot_flush_times.png")
plt.savefig(flush_out_file)
print(f"Flush Times plot saved as {flush_out_file}")
plt.close()
# Plot memory usage
plt.figure(figsize=(16, 8))
plt.plot(elapsed0, mem0, color="blue", linestyle="-", label=f"Memory (dbbatch={dbbatch0})")
plt.axhline(y=max_mem0, color="blue", linestyle="--", alpha=0.5, label=f"Max Mem ({dbbatch0})={max_mem0}MiB")
plt.plot(elapsed1, mem1, color="green", linestyle="-", label=f"Memory (dbbatch={dbbatch1})")
plt.axhline(y=max_mem1, color="green", linestyle="--", alpha=0.5, label=f"Max Mem ({dbbatch1})={max_mem1}MiB")
plt.title(f"Memory Usage (dbbatch {dbbatch0} vs {dbbatch1}) — {abs(mem_increase)}% {'higher' if mem_increase > 0 else 'lower'}")
plt.xlabel("Elapsed Time (seconds)")
plt.ylabel("Memory Usage (MiB)")
plt.legend()
plt.grid(True)
plt.tight_layout()
mem_out_file = os.path.join(output_dir, "plot_memory_usage.png")
plt.savefig(mem_out_file)
print(f"Memory Usage plot saved as {mem_out_file}")
plt.close()
def loadtxoutset(dbbatchsize, datadir, bitcoin_cli, bitcoind, utxo_file):
"""Load the UTXO set and run the Bitcoin node."""
archive = os.path.join(datadir, f"results_dbbatch-{dbbatchsize}.log")
# Skip if logs already exist
if os.path.exists(archive):
print(f"Log file {archive} already exists. Skipping loadtxoutset for dbbatchsize={dbbatchsize}.")
return
os.makedirs(datadir, exist_ok=True)
debug_log = os.path.join(datadir, "debug.log")
try:
print("Cleaning up previous run")
for subdir in ["chainstate", "chainstate_snapshot"]:
shutil.rmtree(os.path.join(datadir, subdir), ignore_errors=True)
print("Preparing UTXO load")
subprocess.run([bitcoind, f"-datadir={datadir}", "-stopatheight=1"], cwd=bitcoin_core_path)
os.remove(debug_log)
print(f"Starting bitcoind with dbbatchsize={dbbatchsize}")
subprocess.run([bitcoind, f"-datadir={datadir}", "-daemon", "-blocksonly=1", "-connect=0", f"-dbbatchsize={dbbatchsize}", f"-dbcache={440}"], cwd=bitcoin_core_path)
time.sleep(5)
print("Loading UTXO set")
subprocess.run([bitcoin_cli, f"-datadir={datadir}", "loadtxoutset", utxo_file], cwd=bitcoin_core_path)
except Exception as e:
print(f"Error during loadtxoutset for dbbatchsize={dbbatchsize}: {e}")
raise
finally:
print("Stopping bitcoind...")
subprocess.run([bitcoin_cli, f"-datadir={datadir}", "stop"], cwd=bitcoin_core_path, stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL)
time.sleep(5)
shutil.copy2(debug_log, archive)
print(f"Archived logs to {archive}")
if __name__ == "__main__":
# Parse script arguments
parser = argparse.ArgumentParser(description="Benchmark Bitcoin dbbatchsize configurations.")
parser.add_argument("--utxo-file", required=True, help="Path to the UTXO snapshot file.")
parser.add_argument("--bitcoin-core-path", required=True, help="Path to the Bitcoin Core project directory.")
args = parser.parse_args()
utxo_file = args.utxo_file
bitcoin_core_path = args.bitcoin_core_path
datadir = os.path.join(bitcoin_core_path, "demo")
debug_log = os.path.join(datadir, "debug.log")
bitcoin_cli = os.path.join(bitcoin_core_path, "build/src/bitcoin-cli")
bitcoind = os.path.join(bitcoin_core_path, "build/src/bitcoind")
# Build Bitcoin Core
print("Building Bitcoin Core...")
subprocess.run(["cmake", "-B", "build", "-DCMAKE_BUILD_TYPE=Release"], cwd=bitcoin_core_path, check=True)
subprocess.run(["cmake", "--build", "build", "-j", str(os.cpu_count())], cwd=bitcoin_core_path, check=True)
# Run tests for each dbbatchsize
results = []
for dbbatchsize in [16777216, 67108864]: # Original and proposed
loadtxoutset(dbbatchsize, datadir, bitcoin_cli, bitcoind, utxo_file)
archive = os.path.join(datadir, f"results_dbbatch-{dbbatchsize}.log")
elapsed, batchwrite_times, usage_snapshots = parse_log(archive)
results.append((dbbatchsize, elapsed, batchwrite_times, usage_snapshots))
# Plot results
plot_results(results, bitcoin_core_path)
print("All configurations processed.") For standard dbcache values the results are very close (though the memory measurements aren't as scientific as I'd like them to be (probably because there is still enough memory), some runs even indicate that 16MiB consumes a bit more memory than the 64MiB version), but the trend seems to be clear from the produced plots: the batch writes are faster (and seem more predictable) with bigger batches, while the memory usage is only slightly higher. Is there any other way that you'd like me to test this @sipa, @luke-jr, @1440000bytes? |
Force-pushed from d249a35 to 8684133.
I think those graphs need to be on height rather than seconds. The larger dbbatchsize making it faster means it gets further in the chain, leading to the higher max at the end... I would expect both lines to be essentially overlapping except during flushes.
I was only measuring the memory here during flushes. There is no direct height available there, but if we instrument ...
The UTXO set has grown significantly, and flushing it from memory to LevelDB often takes over 20 minutes after a successful IBD with large dbcache values. The final UTXO set is written to disk in batches, which LevelDB sorts into SST files. By increasing the default batch size, we can reduce overhead from repeated compaction cycles, minimize constant overhead per batch, and achieve more sequential writes.

Experiments with different batch sizes (loaded via assumeutxo at block 840k, then measuring final flush time) show that 64 MiB batches significantly reduce flush time without notably increasing memory usage:

| dbbatchsize | flush_sum (ms) |
|-------------|----------------|
| 8 MiB       | ~240,000       |
| 16 MiB      | ~220,000       |
| 32 MiB      | ~200,000       |
| *64 MiB*    | *~150,000*     |
| 128 MiB     | ~156,000       |
| 256 MiB     | ~166,000       |
| 512 MiB     | ~186,000       |
| 1 GiB       | ~186,000       |

Checking the impact of a `-reindex-chainstate` with `-stopatheight=878000` and `-dbcache=30000` gives:

16 << 20
```
2025-01-12T07:31:05Z Flushed fee estimates to fee_estimates.dat.
2025-01-12T07:31:05Z [warning] Flushing large (26 GiB) UTXO set to disk, it may take several minutes
2025-01-12T07:53:51Z Shutdown: done
```
Flush time: 22 minutes and 46 seconds.

64 >> 20
```
2025-01-12T18:30:00Z Flushed fee estimates to fee_estimates.dat.
2025-01-12T18:30:00Z [warning] Flushing large (26 GiB) UTXO set to disk, it may take several minutes
2025-01-12T18:44:43Z Shutdown: done
```
Flush time: ~14 minutes 43 seconds.

Github-Pull: bitcoin#31645
Rebased-From: d249a35
Code review ACK 8684133

> If that spike exceeds the memory the process can allocate it causes a crash, at a particularly bad time (may require a replay to fix, which may be slower than just reprocessing the blocks).

It is difficult for me to have a sense of how safe this change is, but I'd hope we are not currently pushing systems so close to the edge that using an extra 48 MiB will cause them to start crashing. This does seem like a nice performance improvement if it doesn't cause crashes.

In theory, we could dynamically limit the batch size based on available memory to mitigate the risk of crashes. However, since the batch size is already small and further increases don't provide much additional benefit (per the commit message), that added complexity probably isn't worth it here.
Concept ACK on raising the default from 16 to 67 megabytes (note that `-dbbatchsize` is a hidden `-help-debug` config option). Testing:

```
$ ./build/bin/bitcoind -help-debug | grep -A2 dbbatch
  -dbbatchsize
       Maximum database write batch size in bytes (default: 67108864)
```
Force-pushed from 1ec68cf to 4b4dee0.
🚧 At least one of the CI tasks failed. Hints: try to run the tests locally, according to the documentation. However, a CI failure may still ... Leave a comment here if you need help tracking down a confusing failure.
Force-pushed from 4b4dee0 to 8fd522b.
ACK 8fd522b
A few nits, feel free to pick/choose/ignore.
src/node/coins_view_args.h (outdated):

```cpp
        (dbcache_bytes / DEFAULT_KERNEL_CACHE) * DEFAULT_DB_CACHE_BATCH,
        /*lo=*/DEFAULT_DB_CACHE_BATCH,
        /*hi=*/256_MiB
    );
```
Might be good to add a comment for this magic value, i.e. (from the PR description):

> Capped at 256 MiB, as gains are barely measurable for bigger batches (see PR 31645)

also, clang-format:

```diff
-        /*hi=*/256_MiB
-    );
+        /*hi=*/256_MiB);
```

and unneeded braces line 18:

```diff
-        (dbcache_bytes / DEFAULT_KERNEL_CACHE) * DEFAULT_DB_CACHE_BATCH,
+        dbcache_bytes / DEFAULT_KERNEL_CACHE * DEFAULT_DB_CACHE_BATCH,
```
> Capped at 256 MiB, as gains are barely measurable for bigger batches (see PR 31645)

I don't mind adding, but a simple blame would immediately reveal that.

> unneeded braces line 18

They may be implied, but I want to emphasize that mathematically this isn't associative, i.e. not the same as

```cpp
dbcache_bytes / (DEFAULT_KERNEL_CACHE * DEFAULT_DB_CACHE_BATCH)
```
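Illustrating that grouping point with concrete numbers, here is a standalone sketch (constant values taken from the kernel/caches.h excerpt and the unit tests quoted later in this thread; `std::uint64_t` used to avoid 32-bit overflow). `a / b * c` groups left-to-right, so it equals `(a / b) * c`, not `a / (b * c)`:

```cpp
#include <cstdint>

constexpr std::uint64_t MiB{1024 * 1024};
constexpr std::uint64_t DEFAULT_KERNEL_CACHE{450 * MiB};
constexpr std::uint64_t DEFAULT_DB_CACHE_BATCH{16 * MiB};
constexpr std::uint64_t dbcache_bytes{1000 * MiB};

// Left-to-right grouping: (1000 MiB / 450 MiB) * 16 MiB = 2 * 16 MiB = 32 MiB,
// matching the quoted test value GetDbBatchSize(1000_MiB) == 33'554'432.
static_assert(dbcache_bytes / DEFAULT_KERNEL_CACHE * DEFAULT_DB_CACHE_BATCH == 32 * MiB);

// The other grouping divides by an enormous product and collapses to zero.
static_assert(dbcache_bytes / (DEFAULT_KERNEL_CACHE * DEFAULT_DB_CACHE_BATCH) == 0);
```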
Extracted min/max and added some comments
src/test/coins_tests.cpp (outdated):

```diff
@@ -20,6 +20,7 @@
 #include <vector>

 #include <boost/test/unit_test.hpp>
+#include <node/coins_view_args.h>
```
This seems to be the most frequently seen order:

```diff
 #include <coins.h>
+#include <node/coins_view_args.h>
 #include <streams.h>
@@ -20,7 +21,6 @@
 #include <vector>

 #include <boost/test/unit_test.hpp>
-#include <node/coins_view_args.h>

 using namespace util::hex_literals;
```
You mean we usually ignore the folder when sorting? I found examples for all sorts of includes (pun intended). I'll adjust if I have to push again.
> you mean we usually ignore the folder when sorting?

Think it's a case of keeping our headers separate from external dependencies/libraries.
Makes sense, will do next time I push
Done
src/node/coins_view_args.cpp (outdated):

```diff
-    if (auto value = args.GetIntArg("-dbbatchsize")) options.batch_write_bytes = *value;
     if (auto value = args.GetIntArg("-dbcrashratio")) options.simulate_crash_ratio = *value;
+    if (const auto value = args.GetIntArg("-dbbatchsize")) options.batch_write_bytes = *value;
+    else options.batch_write_bytes = GetDbBatchSize(args.GetIntArg("-dbcache", DEFAULT_KERNEL_CACHE));
```
Feel free to ignore, but the following would be more readable and follow the most frequent convention in this codebase:

```diff
-    if (const auto value = args.GetIntArg("-dbbatchsize")) options.batch_write_bytes = *value;
-    else options.batch_write_bytes = GetDbBatchSize(args.GetIntArg("-dbcache", DEFAULT_KERNEL_CACHE));
+    if (const auto value = args.GetIntArg("-dbbatchsize")) {
+        options.batch_write_bytes = *value;
+    } else {
+        options.batch_write_bytes = GetDbBatchSize(args.GetIntArg("-dbcache", DEFAULT_KERNEL_CACHE));
+    }
```
Will do if I repush.
Added braces
I'm not sure it makes sense to adjust this based on dbcache size. Won't a given batch size use the same amount of memory regardless of the size of the dbcache?
It would; the assumption was that the user should be able to signal how much leftover memory they have: if they start the app with a dbcache of 40 MiB, allocating an extra 16 MiB can be acceptable, but allocating an extra 64 MiB can push the node over the edge. Even though we're not (yet?) preallocating the batch string, doubling the size to accommodate the content would end up with a similar size. Note, however, that this has changed slightly with the merge of #30611: since we're flushing regularly now, there's no significant difference between flushing with a dbcache of 4.5 GiB and one with 45 GiB. I'm open to suggestions for which direction to take from here.
Force-pushed from 8fd522b to af653f3.
Rebased, the PR is ready for review again! The batch size for UTXO set writes is now calculated based on the maximum ...
Force-pushed from af653f3 to 956a6b4.
Pushed the remaining nits that I promised a long time ago :)

Re-rebased, you can re-review with `git range-diff af653f3...956a6b4`.
src/node/coins_view_args.h (outdated):

```cpp
static constexpr size_t GetDbBatchSize(const size_t dbcache_bytes)
{
    const auto target{(dbcache_bytes / DEFAULT_KERNEL_CACHE) * DEFAULT_DB_CACHE_BATCH};
    return std::max<size_t>(MIN_DB_CACHE_BATCH, std::min<size_t>(MAX_DB_CACHE_BATCH, target));
```
Ended up with extracting min/max and adding an assert
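For reference, a self-contained sketch of the clamped scaling being reviewed here, reconstructed from the excerpts and unit tests quoted in this thread (the constant values are assumptions based on those quotes, not the authoritative definitions):

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>

constexpr std::size_t MiB{1024 * 1024};
constexpr std::size_t DEFAULT_KERNEL_CACHE{450 * MiB};   // default dbcache
constexpr std::size_t DEFAULT_DB_CACHE_BATCH{16 * MiB};  // batch at default dbcache
constexpr std::size_t MIN_DB_CACHE_BATCH{16 * MiB};
constexpr std::size_t MAX_DB_CACHE_BATCH{256 * MiB};     // gains barely measurable above this

constexpr std::size_t GetDbBatchSize(std::size_t dbcache_bytes)
{
    // Scale linearly with dbcache, then clamp into [min, max].
    const std::size_t target{(dbcache_bytes / DEFAULT_KERNEL_CACHE) * DEFAULT_DB_CACHE_BATCH};
    return std::clamp(target, MIN_DB_CACHE_BATCH, MAX_DB_CACHE_BATCH);
}

// Spot checks matching the tests quoted below:
static_assert(GetDbBatchSize(450 * MiB) == 16 * MiB);
static_assert(GetDbBatchSize(1000 * MiB) == 32 * MiB);
#if SIZE_MAX > UINT32_MAX
static_assert(GetDbBatchSize(45000ULL * MiB) == 256 * MiB); // clamped at the max
#endif
```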
ACK 956a6b4
- Note that #31645 (comment) refers to a change of approach from several months ago, prior to my previous review -- in that context, I found the comment confusing
- I would suggest not rebasing if there is no merge conflict; this eases reviewing the diff (and after, I'll usually rebase the PR branch to master locally anyway)
- Happy to re-ack if you take the review suggestions
src/kernel/caches.h (outdated):

```diff
@@ -16,6 +16,9 @@
 static constexpr size_t MAX_BLOCK_DB_CACHE{2_MiB};
 //! Max memory allocated to coin DB specific cache (bytes)
 static constexpr size_t MAX_COINS_DB_CACHE{8_MiB};

+//! The batch size of DEFAULT_KERNEL_CACHE
+static constexpr size_t DEFAULT_DB_CACHE_BATCH{16_MiB};
```
In 6fa584f, seems it would make sense to place these two together:

```diff
 //! Suggested default amount of cache reserved for the kernel (bytes)
 static constexpr size_t DEFAULT_KERNEL_CACHE{450_MiB};
+//! Batch size of DEFAULT_KERNEL_CACHE
+static constexpr size_t DEFAULT_DB_CACHE_BATCH{16_MiB};
 //! Max memory allocated to block tree DB specific cache (bytes)
 static constexpr size_t MAX_BLOCK_DB_CACHE{2_MiB};
 //! Max memory allocated to coin DB specific cache (bytes)
 static constexpr size_t MAX_COINS_DB_CACHE{8_MiB};
-//! The batch size of DEFAULT_KERNEL_CACHE
-static constexpr size_t DEFAULT_DB_CACHE_BATCH{16_MiB};
-
 namespace kernel {
 struct CacheSizes {
```
We could, I'll do it next time I push
Concept ACK 956a6b4
Have not confirmed the improvements in IBD speedup.
src/test/coins_tests.cpp (outdated):

```cpp
    BOOST_REQUIRE_EQUAL(node::GetDbBatchSize(DEFAULT_KERNEL_CACHE), DEFAULT_DB_CACHE_BATCH);
    BOOST_REQUIRE_EQUAL(node::GetDbBatchSize(0_MiB), DEFAULT_DB_CACHE_BATCH);

    BOOST_CHECK_EQUAL(node::GetDbBatchSize(4_MiB), 16'777'216);
```
nit: Could motivate the min value:

```diff
+    static_assert(MIN_DB_CACHE == 4_MiB);
     BOOST_CHECK_EQUAL(node::GetDbBatchSize(4_MiB), 16'777'216);
```
src/test/coins_tests.cpp (outdated):

```cpp
BOOST_AUTO_TEST_CASE(db_batch_sizes)
{
    BOOST_REQUIRE_EQUAL(node::GetDbBatchSize(DEFAULT_KERNEL_CACHE), DEFAULT_DB_CACHE_BATCH);
    BOOST_REQUIRE_EQUAL(node::GetDbBatchSize(0_MiB), DEFAULT_DB_CACHE_BATCH);

    BOOST_CHECK_EQUAL(node::GetDbBatchSize(4_MiB), 16'777'216);
    BOOST_CHECK_EQUAL(node::GetDbBatchSize(10_MiB), 16'777'216);
    BOOST_CHECK_EQUAL(node::GetDbBatchSize(45_MiB), 16'777'216);
    BOOST_CHECK_EQUAL(node::GetDbBatchSize(100_MiB), 16'777'216);
    BOOST_CHECK_EQUAL(node::GetDbBatchSize(450_MiB), 16'777'216);
    BOOST_CHECK_EQUAL(node::GetDbBatchSize(1000_MiB), 33'554'432);
    BOOST_CHECK_EQUAL(node::GetDbBatchSize(2000_MiB), 67'108'864);
    BOOST_CHECK_EQUAL(node::GetDbBatchSize(3000_MiB), 100'663'296);

#if SIZE_MAX > UINT32_MAX
    BOOST_CHECK_EQUAL(node::GetDbBatchSize(4500_MiB), 167'772'160);
    BOOST_CHECK_EQUAL(node::GetDbBatchSize(7000_MiB), 251'658'240);
    BOOST_CHECK_EQUAL(node::GetDbBatchSize(10000_MiB), 268'435'456);
    BOOST_CHECK_EQUAL(node::GetDbBatchSize(45000_MiB), 268'435'456);
#endif
}
```
Might as well make use of `constexpr`? Suggested replacement for the `db_batch_sizes` test case:

```cpp
// Verify DB batch sizes:
static_assert(node::GetDbBatchSize(DEFAULT_KERNEL_CACHE) == DEFAULT_DB_CACHE_BATCH);
static_assert(node::GetDbBatchSize(0_MiB) == DEFAULT_DB_CACHE_BATCH);
static_assert(node::GetDbBatchSize(4_MiB) == 16'777'216);
static_assert(node::GetDbBatchSize(10_MiB) == 16'777'216);
static_assert(node::GetDbBatchSize(45_MiB) == 16'777'216);
static_assert(node::GetDbBatchSize(100_MiB) == 16'777'216);
static_assert(node::GetDbBatchSize(450_MiB) == 16'777'216);
static_assert(node::GetDbBatchSize(1000_MiB) == 33'554'432);
static_assert(node::GetDbBatchSize(2000_MiB) == 67'108'864);
static_assert(node::GetDbBatchSize(3000_MiB) == 100'663'296);
#if SIZE_MAX > UINT32_MAX
static_assert(node::GetDbBatchSize(4500_MiB) == 167'772'160);
static_assert(node::GetDbBatchSize(7000_MiB) == 251'658'240);
static_assert(node::GetDbBatchSize(10000_MiB) == 268'435'456);
static_assert(node::GetDbBatchSize(45000_MiB) == 268'435'456);
#endif
```
I'm not a fan of compile-time tests; they're usually harder to debug when needed.
And these are as fast as they get, so we wouldn't be saving any time. It would be inconsistent with other tests, while being slightly harder to debug and not any faster.
Do you think there's any tangible advantage there?
I would just default to testing at compile time:
- No need to (re)run ctest/test_bitcoin to exercise checks.
- Only run when compilation unit is (re)built. Not re-run when iterating on test code in other compilation units.
- Inconsistency might push other tests towards being converted to compile time, which I would say is a positive effect.
These are extremely fast tests; keeping the usual test format has more advantages in my opinion.
If you have a strong preference or if other reviewers prefer that, I don't mind changing, but I don't think the current one has measurable disadvantages compared to the suggestion, while being more in line with how we usually test and making `GetDbBatchSize` easily debuggable (it often helps with understanding if you can step through it).
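A small illustration of the debuggability trade-off being discussed (my example, not from the PR): a failing `static_assert` fires at compile time and reports only the condition, with nothing to step through, while an equivalent runtime check can be inspected live in a debugger.

```cpp
#include <cassert>

constexpr int Square(int x) { return x * x; }

// Compile-time: a failure aborts the build with "static assertion failed";
// there is no process to attach a debugger to and no values are printed.
static_assert(Square(4) == 16);

int main()
{
    // Runtime: a debugger can break here and step into Square().
    assert(Square(4) == 16);
    return 0;
}
```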
ACK 956a6b4

Change scales the hidden `-dbbatchsize` by the `-dbcache` setting if unset.

Compared IBD until block 910'000 from a local peer in 3 variations (both nodes on SSD, edit: both on NVMe): the PR change for `-dbcache` of 450 vs 45'000, and the base commit of the PR with 45'000.

- Baseline of `-dbcache=450` averaged 267 mins across 3 runs.
- `-dbcache=45000` without this PR averaged 241 mins across 3 runs (these were probably the runs where I was doing the least work on the other node).
- `-dbcache=45000` with this PR averaged 233 mins across 3 runs.

That looks like at least an average 3% improvement in the `-dbcache=45000` case for me. Still nice.

Edit: Did another run with the PR and `-dbcache=45000` and without working on any of the nodes, and got a wall time of 218 min, strengthening the impression that this PR is beneficial.
Run log:

```
With PR changes (956a6b4):

rm -rf ~/.bitcoin && time ./build/bin/bitcoind -connect=workstation.lan -dbcache=45000 -stopatheight=910000
real 238m57.330s
user 527m48.667s
sys 19m49.001s

rm -rf ~/.bitcoin && time ./build/bin/bitcoind -connect=workstation.lan -dbcache=450 -stopatheight=910000
real 266m16.473s
user 636m6.859s
sys 32m59.252s

repeated:
real 273m25.548s
user 640m5.442s
sys 31m58.674s

rm -rf ~/.bitcoin && time ./build/bin/bitcoind -connect=workstation.lan -dbcache=45000 -stopatheight=910000
real 228m12.943s
user 509m52.682s
sys 17m10.000s

rm -rf ~/.bitcoin && time ./build/bin/bitcoind -connect=workstation.lan -dbcache=450 -stopatheight=910000
real 261m47.071s
user 631m42.243s
sys 31m34.241s

PR base (d767503):

rm -rf ~/.bitcoin && time ./build/bin/bitcoind -connect=workstation.lan -dbcache=45000 -stopatheight=910000
real 246m56.108s
user 522m46.642s
sys 20m24.343s

again:
real 232m41.467s
user 508m7.763s
sys 19m16.589s

again:
real 242m17.316s
user 506m25.625s
sys 18m45.029s
=> 241

Again on PR:
rm -rf ~/.bitcoin && time ./build/bin/bitcoind -connect=workstation.lan -dbcache=45000 -stopatheight=910000
real 232m37.784s
user 490m58.944s
sys 18m3.817s

Averages:
450 runs:
(266+273+262) / 3 = 267 mins
45'000 without PR runs:
(247+233+242) / 3 = 241 mins
45'000 with PR runs:
(239+228+233) / 3 = 233 mins
```
The constant for the default batch size is moved to `kernel/caches.h` to consolidate kernel-related cache constants.
The default database write batch size is increased from 16 MiB to 32 MiB to improve I/O efficiency and performance during UTXO flushes, particularly during Initial Block Download and `assumeutxo` loads. On systems with slower I/O, a larger batch size reduces overhead from numerous small writes. Measurements show this change provides a modest performance improvement on most hardware during a critical section, with a minimal peak memory increase (approx. 75 MiB on default settings).
Force-pushed from 956a6b4 to b6f8c48.
Thanks a lot for testing this @hodlinator and @Eunovo. It seems that different configurations behave differently here, so I've retested a few scenarios on a few different platforms to understand what the performance increase depends on exactly. It's not even a linear speedup (i.e. sometimes 32 MiB is faster, sometimes it's 64 MiB).

First, on a Raspberry Pi the difference is obviously a lot better with a bigger batch, even with a tiny memory increase:

- for very small dbcache, a 64 MiB batch size would make `assumeutxo` load and dump ~59% faster on average on a Pi;
- for 500 MB dbcache, 32 MiB would make it 30% faster.
For a relatively performant i7 processor with HDD, the values I'm getting are all over the place, but it's obvious that the current batch size isn't the best. Looks like dynamic sizing isn't as performant as powers of two. The 32 MiB version seems pretty good though. (Detailed numbers omitted.)
Also measured on a very performant i9 with a really fast NVMe, making the difference between memory and background storage more blurry. The measured configurations:

- dbcache=500 MB (dynamic batch size would be 17.8 MB)
- dbcache=1000 MB (dynamic batch size would be 35.6 MB)
- dbcache=1500 MB (dynamic batch size would be 53.3 MB)
- dbcache=2000 MB (dynamic batch size would be 71.1 MB)
- dbcache=2500 MB (dynamic batch size would be 88.9 MB)
- dbcache=3000 MB (dynamic batch size would be 106.7 MB)
- dbcache=3500 MB (dynamic batch size would be 124.4 MB)
- dbcache=4000 MB (dynamic batch size would be 142.2 MB)

Edit: The above ones are the complete AssumeUTXO load & dump. Extracting only the batch saving times (the only change of the PR) on an i7 with HDD reveals a 40% speedup for default memory and a 3% speedup for a bigger one with fewer batch writes. (Measurement plots omitted.)
Edit 2: A similar measurement on a Raspberry Pi for only the dumping part of the UTXOs reveals a 70% speedup (before: ~39 minutes vs ~23 minutes after) for default dbcache (consisting of 58 separate batch writes).

So in summary, the dynamic scaling seems like a needless complication and 64 MiB seems to add too much extra memory. The above tests indicate that a constant 32 MiB is the better default.

Comparing the memory usage of master vs the new proposed fix reveals that we're still a lot below 1 GiB; the peak memory only increased marginally. (Plots omitted.)

What about dbcache=4500?! It's 65 MiB bigger and 3% faster. I was interested in the total memory usage of the PR vs master as well. (Plots omitted.)
This change is part of [IBD] - Tracking PR for speeding up Initial Block Download
Summary
When the in-memory UTXO set is flushed to LevelDB (after IBD or AssumeUTXO load), it does so in batches to manage memory usage during the flush.
A hidden `-dbbatchsize` config option exists to modify this value. This PR only changes the default from 16 MiB to 32 MiB.
Using a larger default reduces the overhead of many small writes and improves I/O efficiency (especially on HDDs). It may also help LevelDB optimize writes more effectively (e.g., via internal ordering).
Context
The UTXO set has grown significantly since 2017, when the original fixed 16 MiB batch size was chosen.

With the current multi-gigabyte UTXO set and the common practice of using larger `-dbcache` values, the fixed 16 MiB batch size leads to several inefficiencies:

- Each `WriteBatch` call incurs internal LevelDB overhead (e.g., MemTable handling, compaction-triggering logic). More frequent, smaller batches amplify this cumulative overhead (see the sketch after this list).
- Flush times of 20-30 minutes are not uncommon, even on capable hardware.
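A back-of-the-envelope sketch (my arithmetic, using the ~26 GiB flush figure from the logs quoted earlier) of how the batch count, and thus the per-batch fixed overhead, shrinks with a larger batch size:

```cpp
#include <cstdint>
#include <cstdio>

int main()
{
    constexpr std::uint64_t utxo_bytes{26ULL << 30}; // ~26 GiB flushed UTXO set
    constexpr std::uint64_t old_batch{16ULL << 20};  // 16 MiB current default
    constexpr std::uint64_t new_batch{32ULL << 20};  // 32 MiB proposed default

    // ~1664 WriteBatch calls vs ~832: the fixed LevelDB cost per call
    // (MemTable handling, compaction triggering) is paid half as often.
    std::printf("batches at 16 MiB: %llu\n", static_cast<unsigned long long>(utxo_bytes / old_batch));
    std::printf("batches at 32 MiB: %llu\n", static_cast<unsigned long long>(utxo_bytes / new_batch));
    return 0;
}
```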
Considerations
As noted by sipa, flushing involves a temporary memory usage increase as the batch is prepared. A larger batch size naturally leads to a larger peak during this phase. Crashing due to OOM during a flush is highly undesirable, but now that #30611 is merged, the most we'd lose is the first hour of IBD.

Increasing the LevelDB write batch size from 16 to 32 MiB raised the measured peaks by ~70 MiB in my tests during UTXO dump. The option remains hidden, and users can always override it.

The increased peak memory usage (detailed below) is primarily attributed to LevelDB's `leveldb::Arena` (backing MemTables) and the temporary storage of serialized batch data (e.g., `std::string` in `CDBBatch::WriteImpl`).

Performance gains are most pronounced on systems with slower I/O (HDDs), but some SSDs also show measurable improvements.
Measurements:

AssumeUTXO proxy, multiple runs with error bars (flushing time is faster than the measured loading + flushing). (Plot omitted.)

Reproducer: (script omitted)
This PR originally proposed 64 MiB, then a dynamic size, but both were dropped: 64 MiB increased peaks more than desired on low-RAM systems, and the dynamic variant underperformed across mixed hardware. 32 MiB is a simpler default that captures most of the gains with a modest peak increase.
For more details see: #31645 (comment)
While the PR isn't about IBD in general but rather about a critical section of it, I have measured a `-reindex-chainstate` until 900k blocks, showing a 1% overall speedup. (Details omitted.)