Conversation

@fjahr fjahr commented Dec 31, 2021

This PR prevents getblockfrompeer from being used on blocks that the node has not synced past yet if the node is running in prune mode.

Problem

While a node is still catching up to the tip that it is aware of via the headers, the user can currently use getblockfrompeer to fetch blocks close to or at the tip. These blocks are stored in the current block/rev files, which otherwise contain blocks the node is receiving as part of the syncing process.

This creates a problem for pruned nodes: the files containing a fetched block are not pruned during syncing because they contain a block close to the tip. This means the entire file (~130MB) will not be pruned until the tip has moved on far enough from the fetched block. In extreme cases with heavy pruning (like -prune=550) and multiple blocks being fetched, this could mean that the disk usage far exceeds what the user expects, potentially running out of space.
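The pinning effect can be sketched with a toy Python model (an illustration only, not Bitcoin Core's actual pruning code; the heights and the 288-block keep window are assumptions chosen for the example): a block file becomes prunable only once every block it stores is far enough behind the validated tip, so a single fetched block near the header tip keeps its entire file on disk.

```python
# Toy model of block-file pruning (illustrative assumptions, not
# Bitcoin Core's implementation).
MIN_BLOCKS_TO_KEEP = 288  # mirrors Bitcoin Core's default keep window

def prunable_files(files, validated_tip):
    """A file can be pruned only if ALL blocks it stores are at least
    MIN_BLOCKS_TO_KEEP blocks below the validated tip."""
    return [
        name for name, heights in files.items()
        if max(heights) <= validated_tip - MIN_BLOCKS_TO_KEEP
    ]

# A node syncing at height ~10,000 that fetched the header-tip block
# (height 720,000, hypothetical) via getblockfrompeer:
files = {
    "blk00000.dat": list(range(0, 5_000)),
    "blk00001.dat": list(range(5_000, 9_000)) + [720_000],  # fetched block lands here
}

# blk00001.dat is "pinned": without the fetched block it would already
# be prunable, but now it stays on disk until the validated tip passes
# 720,000 + 288.
print(prunable_files(files, validated_tip=10_000))  # -> ['blk00000.dat']
```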

Approach

There are certainly other approaches that could fix the problem while still allowing the current behavior, but all of the ideas I came up with seemed like overkill for a niche problem on a new RPC where it's still unclear how, and how much, it will be used.

Testing

So far I have not seen a simple enough way to test this; I am still looking into it, and if it turns out to be complex I will potentially add a test in a follow-up. What would be needed is a way to have a node fetch headers but not sync the blocks yet, which seems like a pattern that could be generally useful.

To manually reproduce the problematic behavior:

  1. Start a node on current master with -prune=550 and an empty/new datadir; Testnet and Mainnet should both work.
  2. While the node is syncing run getblockfrompeer on the current tip and a few other recent blocks.
  3. Go to your datadir and observe the blocks folder: there should be a few full blk*.dat and rev*.dat files that are not being pruned. Once you have "pinned" a few of these files, the blocks folder should be significantly above the target size of 550MB.
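The observation in step 3 can also be made with a short script (a sketch; the default datadir location and the 550MB target are assumptions matching the repro settings above):

```python
import os
from pathlib import Path

def blocks_dir_usage_mib(blocks_dir):
    """Sum the sizes of all blk*.dat and rev*.dat files, in MiB."""
    total = sum(
        f.stat().st_size
        for pattern in ("blk*.dat", "rev*.dat")
        for f in Path(blocks_dir).glob(pattern)
    )
    return total / (1024 * 1024)

if __name__ == "__main__":
    # Assumed default datadir; adjust for -datadir/testnet as needed.
    blocks_dir = Path(os.path.expanduser("~/.bitcoin/blocks"))
    if blocks_dir.is_dir():
        print(f"blocks dir: {blocks_dir_usage_mib(blocks_dir):.0f} MiB "
              f"(prune target: 550 MiB)")
```

With a few pinned ~130MB blk/rev file pairs, the reported usage should sit well above the 550MB target.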

DrahtBot commented Jan 1, 2022

The following sections might be updated with supplementary metadata relevant to reviewers and maintainers.

Conflicts

Reviewers, this pull request conflicts with the following ones:

  • #23813 (Add test and docs for getblockfrompeer with pruning by fjahr)

If you consider this pull request important, please also help to review the conflicting pull requests. Ideally, start with the one that should be merged first.

brunoerg commented Jan 1, 2022

Concept ACK.

I started testing it by creating a functional test. Not sure if the approach is right: I started two nodes, the second one with -prune=550; then node0 mines 200 blocks, I get the bestblockhash from node0 and use it with getblockfrompeer (node1 --> node0), and it should return an error (apparently it worked):

def run_test(self):
        self.log.info("Mine 200 blocks on Node 0")
        self.generate(self.nodes[0], 200, sync_fun=self.no_op)
        assert_equal(self.nodes[0].getblockcount(), 400)

        self.log.info("Connect nodes")
        self.connect_nodes(0, 1)

        peers = self.nodes[1].getpeerinfo()
        assert_equal(len(peers), 1)
        peer_1_peer_0_id = peers[0]["id"]
        best_block_hash_0 = self.nodes[0].getbestblockhash()
        assert_raises_rpc_error(-1, 'In prune mode, only blocks that the node has already synced previously can be fetched from a peer', self.nodes[1].getblockfrompeer, best_block_hash_0, peer_1_peer_0_id)

On the master branch this test would fail, because an exception won't be raised.

fjahr commented Jan 1, 2022

I started testing it by creating a functional test. Not sure if the approach is right: I started two nodes, the second one with -prune=550; then node0 mines 200 blocks, I get the bestblockhash from node0 and use it with getblockfrompeer (node1 --> node0), and it should return an error (apparently it worked):

Hey @brunoerg , thanks for giving it a try but unfortunately I don't think this approach works reliably. The problem is that the outcome of this test is a race because node 1 is downloading the blocks from node 0 in the background. It may have the current tip in flight by the time the assert is called, or not. We try to avoid tests where the outcome is not 100% reliable because we are seeing intermittent test failures quite often already.

What would be needed is a reliable way to let node 1 sync the headers but then prevent it from syncing the blocks. It seems we don't have something like this and I am now thinking about how this could be done and where else it could be useful.

brunoerg commented Jan 2, 2022

The problem is that the outcome of this test is a race because node 1 is downloading the blocks from node 0 in the background.

Interesting, this is new to me.

def setup_network(self):
  self.setup_nodes()

even with this config (setup_network), will node1 download the blocks from node0 in the background after node0 mines 200 more blocks?

def run_test(self):
        self.log.info("Mine 200 blocks on Node 0")
        self.generate(self.nodes[0], 200, sync_fun=self.no_op)
        assert_equal(self.nodes[0].getblockcount(), 400)

        self.log.info("Connect nodes")
        self.connect_nodes(0, 1)

        peers = self.nodes[1].getpeerinfo()
        assert_equal(len(peers), 1)
        peer_1_peer_0_id = peers[0]["id"]
        best_block_hash_0 = self.nodes[0].getbestblockhash()
        assert_raises_rpc_error(-1, 'In prune mode, only blocks that the node has already synced previously can be fetched from a peer', self.nodes[1].getblockfrompeer, best_block_hash_0, peer_1_peer_0_id)

        self.sync_blocks()
        self.nodes[1].getblockfrompeer(best_block_hash_0, peer_1_peer_0_id)

I thought the first getblockfrompeer call inside the assert would fail because I didn't set the test up to sync blocks, and the second one would work because of self.sync_blocks().

fjahr commented Jan 2, 2022

even with this config (setup_network), will node1 download the blocks from node0 in the background after node0 mines 200 more blocks?

No, the nodes indeed cannot sync if they are not connected. But you are connecting the nodes in your test here: self.connect_nodes(0, 1). The moment the nodes are connected they also start syncing. You can see this, for example, by inserting print(self.nodes[1].getblockcount()) on the line before your first assert and then running the test a couple of times. You will see that the node is not at 200 blocks anymore, and each time you run the test it will be a different number because of this race between the different processes.

I thought the first getblockfrompeer call inside the assert would fail because I didn't set the test up to sync blocks, and the second one would work because of self.sync_blocks().

What sync_blocks() does is wait for all the nodes to be caught up with each other. This ensures the reverse of our problem doesn't happen: a test where all the nodes need to be caught up to continue does not fail intermittently, so the second assert is safe. But for the first assert we would basically need the inverse of that functionality, i.e. something that ensures the nodes are definitely not caught up with each other until we have finished what we want to test.
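For illustration, the positive wait the framework relies on can be sketched like this (a simplified stand-in for the test framework's wait_until helper, not its actual implementation); the docstring notes why the inverse check is inherently racy:

```python
import time

def wait_until(predicate, timeout=10, poll=0.05):
    """Poll until predicate() becomes true or the timeout expires.

    Waiting for a condition to BECOME true is reliable: background
    progress only helps. Asserting that a condition is CURRENTLY false
    (e.g. "node1 has not synced the tip yet") is a race, because a
    background process may flip it between the check and the assertion.
    """
    deadline = time.time() + timeout
    while time.time() < deadline:
        if predicate():
            return
        time.sleep(poll)
    raise AssertionError("condition not met within timeout")
```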

fjahr commented Jan 2, 2022

I think we have a working test now. I remembered that we can submit the header via a P2PInterface to the pruning node and that seems to work. :)

brunoerg commented Jan 2, 2022

You will see that the node is not at 200 blocks anymore, and each time you run the test it will be a different number because of this race between the different processes.

Yes, my bad. That approach only appears to work because node0 is mining 200 more blocks, so node1 probably hasn't received the last one before the assertion; but the nodes are syncing, so it could cause intermittent failures.

@brunoerg brunoerg left a comment

tACK 105b287

Compiled the branch code on macOS 12 and started bitcoind with an empty datadir:
./src/bitcoind --prune=550 --daemon

Got a hash of a recent block from a block explorer: 00000000000000000001fede733d9ad94b9a9cdb07237cba25c556f8f807db4b

Executed getpeerinfo to see the connections and get some peer ids to use.

And then, executed getblockfrompeer with 00000000000000000001fede733d9ad94b9a9cdb07237cba25c556f8f807db4b as block hash.

➜  bitcoin git:(fjahr) ✗ ./src/bitcoin-cli getblockfrompeer 00000000000000000001fede733d9ad94b9a9cdb07237cba25c556f8f807db4b 11
error code: -1
error message:
In prune mode, only blocks that the node has already synced previously can be fetched from a peer
➜  bitcoin git:(fjahr) ✗ ./src/bitcoin-cli getblockfrompeer 00000000000000000001fede733d9ad94b9a9cdb07237cba25c556f8f807db4b 8 
error code: -1
error message:
In prune mode, only blocks that the node has already synced previously can be fetched from a peer
➜  bitcoin git:(fjahr) ✗ ./src/bitcoin-cli getblockfrompeer 00000000000000000001fede733d9ad94b9a9cdb07237cba25c556f8f807db4b 5
error code: -1
error message:
In prune mode, only blocks that the node has already synced previously can be fetched from a peer

Sjors commented Jan 6, 2022

Concept ACK, but light selfish preference for getting #23706 in first, since there's a (small) conflict.

luke-jr commented Jan 12, 2022

Weak concept NACK. I think it's better to allow it. Pruning is only best-effort anyway, not a guarantee.

@fjahr fjahr force-pushed the 2021-12-prunefutureblockfetch branch from 105b287 to b750720 Compare January 25, 2022 20:20
fjahr commented Jan 25, 2022

Rebased

Weak concept NACK. I think it's better to allow it. Pruning is only best-effort anyway, not a guarantee.

Why do you think it's better to allow it? Do you have any specific use cases in mind? Of course there is no guarantee to stay below the exact number, but for -prune=550 this has the potential to double or triple that number quickly. And the worst-case scenario of crashing due to a full disk seems bad enough that it's worth disabling something where it's unclear if there is a use case for it at all (using the RPC in this way). If there is a use case I am genuinely interested in hearing about it and would then look for a better solution rather than just disallowing it.

Sjors commented Jan 26, 2022

In the context of ForkMonitor I use this feature to fetch blocks that are, by definition, at or below the tip height, i.e. stale blocks. This PR doesn't impact that use case, because these nodes are always up to date.

In fact, this PR adds some safety for when a fresh node is added to the site, or an existing node is reinstalled, and it's still catching up (though in practice we don't call getblockfrompeer on a node that is in IBD).

Another use case that seems obvious to me is fetching a historical block, perhaps because you're rescanning a wallet. In that case I don't see any harm in waiting until the node is synced.

@fjahr fjahr force-pushed the 2021-12-prunefutureblockfetch branch from 3ec2cba to 92f4ca2 Compare January 29, 2022 16:56
fjahr commented Jan 29, 2022

Rebased

fjahr added 2 commits June 6, 2022 01:34
While a node is still catching up to the tip that it is aware of via the headers, the user can currently use getblockfrompeer to fetch blocks close to the tip. These blocks are stored in the current block/rev files, which otherwise contain blocks the node is receiving as part of the syncing process.

This creates a problem for pruned nodes: the files containing a fetched block are not pruned during syncing because they contain a block close to the tip. This means the entire file will not be pruned until the tip has moved on far enough from the fetched block. In extreme cases with heavy pruning (550) and multiple blocks being fetched this could mean that the disk usage far exceeds what the user expects, potentially running out of space.
@fjahr fjahr force-pushed the 2021-12-prunefutureblockfetch branch from 92f4ca2 to 5826bf5 Compare June 5, 2022 23:44
fjahr commented Jun 5, 2022

Rebased

Sjors commented Jul 18, 2022

utACK 5826bf5

Very nice test.

@achow101

ACK 5826bf5

@@ -453,6 +453,12 @@ static RPCHelpMan getblockfrompeer()
throw JSONRPCError(RPC_MISC_ERROR, "Block header missing");
}

// Fetching blocks before the node has synced past their height can prevent block files from
// being pruned, so we avoid it if the node is in prune mode.
if (index->nHeight > chainman.ActiveChain().Tip()->nHeight && node::fPruneMode) {

In 7fa851f: (non-blocking nit)

Why not use IsBlockPruned instead?

That should be the same as saying that, in prune mode, you can only fetch blocks that were already downloaded and then discarded.

@aureleoules aureleoules Oct 26, 2022

If IsBlockPruned were used instead, fetching an older block that isn't pruned (blocks close to the tip, for instance) on a pruned node would result in the error In prune mode, only blocks that the node has already synced previously can be fetched from a peer instead of Block already downloaded.
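The error precedence described here can be modeled in a few lines of Python (a sketch with hypothetical names, not the actual rpc/blockchain.cpp control flow):

```python
PRUNE_ERROR = ("In prune mode, only blocks that the node has already "
               "synced previously can be fetched from a peer")

def getblockfrompeer_check(block_height, tip_height, have_block_data, prune_mode):
    """Model of the error precedence: an old block whose data is still
    on disk reports 'Block already downloaded', and only a block beyond
    the synced tip of a pruned node hits the new prune-mode error."""
    if prune_mode and block_height > tip_height:
        return PRUNE_ERROR
    if have_block_data:
        return "Block already downloaded"
    return None  # OK to request the block from the peer
```

A quick check of the cases: an unpruned recent block on a pruned node returns "Block already downloaded", while a block above the synced tip returns the prune-mode error.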


Doesn't chainman.ActiveChain() need to be called with cs_main locked?

    CChain& ActiveChain() const EXCLUSIVE_LOCKS_REQUIRED(GetMutex()) { return ActiveChainstate().m_chain; }


Hmm, indeed it does. I really need to be building with clang...

rpc/blockchain.cpp:464:35: warning: calling function 'ActiveChain' requires holding mutex 'cs_main' exclusively [-Wthread-safety-analysis]
    if (index->nHeight > chainman.ActiveChain().Tip()->nHeight && node::fPruneMode) {
                                  ^
1 warning generated.

@aureleoules aureleoules left a comment

tACK 5826bf5
I tested the behavior by invalidating the tip with invalidateblock and trying to fetch it again with getblockfrompeer, which resulted in In prune mode, only blocks that the node has already synced previously can be fetched from a peer as expected.

@achow101 achow101 merged commit 88502ec into bitcoin:master Oct 26, 2022
maflcko pushed a commit that referenced this pull request Oct 26, 2022
f5ff3d7 rpc: add missing lock around chainman.ActiveTip() (Andrew Toth)

Pull request description:

  #23927 seems to have missed a lock around `chainman.ActiveChain()`.

ACKs for top commit:
  aureleoules:
    ACK f5ff3d7

Tree-SHA512: 3f116ca44c1b2bc0c7042698249ea3417dfb7c0bb81158a7ceecd087f1e02baa89948f9bb7924b1757798a1691a7de6e886aa72a0a9e227c13a3f512cc59d6c9
sidhujag pushed a commit to syscoin/syscoin that referenced this pull request Oct 27, 2022
@bitcoin bitcoin locked and limited conversation to collaborators Oct 26, 2023