
Conversation

Contributor

@morcos morcos commented Dec 30, 2016

This is built off of #9375, #9252, and #9400. I'll properly rebase it when those are merged.

It provides special logic to issue a second getdata request for a cmpctblock if there is only 1 request outstanding and we think the announced block would be our new tip.

It also changes the cmpctblock processing logic to be first come first served (regardless of in flight block requests) and allow two simultaneous compact block reconstructions.

So in particular, it is now possible to:

  • receive headers -> request cmpctblock from peer 1
  • receive headers -> request cmpctblock from peer 2
  • receive cmpctblock -> request blocktxn from peer 3
  • receive cmpctblock -> request blocktxn from peer 4

Upon receiving a cmpctblock from peers 1 or 2 after this, it will be treated the same as receiving an unsolicited cmpctblock from peer 5 and it will attempt opportunistic reconstruction but not request blocktxn. It will also remove the block in flight for peer 1 or 2.

I believe that the multiple requests are an acceptable increase in bandwidth in order to provide robustness against a single peer stalling us at any point in the logic. The reason to allow a second request for a cmpctblock is to give LB peers a chance to become HB even if there is a staller who is always announcing first; otherwise, only existing HB peers would have a chance to deliver the block.

@morcos
Contributor Author

morcos commented Dec 30, 2016

Seems to fail sendheaders.py but not on my local machine... I'll look into it...


if (pindex->nStatus & BLOCK_HAVE_DATA) // Nothing to do here
return true;

LogPrintf("ChainWork check at height %d new: %s tip: %s\n",pindex->nHeight,pindex->nChainWork.GetHex(),chainActive.Tip()->nChainWork.GetHex());
if (pindex->nChainWork <= chainActive.Tip()->nChainWork || // We know something better
Contributor

Why not make this < instead of <=? Why do we avoid requesting compact blocks for competing best blocks?

Contributor

@morcos I couldn't see this added line in any of the 3 PRs you mentioned.

@@ -1893,24 +2022,41 @@ bool static ProcessMessage(CNode* pfrom, string strCommand, CDataStream& vRecv,
fBlockReconstructed = true;
}
}
if (pindex->nHeight == 165) {
Contributor

What is the deal with block height 165?

map<uint256, pair<NodeId, list<QueuedBlock>::iterator> >::iterator it = mapBlocksInFlight.find(resp.blockhash);
if (it == mapBlocksInFlight.end() || !it->second.second->partialBlock ||
it->second.first != pfrom->GetId()) {
bool fExpectedBLOCKTXN = false;
Contributor

Very much approve of the variable namings chosen in various places.

// We seem to be rather well-synced, so it appears pfrom was the first to provide us
// with this block! Let's get them to announce using compact blocks in the future.
MaybeSetPeerAsAnnouncingHeaderAndIDs(nodestate, pfrom, connman);
if (nodestate->fSupportsDesiredCmpctVersion && vGetData.size() == 1 && mmapBlocksInFlight.size() == mmapBlocksInFlight.count(vGetData[0].hash) && pindexLast->pprev->IsValid(BLOCK_VALID_CHAIN)) {
Contributor

@rebroad rebroad Dec 31, 2016

Why not make this <= 2 rather than == 1 (for mapBlocksInFlight.size())? This way, if two blocks get announced in close proximity, we can request compact blocks for both of them (rather than a compact block for the oldest and a full block for the most recent).

@rebroad
Contributor

rebroad commented Dec 31, 2016

I like the coding style and elegance of this. Will help with testing.

@morcos
Contributor Author

morcos commented Dec 31, 2016

@rebroad As mentioned in the PR comment, this is built off 3 other PRs. All of your comments either belong on those PRs or are related to the last commit, which was just for debugging the travis failure and will be removed. I have left it there for now in case anyone else wants to see the error details.

@TheBlueMatt
Contributor

@morcos claimed on IRC the test failures might be (in part) due to the issue mentioned at #9375 (comment)

@rebroad
Contributor

rebroad commented Jan 1, 2017

@morcos Although most of the lines I am commenting on have not been introduced by you, given that you are changing the code near them, I thought it was a good opportunity to suggest these changes, as I believe they ought to be changed at some point, so perhaps they could be with this PR.

@morcos
Contributor Author

morcos commented Jan 1, 2017

Rebased with the newest commit from #9375 which fixes the failure.

For clarity, only the 5 commits on which I'm the author are meant for review here. The others are contained in the linked PRs...

@maflcko
Member

maflcko commented Jan 1, 2017

OT: Imo it makes sense to octomerge all pulls which this pull depends on and rebase the commits of this pull on top of the merge commit. Thus, the original commit hashes are preserved and it is clear what was old and should be reviewed in other pulls. Also, it is easier to see the fresh commits.

Member

@instagibbs instagibbs left a comment

pre-rebase utACK. I still think it might make sense to only allow a second request if an HB peer is responsible for one of them, but I think that kind of change is complementary on top of this.

if (fPeerWasFirstRequest) {
// We requested this block, but it's far into the future, so our
// mempool will probably be useless - request the block normally
// Only allow the first peer to request a full block
Member

"Only ask for the full block from the first peer we requested from"?

state->nBlocksInFlightValidHeaders -= itInFlight->second.second->fValidatedHeaders;
if (state->nBlocksInFlightValidHeaders == 0 && itInFlight->second.second->fValidatedHeaders) {
// Last validated block on the queue was received.
nPeersWithValidatedDownloads--;
Member

could just do the -= itInFlight->second.second->fValidatedHeaders as above to match.
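The bool-subtraction style being suggested relies on a bool converting to 0 or 1. A minimal standalone illustration (not the PR's code; the function name is invented for the example):

```cpp
#include <cassert>

// Illustration only: subtracting a bool decrements by exactly one when the
// condition holds, mirroring the `-= fValidatedHeaders` style above.
int DecrementIfLastValidated(int nPeersWithValidatedDownloads,
                             int nBlocksInFlightValidHeaders,
                             bool fValidatedHeaders)
{
    // Equivalent to:
    //   if (nBlocksInFlightValidHeaders == 0 && fValidatedHeaders) --count;
    nPeersWithValidatedDownloads -=
        (nBlocksInFlightValidHeaders == 0 && fValidatedHeaders);
    return nPeersWithValidatedDownloads;
}
```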

BlockDownloadMap::iterator itInFlight = range.first;
ClearDownloadState(itInFlight);
range.first++;
mmapBlocksInFlight.erase(itInFlight);
Member

since C++11 can also just capture the return value and set as range.first instead of worrying about iterator invalidation.
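A minimal sketch of that idiom (standalone, not the PR's code): since C++11, `multimap::erase(iterator)` returns the iterator following the erased element, so the loop can advance through the return value without pre-incrementing:

```cpp
#include <cassert>
#include <map>

// Erase every entry with the given key, advancing via erase's return value
// instead of incrementing the iterator before erasing (valid since C++11).
int EraseAllForKey(std::multimap<int, int>& m, int key)
{
    int nErased = 0;
    auto range = m.equal_range(key);
    while (range.first != range.second) {
        range.first = m.erase(range.first); // returns the next iterator
        ++nErased;
    }
    return nErased;
}
```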

@@ -105,7 +105,8 @@ namespace {
bool fValidatedHeaders; //!< Whether this block has validated headers at the time of request.
std::unique_ptr<PartiallyDownloadedBlock> partialBlock; //!< Optional, used for CMPCTBLOCK downloads
};
map<uint256, pair<NodeId, list<QueuedBlock>::iterator> > mapBlocksInFlight;
typedef std::multimap<uint256, pair<NodeId, list<QueuedBlock>::iterator>> BlockDownloadMap;
Contributor

I think indexing by NodeId might be better than using a multimap, because it would ensure that there couldn't be multiple entries for a block from the same node. I'd suggest:

typedef map<pair<uint256, NodeId>, list<QueuedBlock>::iterator> BlockDownloadMap;

This also would let you replace some of the while loops added here with direct lookups. I thought some of these (especially the while loop with the break statement setting fExpectedBLOCKTXN) were kind of confusing.

The downside of using a map is that you wouldn't have an equal_range method to call in the places that do require a loop. But you could replace those equal_range calls with calls to an equivalent helper function:

pair<BlockDownloadMap::iterator, BlockDownloadMap::iterator>
GetBlockDownloadRange(BlockDownloadMap& blocks, const uint256& hash) {
    return {blocks.lower_bound({hash, numeric_limits<NodeId>::min()}),
            blocks.upper_bound({hash, numeric_limits<NodeId>::max()})};
}

Contributor Author

I'm unsure about this change. In particular there are a lot of mmapBlocksInFlight.count(hash) calls that would get a bit less clear... I'll think about it some more

Contributor

Yeah, I noticed that in the later commits. You could have a count helper function returning std::distance(range.first, range.second). I do think a map is a better way to represent the data, but the C++ map implementation does make it a little awkward. Anyway, it's just something to consider.
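Assuming the pair-keyed map layout proposed earlier in this thread, that count helper could look like the following sketch. The simplified `Hash` and value types stand in for uint256 and the queued-block iterator; `CountBlockDownloads` is an invented name for illustration.

```cpp
#include <cassert>
#include <cstdint>
#include <iterator>
#include <limits>
#include <map>
#include <utility>

// Simplified stand-ins: a real implementation would key on
// pair<uint256, NodeId> and store list<QueuedBlock>::iterator values.
using Hash = uint64_t;
using NodeId = int64_t;
using BlockDownloadMap = std::map<std::pair<Hash, NodeId>, int>;

// Count in-flight downloads for one block by measuring the contiguous key
// range [(hash, min NodeId), (hash, max NodeId)].
size_t CountBlockDownloads(const BlockDownloadMap& blocks, Hash hash)
{
    auto first = blocks.lower_bound({hash, std::numeric_limits<NodeId>::min()});
    auto last = blocks.upper_bound({hash, std::numeric_limits<NodeId>::max()});
    return static_cast<size_t>(std::distance(first, last));
}
```

Because the map is ordered by (hash, NodeId), all entries for one block are contiguous, so a lower_bound/upper_bound pair recovers the same range a multimap's equal_range would give.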

@@ -2292,6 +2292,18 @@ bool static ProcessMessage(CNode* pfrom, string strCommand, CDataStream& vRecv,
}
pindexWalk = pindexWalk->pprev;
}
// Special case for second cmpctblock request of tip
Contributor

Could you expand this comment a little bit to describe the condition being checked? In particular I don't understand how the IsWitnessEnabled and fHaveWitness parts relate to making the request.

Contributor

Also, maybe consider just pulling the MSG_CMPCT_BLOCK setting a bit later down up to here and just requesting the block in this if statement (possibly before the while loop above).

@@ -1947,7 +1947,7 @@ bool static ProcessMessage(CNode* pfrom, string strCommand, CDataStream& vRecv,

if (pindex->nChainWork <= chainActive.Tip()->nChainWork || // We know something better
pindex->nTx != 0) { // We had this block at some point, but pruned it
if (fAlreadyInFlight) {
if (fInFlightFromSamePeer) {
Contributor

Is this change ("Only request full blocks from the peer we thought had the block in-flight") a change in behavior? Or is it just a cleanup after the previous multimap commit? It seems like this commit should be merged into the preceding or following one, or the commit message should be extended to say what the effect is, what motivates it.

@morcos
Contributor Author

morcos commented Jan 6, 2017

Rebased and I think done the way @MarcoFalke suggested.

Addressed feedback

@morcos
Contributor Author

morcos commented Jan 17, 2017

rebased

@da2ce7

da2ce7 commented Feb 7, 2017

#9375, #9252, and #9400 are merged, needs rebase.

…ight

This is a change in behavior so that if for some reason we request a block from a peer, we don't allow an unsolicited CMPCT_BLOCK announcement for that same block to cause a request for a full block from the uninvited peer (as some type of request is already outstanding from the original peer)
@morcos
Contributor Author

morcos commented Feb 14, 2017

rebased

@@ -274,6 +274,41 @@ void InitializeNode(CNode *pnode, CConnman& connman) {
PushNodeVersion(pnode, connman, GetTime());
}

// Requires cs_main
Contributor

Can you AssertLockHeld?

state->nStallingSince = 0;
}

// Requires cs_main.
Contributor

Can you AssertLockHeld?
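The idea behind replacing a "// Requires cs_main" comment with AssertLockHeld can be shown with a toy lock wrapper. Bitcoin Core's real macros also do lock-order tracking; everything below (TrackedLock, ASSERT_LOCK_HELD, GuardedRead) is an invented sketch, not the project's implementation.

```cpp
#include <cassert>

// Toy lock that records whether it is held, so a function can assert its
// locking precondition at runtime rather than documenting it in a comment.
class TrackedLock {
public:
    void lock() { m_held = true; }
    void unlock() { m_held = false; }
    bool held() const { return m_held; }
private:
    bool m_held = false;
};

#define ASSERT_LOCK_HELD(cs) assert((cs).held())

// Example of a function that enforces the lock instead of just saying so:
int GuardedRead(const TrackedLock& cs, int value)
{
    ASSERT_LOCK_HELD(cs); // aborts in debug builds if the caller forgot to lock
    return value;
}
```

The advantage over the comment is that a forgotten lock fails loudly in debug builds instead of silently racing.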

@@ -2292,6 +2292,18 @@ bool static ProcessMessage(CNode* pfrom, string strCommand, CDataStream& vRecv,
}
pindexWalk = pindexWalk->pprev;
}
// Special case for second cmpctblock request of tip
Contributor

Also, maybe consider just pulling the MSG_CMPCT_BLOCK setting a bit later down up to here and just requesting the block in this if statement (possibly before the while loop above).

mmapBlocksInFlight.size() == mmapBlocksInFlight.count(pindexLast->GetBlockHash()) &&
mmapBlocksInFlight.count(pindexLast->GetBlockHash()) < MAX_CMPCTBLOCKS_INFLIGHT_PER_BLOCK &&
!(pindexLast->nStatus & BLOCK_HAVE_DATA) &&
(!IsWitnessEnabled(pindexLast->pprev, chainparams.GetConsensus()) || State(pfrom->GetId())->fHaveWitness) &&
Contributor

In the strange case that a peer does not set the fHaveWitness service bit, but does announce compact blocks v2, I believe this line would result in a full block request. More generally, because the two if statements always have to be in sync to avoid this, I really prefer we pull the actual request logic into this if statement.

@@ -2072,8 +2086,7 @@ bool static ProcessMessage(CNode* pfrom, const std::string& strCommand, CDataStr
// We want to be a bit conservative just to be extra careful about DoS
// possibilities in compact block processing...
if (pindex->nHeight <= chainActive.Height() + 2) {
if ((!fAlreadyInFlight && nodestate->nBlocksInFlight < MAX_BLOCKS_IN_TRANSIT_PER_PEER) ||
fInFlightFromSamePeer) {
if ((countPartialBlocksStarted < MAX_CMPCTBLOCKS_INFLIGHT_PER_BLOCK && nodestate->nBlocksInFlight < MAX_BLOCKS_IN_TRANSIT_PER_PEER)) {
Contributor

The way I read this, the use of countPartialBlocksStarted, instead of a countBlocksStarted, means that we will request up to two compact blocks at a time, even if we are already requesting the full block from a peer. This seems strange to me, why not just max 2 in-flights at the same time for a given block, with the second never being a full block?

Contributor

Also, why drop the fInFlightFromSamePeer option? It looks like we'll never getblocktxn from two peers simultaneously?

!(pindexLast->nStatus & BLOCK_HAVE_DATA) &&
(!IsWitnessEnabled(pindexLast->pprev, chainparams.GetConsensus()) || State(pfrom->GetId())->fHaveWitness) &&
nodestate->fSupportsDesiredCmpctVersion) {
vToFetch.push_back(pindexLast);
Contributor

Further, simply adding the entry to vToFetch here may result in a null pointer dereference, I believe. If a peer announces two headers messages back-to-back, the first time we will MarkBlockAsInFlight to them, and the second time we'll hit this condition and add the entry to vToFetch again (which should also be fixed). Further down, we'll MarkBlockAsInFlight to them again, but MarkBlockAsInFlight requires that, if the block is already in-flight to the same peer, pit be something non-NULL as it will be dereferenced, but it is NULL in the call below.

Contributor

I believe the above needs fixing in three ways - MarkBlockAsInFlight needs to be more robust against NULL pit, the request needs to move into this if statement and skip the remainder of this block of code, and we shouldn't double-request from the same peer.

@@ -2094,6 +2107,7 @@ bool static ProcessMessage(CNode* pfrom, const std::string& strCommand, CDataStr
return true;
} else if (status == READ_STATUS_FAILED) {
// Duplicate txindexes, the block is now in-flight, so just request it
// NOTE: This is the one place two full block requests can be outstanding
Contributor

OK, so why not just check fPeerWasFirstRequest and MarkBlockAsNotInFlight otherwise?

@@ -9,6 +9,8 @@
#include "net.h"
#include "validationinterface.h"

/** Maximum number of outstanding CMPCTBLOCK requests for the same block. */
static const int MAX_CMPCTBLOCKS_INFLIGHT_PER_BLOCK = 2;
Contributor

Should this be configurable? Might make sense for miners to have this higher.

Contributor

Likely not: asking all your peers for a copy of the block poses the same network-DoS risks as connecting to hundreds of nodes, which people like to do because they believe it will help (though it usually actually hurts) them get their blocks out faster.

More importantly, if you're a miner and have good peers, I think it'd be somewhat rare for you to receive a third compact block announce before the first can respond to your blocktxn request, at least it will be once we get proper multi-threaded ProcessMessages implemented to respond to blocktxn requests for the latest block in the background (see #10652 for the beginnings of the steps to do so).

Contributor

What do you think the upper limit is before it would generally cause a negative impact? Maybe just have an upper limit like we do for max outbound connections.

Contributor

Probably around 2 :p. It's really only useful if your peer got stuck doing something and wasn't able to respond, or is being actively malicious. Once we've fixed the block-on-block-validation-before-responding-to-blocktxn-requests issue, it should be somewhat rare for this to help more than a very small amount.

@jtimon
Contributor

jtimon commented Sep 6, 2017

More concurrency! concept ACK

@TheBlueMatt
Contributor

This should likely be closed in favor of #10984.

@ryanofsky
Contributor

Should this still be closed in favor of #10984?

@morcos morcos closed this Nov 9, 2017
@bitcoin bitcoin locked as resolved and limited conversation to collaborators Sep 8, 2021
10 participants