Yet another change to reduce recursive mempool locking #19917

promag · 2020-09-08T13:57:08Z

The first 2 commits avoid alternately unlocking and locking mempool.cs and cs_main by splitting one loop into two loops, so that each mutex stays locked throughout its corresponding loop.

Then the explicit lock inside CTxMemPool::RemoveUnbroadcastTx, CTxMemPool::GetUnbroadcastTxs and CTxMemPool::exists is removed, so callers must hold the lock themselves. This requires just 3 explicit WITH_LOCK where exists() is called and just 1 where GetUnbroadcastTxs() is called. This can be improved by adding an auxiliary function that locks and calls the original.
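
A minimal sketch of the auxiliary-function idea, assuming a toy `Pool` class built on `std::mutex`. All names here (`Pool`, `ExistsLocked`, `Exists`, `Add`) are hypothetical and are not taken from the PR; the real code uses Bitcoin Core's own mutex types and macros.

```cpp
#include <cassert>
#include <mutex>
#include <set>
#include <string>

// Hypothetical stand-in for a mempool; names are illustrative only.
class Pool
{
public:
    mutable std::mutex cs;

    // Caller must already hold `cs`; no lock is taken here.
    bool ExistsLocked(const std::string& txid) const
    {
        return m_txids.count(txid) > 0;
    }

    // Auxiliary helper: takes the lock, then calls the non-locking variant.
    bool Exists(const std::string& txid) const
    {
        std::lock_guard<std::mutex> lock(cs);
        return ExistsLocked(txid);
    }

    // Inserts a txid under the lock.
    void Add(const std::string& txid)
    {
        std::lock_guard<std::mutex> lock(cs);
        m_txids.insert(txid);
    }

private:
    std::set<std::string> m_txids;
};
```

Callers that already hold `cs` would use `ExistsLocked`, while one-off callers would use `Exists`, so neither path acquires the mutex twice.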

promag · 2020-09-08T14:04:04Z

@hebasto this is an example of stuff that I think we can do before moving locks around. The first and second commits refactor ReattemptInitialBroadcast to drop unlock/lock in each iteration.

promag · 2020-09-08T14:04:35Z

@vasild @ryanofsky your comment here would be nice too.

hebasto · 2020-09-08T15:50:10Z

Why draft?

promag · 2020-09-08T15:52:23Z

Yeah I don't mind setting it ready for review, the goal was to show an example rather than adding noise to your PR.

src/net_processing.cpp

hebasto · 2020-09-08T15:55:14Z

> Yeah I don't mind setting it ready for review, the goal was to show an example rather than adding noise to your PR.

I'll be happy to postpone #19872 until this PR is reviewed and merged :)

hebasto · 2020-09-08T16:25:42Z

src/net_processing.cpp

+    {
+        LOCK(cs_main);
+        for (const auto& elem : relay_transactions) {
+            RelayTransaction(elem.first, elem.second, m_connman);
+        }
+    }


8f30df2
Does this approach decrease concurrency with respect to ::cs_main uninterruptible locking?

RelayTransaction doesn't hold cs_main that long.

hebasto · 2020-09-08T17:55:30Z

Concept ACK.

vasild · 2020-09-08T19:13:43Z

The changes in PeerManager::ReattemptInitialBroadcast() will execute CTxMemPool::GetUnbroadcastTxs() and CTxMemPool::exists() under CTxMemPool::cs whereas previously they were not called under this mutex. Both methods acquire the mutex themselves. So this adds more recursive mutex locks.
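
To illustrate the concern, here is a rough sketch, assuming a toy pool whose accessor locks its own recursive mutex. `CountingRecursiveMutex`, `Pool`, and the two `Depth...` functions are hypothetical names invented for this example. Calling the accessor while already holding the mutex raises the nesting depth from 1 to 2, which is the kind of recursive acquisition described above.

```cpp
#include <cassert>
#include <mutex>

// Hypothetical wrapper that records the deepest nesting seen on a recursive mutex.
class CountingRecursiveMutex
{
public:
    void lock()
    {
        m_mutex.lock();
        ++m_depth;
        if (m_depth > m_max_depth) m_max_depth = m_depth;
    }
    void unlock()
    {
        --m_depth;
        m_mutex.unlock();
    }
    int MaxDepth() const { return m_max_depth; }

private:
    std::recursive_mutex m_mutex;
    int m_depth{0};
    int m_max_depth{0};
};

// Toy pool whose accessor acquires the mutex itself.
class Pool
{
public:
    CountingRecursiveMutex cs;

    bool Exists()
    {
        std::lock_guard<CountingRecursiveMutex> lock(cs);
        return true;
    }
};

// The accessor is the only locker: depth 1.
int DepthWithoutOuterLock()
{
    Pool pool;
    pool.Exists();
    return pool.cs.MaxDepth();
}

// The caller holds the mutex and the accessor locks it again: depth 2.
int DepthWithOuterLock()
{
    Pool pool;
    {
        std::lock_guard<CountingRecursiveMutex> lock(pool.cs);
        pool.Exists();
    }
    return pool.cs.MaxDepth();
}
```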

What is the purpose of this patch? There is no description, and the commit messages are a bit scarce: they don't answer "Why are we doing this?".

promag · 2020-09-08T19:28:01Z

@vasild sure I can detail the intention. See #19917 (comment).

hebasto · 2020-09-09T07:19:42Z

@promag To fix TSan errors consider comparing dde441c with 049d8c5.

vasild · 2020-09-09T06:59:49Z

src/txmempool.cpp

@@ -421,7 +421,7 @@ void CTxMemPool::removeUnchecked(txiter it, MemPoolRemovalReason reason)
    for (const CTxIn& txin : it->GetTx().vin)
        mapNextTx.erase(txin.prevout);

-    RemoveUnbroadcastTx(hash, true /* add logging because unchecked */ );
+    WITH_LOCK(cs, RemoveUnbroadcastTx(hash, true /* add logging because unchecked */ ));


missing return?
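
For context on the question above: a lambda-based locking macro only yields a value when the wrapped code contains a `return` statement. The sketch below uses a simplified, hypothetical `SKETCH_WITH_LOCK` built on `std::mutex`. It is an assumption made for illustration, not Bitcoin Core's definition of `WITH_LOCK`, and `g_cs`, `Bump`, `Read`, and `Demo` are made-up names.

```cpp
#include <cassert>
#include <mutex>

// Simplified stand-in for a WITH_LOCK-style macro: runs `code` inside a
// lambda while holding `cs`. Assumption: this mirrors the idea only.
#define SKETCH_WITH_LOCK(cs, code) [&] { std::lock_guard<std::mutex> lock(cs); code; }()

std::mutex g_cs;
int g_value = 0;

void Bump() { ++g_value; }
int Read() { return g_value; }

int Demo()
{
    // No `return` needed: the result of the wrapped call is not used.
    SKETCH_WITH_LOCK(g_cs, Bump());

    // `return` is needed here so that the lambda hands the value back.
    return SKETCH_WITH_LOCK(g_cs, return Read());
}
```

Under this sketch, omitting `return` is harmless when the wrapped call's result is discarded, and required when the caller wants the value.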

vasild · 2020-09-09T08:06:33Z

src/net_processing.cpp

 void PeerManager::ReattemptInitialBroadcast(CScheduler& scheduler) const
 {
-    std::map<uint256, uint256> unbroadcast_txids = m_mempool.GetUnbroadcastTxs();
-
-    for (const auto& elem : unbroadcast_txids) {
-        // Sanity check: all unbroadcast txns should exist in the mempool
-        if (m_mempool.exists(elem.first)) {
-            LOCK(cs_main);
+    std::vector<std::pair<uint256, uint256>> relay_transactions;
+    {
+        LOCK(m_mempool.cs);
+        std::map<uint256, uint256> unbroadcast_txids = m_mempool.GetUnbroadcastTxs();
+        relay_transactions.reserve(unbroadcast_txids.size());
+        for (const auto& elem : unbroadcast_txids) {
+            // Sanity check: all unbroadcast txns should exist in the mempool
+            if (m_mempool.exists(elem.first)) {
+                relay_transactions.push_back(elem);
+            } else {
+                m_mempool.RemoveUnbroadcastTx(elem.first, true);
+            }
+        }
+    }
+    {
+        LOCK(cs_main);
+        for (const auto& elem : relay_transactions) {
            RelayTransaction(elem.first, elem.second, m_connman);
-        } else {
-            m_mempool.RemoveUnbroadcastTx(elem.first, true);
        }
    }



This can be simplified to:

void PeerManager::ReattemptInitialBroadcast(CScheduler& scheduler) const
{
    for (const auto& elem : WITH_LOCK(m_mempool.cs, return m_mempool.GetUnbroadcastTxs())) {
        LOCK(cs_main);
        RelayTransaction(elem.first, elem.second, m_connman);
    }

    // Schedule next run for 10-15 minutes in the future.
    ...
}

Because m_mempool.exists() will always return true for a tx returned by m_mempool.GetUnbroadcastTxs() if we don't release m_mempool.cs between the two calls.

Also, the tx could be removed after we release m_mempool.cs and before we call RelayTransaction() and this is ok and is handled just fine.
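
The pattern suggested here (copy the set under the pool mutex, then iterate over the copy while taking the other mutex per item) can be sketched as follows. This is a toy model with `std::mutex` and string ids; `Pool`, `Snapshot`, `Add`, and `RelayAll` are hypothetical names, and "relaying" is reduced to recording the id.

```cpp
#include <cassert>
#include <map>
#include <mutex>
#include <string>
#include <vector>

// Toy pool holding txid -> wtxid pairs behind its own mutex.
class Pool
{
public:
    // Returns a copy taken under the lock; the lock is released on return.
    std::map<std::string, std::string> Snapshot() const
    {
        std::lock_guard<std::mutex> lock(m_cs);
        return m_unbroadcast;
    }

    void Add(const std::string& txid, const std::string& wtxid)
    {
        std::lock_guard<std::mutex> lock(m_cs);
        m_unbroadcast[txid] = wtxid;
    }

private:
    mutable std::mutex m_cs;
    std::map<std::string, std::string> m_unbroadcast;
};

// Iterates over the snapshot and locks `main_mutex` once per item.
// An entry removed from the pool after the snapshot is simply stale here.
std::vector<std::string> RelayAll(const Pool& pool, std::mutex& main_mutex)
{
    std::vector<std::string> relayed;
    for (const auto& elem : pool.Snapshot()) {
        std::lock_guard<std::mutex> lock(main_mutex);
        relayed.push_back(elem.first); // stand-in for the relay call
    }
    return relayed;
}
```

The two mutexes are never held at the same time in this sketch, because the pool lock is released before the loop body runs.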

cc @gzhao408

But is it really necessary to have cs_main lock in each iteration?

Since it was locked in each iteration before this PR, the question should rather be "If we move LOCK(cs_main) before the loop, why would we do that?"

The frequent lock/unlock allows other threads to proceed. I don't see a reason to change it.

> The frequent lock/unlock allows other threads to proceed. I don't see a reason to change it.

Agree (#19917 (comment)).

@vasild I don't agree with that. Other threads can proceed, but the current thread will wait unnecessarily for the lock in each iteration, and as such other things will be delayed, not to mention the mutex overhead. See https://stackoverflow.com/a/3652428.

In this case RelayTransaction is pretty quick, nothing that can potentially cause a big lock on cs_main.
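
The trade-off debated above can be made concrete with a small counting sketch. `CountingMutex`, `RelayLockingPerItem`, and `RelayLockingOnce` are hypothetical names, and the loop body is a placeholder for the relay call. The sketch only counts acquisitions; it does not measure contention or how long other threads wait.

```cpp
#include <cassert>
#include <cstddef>
#include <mutex>
#include <vector>

// Hypothetical mutex wrapper that counts how many times it was acquired.
class CountingMutex
{
public:
    void lock()
    {
        m_mutex.lock();
        ++m_acquisitions;
    }
    void unlock() { m_mutex.unlock(); }
    std::size_t Acquisitions() const { return m_acquisitions; }

private:
    std::mutex m_mutex;
    std::size_t m_acquisitions{0};
};

// Locks inside the loop: one acquisition per item, with a gap between items
// in which another thread could take the mutex.
std::size_t RelayLockingPerItem(const std::vector<int>& items)
{
    CountingMutex main_mutex;
    for (int item : items) {
        std::lock_guard<CountingMutex> lock(main_mutex);
        (void)item; // placeholder for the relay call
    }
    return main_mutex.Acquisitions();
}

// Locks once around the loop: a single acquisition held for the whole batch.
std::size_t RelayLockingOnce(const std::vector<int>& items)
{
    CountingMutex main_mutex;
    {
        std::lock_guard<CountingMutex> lock(main_mutex);
        for (int item : items) {
            (void)item; // placeholder for the relay call
        }
    }
    return main_mutex.Acquisitions();
}
```

Which variant is preferable depends on how long each iteration holds the mutex and how contended it is, which is the point under discussion in this thread.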

DrahtBot · 2020-09-16T00:14:12Z

🐙 This pull request conflicts with the target branch and needs rebase.

Want to unsubscribe from rebase notifications on this pull request? Just convert this pull request to a "draft".

promag · 2020-11-07T11:47:00Z

Rebase hell.

promag added 2 commits September 8, 2020 14:55

net: Batch RelayTransaction in PeerLogicValidation::ReattemptInitialBroadcast

8f30df2

net: Batch RemoveUnbroadcastTx in PeerLogicValidation::ReattemptInitialBroadcast

746c6d4

fanquake added P2P Refactoring labels Sep 8, 2020

promag marked this pull request as ready for review September 8, 2020 15:52

hebasto reviewed Sep 8, 2020

hebasto mentioned this pull request Sep 8, 2020

Replace all of the RecursiveMutex instances with the Mutex ones #19303

Open

36 tasks

promag added 3 commits September 8, 2020 22:06

refactor: CTxMemPool::RemoveUnbroadcastTx requires lock

7c35539

refactor: CTxMemPool::GetUnbroadcastTxs requires lock

fb2dca4

refactor: CTxMemPool::exists requires lock

dde441c

promag force-pushed the 2020-09-removeunbroadcasttx branch from 8f4307c to dde441c Compare September 8, 2020 21:06

promag changed the title ~~RemoveUnbroadcastTx requires mempool lock~~ Yet another change to reduce recursive mempool locking Sep 8, 2020

vasild reviewed Sep 9, 2020

DrahtBot added the Needs rebase label Sep 16, 2020

promag closed this Nov 7, 2020

promag deleted the 2020-09-removeunbroadcasttx branch November 7, 2020 11:47

bitcoin locked as resolved and limited conversation to collaborators Feb 15, 2022
