Conversation

darosior (Member) commented Jul 18, 2025

For 30.0, please consider reviewing #33050 instead of this PR. This one can still be considered independently later on, but #33050 is a smaller patch that achieves the desired effect and should be easier to get in before the feature freeze.

For unconfirmed transactions, we currently distinguish between Script validation failures due to standardness rules and those due to consensus rules. The distinction is used to decide whether to disconnect the peer which submitted this transaction.

This is currently achieved by re-running Script validation with only the consensus verification flags after it has failed with the standardness flags. This is wasteful, and potentially significantly more so since Taproot. This PR proposes to replace this detection by having Script validation return the verification flag responsible for the failure instead.
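The difference between the two approaches can be sketched with a toy Python model. All names below are illustrative, not Bitcoin Core's actual API: the real flags are `SCRIPT_VERIFY_*` bits and real evaluation stops at the first failing opcode, not the first failing flag.

```python
# Toy model of Script error classification. Illustrative only: the real
# check lives in CScriptCheck in Bitcoin Core, not in these helpers.

CONSENSUS_FLAGS = {"P2SH", "WITNESS"}
STANDARDNESS_FLAGS = CONSENSUS_FLAGS | {"MINIMALDATA", "DISCOURAGE_UPGRADABLE_NOPS"}

def eval_script(violated, flags):
    """Pretend evaluation: return the first flag in `flags` that the script
    violates, or None if the script passes under these flags."""
    for flag in sorted(flags):
        if flag in violated:
            return flag
    return None

def classify_current(violated):
    """Current approach: on failure with the standardness flags, re-run the
    whole evaluation with only the consensus flags to classify the error."""
    if eval_script(violated, STANDARDNESS_FLAGS) is None:
        return "valid"
    # Second, redundant evaluation, just to decide whether to disconnect:
    return "consensus" if eval_script(violated, CONSENSUS_FLAGS) else "standardness"

def classify_proposed(violated):
    """Proposed approach: a single evaluation reports the offending flag."""
    flag = eval_script(violated, STANDARDNESS_FLAGS)
    if flag is None:
        return "valid"
    return "consensus" if flag in CONSENSUS_FLAGS else "standardness"
```

Both approaches classify a purely consensus-invalid script the same way; they only diverge on a script that trips a standardness-only flag before a consensus flag, which is the edge case discussed next.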

Note this is a slight behaviour change: a consensus-related Script validation failure that happens after a standardness-related Script validation failure will no longer be treated as a consensus error (and consequently the peer will not be disconnected). I don't think this is an issue, because we already do not treat all consensus-invalid transactions we get submitted as consensus errors. For instance, a non-standard transaction (for any reason other than its Scripts) with a consensus-invalid Script will be treated as a standardness error. Same for a transaction with a consensus-invalid Script in a later input than one with a non-standard Script. Under these conditions, a Script that is consensus-invalid after being standardness-invalid is an edge case that seems unnecessary to detect. It also seems to me that the fact this edge case is detected at all is more an artifact of the existing logic (introduced in 97e7901 13 years ago) than a design decision. For all these reasons, this slight behaviour change seems fine to me (and definitely worth it when considering the upside of not having to run Script validation twice).
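Concretely, the per-input classification described above can be sketched with a toy model (names are illustrative, not Core's actual API):

```python
def classify_tx(input_errors):
    """input_errors: one entry per input, each None (passes), "standardness"
    or "consensus". Inputs are checked in order and validation stops at the
    first failure, so the first failing input decides the classification."""
    for err in input_errors:
        if err is not None:
            return err
    return "valid"

# A consensus-invalid Script in input 1 behind a non-standard Script in
# input 0 is reported as a standardness failure, so the peer is not
# disconnected -- this is already the case before this PR:
assert classify_tx(["standardness", "consensus"]) == "standardness"
```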

This change is quite verbose. If reviewers want, I can try to make it less so, but I first wanted to open this as a draft to get feedback on the broader approach of returning errors from Script validation directly, instead of having to run it multiple times with different flags to distinguish the cause of an error.

DrahtBot (Contributor) commented Jul 18, 2025

The following sections might be updated with supplementary metadata relevant to reviewers and maintainers.

Code Coverage & Benchmarks

For details see: https://corecheck.dev/bitcoin/bitcoin/pulls/33012.

Reviews

See the guideline for information on the review process.
A summary of reviews will appear here.

Conflicts

Reviewers, this pull request conflicts with the following ones:

  • #33050 (net, validation: don't punish peers for consensus-invalid txs by ajtowns)
  • #32998 (Bump SCRIPT_VERIFY flags to 64 bit by ajtowns)
  • #32575 (refactor: Split multithreaded case out of CheckInputScripts by fjahr)
  • #32473 (Introduce per-txin sighash midstate cache for legacy/p2sh/segwitv0 scripts by sipa)
  • #32247 (BIP-348 (OP_CHECKSIGFROMSTACK) (regtest only) by jamesob)
  • #32143 (Fix 11-year-old mis-categorized error code in OP_IF evaluation by cculianu)
  • #31713 (miniscript refactor: Remove unique_ptr-indirection (descriptor: Add proper Clone function to miniscript::Node #30866 follow-up) by hodlinator)
  • #31576 (test: Move script_assets_tests into its own suite by hebasto)
  • #29491 ([EXPERIMENTAL] Schnorr batch verification for blocks by fjahr)
  • #29247 (CAT in Tapscript (BIP-347) by arminsabouri)

If you consider this pull request important, please also help to review the conflicting pull requests. Ideally, start with the one that should be merged first.

DrahtBot (Contributor) commented:
🚧 At least one of the CI tasks failed.
Task TSan, depends, gui: https://github.com/bitcoin/bitcoin/runs/46284608490
LLM reason (✨ experimental): The CI failure is caused by a compilation error in verify_script.cpp due to an incorrect call to EvalScript with mismatched argument types.

Hints

Try to run the tests locally, according to the documentation. However, a CI failure may still
happen for a number of reasons, for example:

  • Possibly due to a silent merge conflict (the changes in this pull request being
    incompatible with the current code in the target branch). If so, make sure to rebase on the latest
    commit of the target branch.

  • A sanitizer issue, which can only be found by compiling with the sanitizer and running the
    affected test.

  • An intermittent issue.

Leave a comment here, if you need help tracking down a confusing failure.

ajtowns (Contributor) commented Jul 20, 2025

Note this is a slight behaviour change: a consensus-related Script validation failure that happens after a standardness-related Script validation failure will no longer be treated as a consensus error (and consequently the peer will not be disconnected).

I'm skeptical that this behaviour is really worth preserving in a limited fashion. With this change, an attacker can waste your resources, without being discouraged or risking having to pay tx fees, by making a consensus-invalid tx that fails standardness before the consensus failure is seen; and innocent third parties are more likely to waste your resources by relaying txs under different standardness policies, and won't be penalised at all.

I think getting different errors for standardness vs consensus failures can be useful for debugging (and perhaps functional tests or network behaviour monitoring), so I could see it being worthwhile to retain a (default-off) config option to continue testing transactions twice.

darosior (Member, Author) commented:
The behaviour already exists in a limited fashion. Even without this change, an attacker can trivially create such a transaction (as discussed in the OP). Here is, for instance, a p2p_invalid_tx.py case which passes both with and without this PR:

Patch:
```diff
diff --git a/test/functional/data/invalid_txs.py b/test/functional/data/invalid_txs.py
index 36b274efc27..46bb2bba367 100644
--- a/test/functional/data/invalid_txs.py
+++ b/test/functional/data/invalid_txs.py
@@ -286,6 +286,22 @@ class NonStandardAndInvalid(BadTxTemplate):
             self.spend_tx, 0, script_sig=b'\x00' * 3 + b'\xab\x6a',
             amount=(self.spend_avail // 2))
 
+
+class FirstInNonStdSecondInInvalid(BadTxTemplate):
+    """A transaction with first a standardness error and then a consensus error."""
+    reject_reason = "non-mandatory-script-verify-flag (Using OP_CODESEPARATOR in non-witness script)"
+    # NOTE: feature_block would need to be tweaked for the spent tx to have two outputs.
+    block_reject_reason = "mandatory-script-verify-flag-failed (OP_RETURN was encountered)"
+    expect_disconnect = False
+    valid_in_block = False
+
+    def get_tx(self):
+        tx = create_tx_with_script(self.spend_tx, 0, script_sig=b"", amount=(self.spend_avail // 2))
+        tx.vin.append(CTxIn(COutPoint(self.spend_tx.txid_int, 1)))
+        tx.vin[0].scriptSig = b"\xab"  # A CODESEP in the first input (non-standard)
+        tx.vin[1].scriptSig = b"\x6a"  # An OP_RETURN in the second input (invalid)
+        return tx
+
 # Disabled opcode tx templates (CVE-2010-5137)
 DisabledOpcodeTemplates = [getDisabledOpcodeTemplate(opcode) for opcode in [
     OP_CAT,
diff --git a/test/functional/p2p_invalid_tx.py b/test/functional/p2p_invalid_tx.py
index ae9771e7cb4..33082337cc8 100755
--- a/test/functional/p2p_invalid_tx.py
+++ b/test/functional/p2p_invalid_tx.py
@@ -14,6 +14,7 @@ from test_framework.messages import (
     CTxOut,
 )
 from test_framework.p2p import P2PDataStore
+from test_framework.script import CScript, OP_TRUE
 from test_framework.test_framework import BitcoinTestFramework
 from test_framework.util import (
     assert_equal,
@@ -56,7 +57,7 @@ class InvalidTxRequestTest(BitcoinTestFramework):
 
         self.log.info("Create a new block with an anyone-can-spend coinbase.")
         height = 1
-        block = create_block(tip, create_coinbase(height), block_time)
+        block = create_block(tip, create_coinbase(height, extra_output_script=CScript([OP_TRUE])), block_time)
         block.solve()
         # Save the coinbase for later
         block1 = block
```

I don't think this logic is actually meant to prevent an attacker from wasting resources anyway. I think it's rather to make sure we don't split the network when introducing new Script-runtime standardness rules, and this change doesn't affect that.

darosior force-pushed the 2507_script_errors branch from 48a5d18 to f7dc2be on July 21, 2025 at 13:54
darosior (Member, Author) commented:
Rebased to fix a silent merge conflict that was making the "test each commit" CI job fail.

darosior added 2 commits July 22, 2025 12:40
…rrors

We distinguish between consensus and standardness errors in validating a transaction to decide
whether to disconnect the peer. When a transaction is both consensus-invalid and non-standard, we
sometimes treat it as a consensus failure and sometimes not. This commit adds additional cases to
invalid_txs.py to clarify the distinction and highlight the behaviour change in the next commit.

We treat a non-standard transaction with an invalid Script as non-standard (and do not disconnect the
peer), and always treat a standard transaction with an invalid Script (whether or not the Script
itself is also non-standard) as a consensus failure. (That is, as long as no non-standard Script is
encountered first.)
…nd standardness errors

Change CScriptCheck() to return both the Script error type and the Script verification flag which
caused the failure. This avoids having to run Script validation a second time upon failure.

It does change behaviour slightly, in that a Script consensus failure after a Script standardness
failure won't be detected anymore, and therefore won't lead to a disconnect. This is arguably by
design though: this can only ever be detected by running Script validation without the standardness
verification flags, which we are trying to avoid here.
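The shape of this change can be sketched in Python pseudocode (the real change is to the C++ CScriptCheck class; the names `ScriptCheckResult`, `failing_flag`, and `should_disconnect` below are assumptions for illustration):

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class ScriptCheckResult:
    """Sketch of a richer Script-check result: the error plus the
    verification flag that triggered it (both None on success)."""
    error: Optional[str] = None
    failing_flag: Optional[str] = None

def should_disconnect(result: ScriptCheckResult, consensus_flags: set) -> bool:
    """Disconnect only when the failure was caused by a consensus flag --
    decided from the single evaluation, with no re-run."""
    return result.error is not None and result.failing_flag in consensus_flags
```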
darosior force-pushed the 2507_script_errors branch from 5557cbb to 648e5e2 on July 22, 2025 at 22:01
ajtowns (Contributor) commented Jul 23, 2025

The behaviour already exists in a limited fashion. Even without this change, an attacker can trivially create such a transaction (as discussed in the OP). Here is, for instance, a p2p_invalid_tx.py case which passes both with and without this PR:

Okay, but that just means the extra work isn't providing much value today either?

I don't think this logic is actually meant to prevent an attacker from wasting resources anyway. I think it's rather to make sure we don't split the network when introducing new Script-runtime standardness rules, and this change doesn't affect that.

I'm suggesting we should not discourage peers for consensus-invalid txs either, since we can't do that both consistently and efficiently -- i.e., drop MaybePunishNodeForTx() entirely. That would also have the benefit of not having to worry about potential network splits due to txs violating new consensus rules that never actually make it into a block (cf. #31269 and #26291). See https://github.com/ajtowns/bitcoin/commits/202507-nopunishtx/

darosior (Member, Author) commented:
It doesn't seem too unreasonable to stop disconnecting upon receiving a transaction which fails Script validation due to a consensus verification flag. However, it's not clear that it's necessarily better. It doesn't help in adversarial situations, but may still avoid wasting resources on innocently misbehaving peers. With this PR, opportunistically detecting such peers and evicting them comes at virtually no cost (unlike the current situation). So I suppose the tradeoff is between "maintain the opportunistic behaviour" and "avoid the cost of reviewing and merging this PR".

darosior (Member, Author) commented:
One example where the opportunistic disconnection could be useful is the case of a hardfork coin with signature replay protection whose nodes still connect to Bitcoin peers. This happened in the past, but admittedly seems very unlikely nowadays.

ajtowns (Contributor) commented Jul 24, 2025

One example where the opportunistic disconnection could be useful is in the case of a hardfork coin

In that case you'll already get a network split when the hard fork side mines a block.

It doesn't help in adversarial situations, but may still avoid wasting resources on innocently misbehaving peers. With this PR, opportunistically detecting such peers and evicting them comes at virtually no cost (unlike the current situation).

Disconnecting peers is only useful in adversarial situations -- keeping slightly buggy but otherwise honest peers in sync with our view of the world is a desirable outcome.

A particular example of where this makes the network more robust is when we take currently-standard txs and make them consensus-invalid: if we ban on consensus-invalid txs, then old nodes (that consider the tx standard) and new nodes (that consider the tx invalid) need to be bridged by intermediate nodes (that consider it valid but non-standard), making the network more fragile. If we simply ignore invalid txs, no such bridging is needed.

So I suppose the tradeoff is between "maintain the opportunistic behaviour" and "avoid the cost of reviewing and merging this PR".

Someone recently made the point that we already have a lot of code to maintain in bitcoin core, and that perhaps it would be beneficial to focus our attention on the parts that are providing the most benefit...

darosior (Member, Author) commented:
Someone recently made the point that we already have a lot of code to maintain in bitcoin core

A wise man! More seriously, I was merely pointing out the tradeoff here, not saying this PR is the better alternative to #33050. I am out of rationalizations for the existing behaviour anyway, so might as well take a red diff over a green one.

DrahtBot (Contributor) commented:
🐙 This pull request conflicts with the target branch and needs rebase.

darosior (Member, Author) commented:
In that case you'll already get a network split when the hard fork side mines a block.

Discussed this with AJ (and others). For reference, in this case the disconnection would happen after a second block is submitted on top of a first invalid one. We would not disconnect upon a first invalid block being submitted through compact blocks. It turns out this logic was not tested at all, so I opened #33083, which exercises various cases in MaybePunishNodeForBlock.
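The exemptions in question can be summarised with a toy Python model of MaybePunishNodeForBlock's dispatch (simplified to a boolean "punish" decision; the real C++ function handles further BlockValidationResult values and calls Misbehaving() rather than returning a bool):

```python
def maybe_punish_for_block(result: str, via_compact_block: bool, is_inbound: bool) -> bool:
    """Toy model: return True when the peer would be punished."""
    if result in ("BLOCK_CONSENSUS", "BLOCK_MUTATED"):
        # A first invalid block: exempt compact block announcements.
        return not via_compact_block
    if result == "BLOCK_CACHED_INVALID":
        # Building on a known-invalid block: discourage outbound peers only,
        # and still exempt high-bandwidth compact block peers.
        return not via_compact_block and not is_inbound
    return False
```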

fanquake added a commit that referenced this pull request Aug 1, 2025
…g an invalid compact block

c157438 qa: test that we do disconnect upon a second invalid compact block being announced (Antoine Poinsot)
fb2dcbb qa: test cached failure for compact block (Antoine Poinsot)
f12d8b1 qa: test a compact block with an invalid transaction (Antoine Poinsot)
d6c37b2 qa: remove unnecessary tx removal from compact block (Antoine Poinsot)

Pull request description:

  In thinking about #33050 and #33012 (comment), I went through the code paths for peer disconnection upon submitting an invalid block. It turns out that the fact we exempt a peer from disconnection upon submitting an invalid compact block was not properly tested, as can be checked with these diffs:
  ```diff
  diff --git a/src/net_processing.cpp b/src/net_processing.cpp
   index 0c4a89c44cb..d243fb88d4b 100644
  --- a/src/net_processing.cpp
  +++ b/src/net_processing.cpp
  @@ -1805,10 +1805,10 @@ void PeerManagerImpl::MaybePunishNodeForBlock(NodeId nodeid, const BlockValidati
       // The node is providing invalid data:
       case BlockValidationResult::BLOCK_CONSENSUS:
       case BlockValidationResult::BLOCK_MUTATED:
  -        if (!via_compact_block) {
  +        //if (!via_compact_block) {
               if (peer) Misbehaving(*peer, message);
               return;
  -        }
  +        //}
           break;
       case BlockValidationResult::BLOCK_CACHED_INVALID:
           {
  ```

  ```diff
  diff --git a/src/net_processing.cpp b/src/net_processing.cpp
  index 0c4a89c..e8e0c805367 100644
  --- a/src/net_processing.cpp
  +++ b/src/net_processing.cpp
  @@ -1814,10 +1814,10 @@ void PeerManagerImpl::MaybePunishNodeForBlock(NodeId nodeid, const BlockValidati
           {
               // Discourage outbound (but not inbound) peers if on an invalid chain.
               // Exempt HB compact block peers. Manual connections are always protected from discouragement.
  -            if (peer && !via_compact_block && !peer->m_is_inbound) {
  +            //if (peer && !via_compact_block && !peer->m_is_inbound) {
                   if (peer) Misbehaving(*peer, message);
                   return;
  -            }
  +            //}
               break;
           }
       case BlockValidationResult::BLOCK_INVALID_HEADER:
  ```

  We do have a test for this, but it actually uses a coinbase witness commitment error, which is checked much earlier in `FillBlock`. This PR adds coverage for the two exemptions in `MaybePunishNodeForBlock`.

ACKs for top commit:
  kevkevinpal:
    ACK [c157438](c157438)
  nervana21:
    tACK [c157438](c157438)
  instagibbs:
    crACK c157438
  stratospher:
    ACK c157438.

Tree-SHA512: a77d5a9768c0d73f122b06db2e416e80d0b3c3fd261dae8e340ecec2ae92d947d31988894bc732cb6dad2e338b3c82f33e75eb3280f8b0933b285657cf3b212c