Build previous releases and run functional tests #12134

Sjors · 2018-01-09T19:35:58Z

This PR adds binaries for 0.17, 0.18 and 0.19 to Travis and runs a basic block propagation test.

Includes test for upgrading v0.17.1 wallets and opening master wallets with older versions.

Usage:

contrib/devtools/previous_release.sh -f -b v0.19.0.1 v0.18.1 v0.17.1
test/functional/backwards_compatibility.py

Travis caches these earlier releases, so it should be able to run these tests with little performance impact.

Additional scenarios where it might be useful to run tests against earlier releases:

creating a wallet with SegWit wallet support #11403's segwit implementation, copying it to an older node and making sure the user didn't lose any funds (although this PR doesn't support v0.15.1)
future consensus changes
P2P changes (e.g. to make sure we don't accidentally ban old nodes)

ryanofsky · 2018-01-09T20:27:34Z

Can you explain why exactly the patching is needed in this case? It would definitely seem better to test against unpatched previous releases if possible.

Sjors · 2018-01-09T20:33:40Z

@ryanofsky the regtest parameters were changed to activate SegWit at the genesis block, which causes nodes after #11389 to reject blocks created by v0.15.1 and older nodes (and vice versa). Maybe there's a less drastic way to change the regtest consensus params for these older releases?

ryanofsky · 2018-01-09T20:42:53Z

Not sure, but is passing -vbparams=segwit:0:0 an option?

sdaftuar · 2018-01-09T21:16:39Z

@Sjors I've spent some time looking into how feasible it would be to write functional tests that would run against older versions of our software, which I think would be great if we could pull off -- for instance, I thought it would be nice in something like the p2p-segwit.py test for us to use an actual old binary in the upgrade_after_activation test, rather than mocking it with a current binary by setting the bip9 parameters to 0.

However, in my experience this proved to be impractical. It seems like we periodically make changes to the RPC test framework infrastructure itself, in such ways that cause problems with backward compatibility. Here are a few examples I have run into, so you get a flavor of some of the issues:

sync_blocks() was rewritten to use an rpc call, waitforblockheight, that was introduced in 0.14. So no test that uses sync_blocks can run on a pre-0.14 version of the code (which is basically every test we have!).
Even if you solve that problem somehow (eg by rewriting sync_blocks, or avoiding it somehow), we also introduced in 0.14 support for RPC named arguments. This also breaks compatibility with pre-0.14 software (so without a patch, I believe no RPC calls work at all).
I might have the details of this one slightly wrong, but I think we made a change at some point to support parsing numbers as strings, versus floats, and then I think our JSON library started to take advantage of that as well? Not sure if that's exactly right, but I remember having an issue running newer tests against older releases when using RPC calls involving numbers, and needing a patch for that.

So my takeaway from this exercise was that if we want to support running modern tests against older binaries, we need to change the way we do things, so that we (a) have a better split between the python test code, and the python utilities/RPC handlers/etc that allow us to talk to a node, and (b) come up with a way to use older python utility code when talking to older nodes.

ryanofsky · 2018-01-09T21:38:22Z

So no test that uses sync_blocks can run on a pre-0.14 version of the code (which is basically every test we have!).

I don't understand why this would matter. I think the goal here is just to be able to write a small test that brings up a specific version of bitcoind (0.15) and tests a few things with it, not to be able to run large swathes of the test framework against arbitrary bitcoin releases. It doesn't seem it would require a lot of code to support this, and if the tests are run on travis it doesn't seem like whatever code is added could get broken easily.

sdaftuar · 2018-01-09T21:49:36Z

@ryanofsky Perhaps I phrased that sentence poorly -- my point wasn't that we'd want to run all the tests against older code (I think that would be absurd), just that we take it for granted that we can call sync_blocks in all sorts of places, so to not have such a basic utility at your disposal can be surprising and hard to work around.

Another way to look at it: imagine you wanted to run a test against 0.15 now, so you write one and you can even use sync_blocks and everything works fine (I agree that this shouldn't be too hard, as 0.15 was very recent). Based on our history, I imagine there will come a time when someone wants to do something which will break the test, because some utility code -- like sync_blocks, or the way we make RPC calls, or something else -- will change, and the test will be busted, and we won't have a good way to fix it.

jnewbery · 2018-01-09T22:00:53Z

This seems possible for small, limited tests which don't use the full range of test framework capabilities (for example, if we just want to test downgrading to v0.15 to test segwit wallets). As Suhas has pointed out, there are lots of reasons why this is difficult in the general case, but I think we can work around them for a targeted test cases:

sync_blocks() was rewritten to use an rpc call, waitforblockheight, that was introduced in 0.14.

Not relevant if we only want to test as far back as v0.15, but we can add a waitforblockheight method to TestNode for versions that don't have a waitforblockheight RPC method.

Even if you solve that problem somehow (eg by rewriting sync_blocks, or avoiding it somehow), we also introduced in 0.14 support for RPC named arguments.

We could be careful to not use named arguments in tests where we want to use earlier versions. Or we could add a shim to TestNode to convert named -> positional.

I think we made a change at some point to support parsing numbers as strings, versus floats, and then I think our JSON library started to take advantage of that as well?

As above.

I imagine there will come a time when someone wants to do something which will break the test, because some utility code -- like sync_blocks, or the way we make RPC calls, or something else -- will change, and the test will be busted, and we won't have a good way to fix it.

I expect we'll only want to have tests covering the last one or two releases, so I don't see this as a huge problem.

I think the goal here is ... not to be able to run large swathes of the test framework against arbitrary bitcoin releases.

Yes - I agree that would be almost impossible.

@Sjors - if the goal is only to test official releases, why not download the binaries directly from https://bitcoincore.org/bin/ rather than building them locally?

sdaftuar · 2018-01-09T22:09:39Z

This is an aside if the goal here is just to make 0.15 work, but:

We could be careful to not use named arguments in tests where we want to use earlier versions. Or we could add a shim to TestNode to convert named -> positional.

The change to support named args actually affected RPC invocations that don't use named args -- I think we send an empty dictionary instead of an empty list in authproxy.py which an old server wouldn't accept.

Anyway I don't mean to be overly pessimistic if you all think we can make something work! I'm definitely in favor of the overall goal, I just haven't come across a simple solution that I thought was robust.

sdaftuar · 2018-01-09T22:12:46Z

I expect we'll only want to have tests covering the last one or two releases, so I don't see this as a huge problem.

I don't follow this point though -- in something like the p2p-segwit.py test, you'd have to do the upgrade after activation test on an 0.13.0 node or earlier. So if we'd written the test in such a way as to use an old binary, we'd have to just decommission that test at some point...? Obviously replacing with a later binary (that has the feature you want to test upgrade from) would mean you can't do that test anymore.

jnewbery · 2018-01-09T22:16:35Z

in something like the p2p-segwit.py test, you'd have to do the upgrade after activation test on an 0.13.0 node or earlier. So if we'd written the test in such a way as to use an old binary, we'd have to just decommission that test at some point...?

Yes - my expectation is that we'd decommission that test at some point. I don't think that testing versions older than a couple of releases is something that we should do as part of travis or developer local testing (although there's nothing stopping anyone from having a custom test rig to do that sort of thing).

Sjors · 2018-01-10T16:26:36Z

@jnewbery compiling from source seems more flexible than fetching binaries, especially if we need to patch things. There might also be configure flags that are optimal for the functional tests. It can be expanded to fetch code from other repositories (on your own test rig).

Downloading binaries would be faster, but it also involves figuring out which OS you're on and checking the checksum, so it's not necessarily easier to write a script for. That said, the one-off build is painfully slow on Travis atm, so I might add an option to just fetch a binary.

@ryanofsky -vbparams=segwit:0:0 -vbparams=csv:0:0 did the trick, as long as I generate blocks on the modern node (which is fine). So no patch needed for v0.15.1 at the moment.

@sdaftuar I ran into the issue you mentioned for versions before v0.14 and added a comment to the test file to warn about that.

I changed the demonstration test to simply check if old nodes sync blocks from the new node. I'll try some SegWit wallet related scenarios in a different PR.

I added a commit that makes one Travis instance compile v0.14.2 and v0.15.1 (cached) and then run the relevant functional test. It works, but I'm a bit confused as to what the right place is on Travis to put these older builds.

maflcko · 2018-01-10T16:33:10Z

.travis.yml

@@ -7,6 +7,7 @@ cache:
  - depends/built
  - depends/sdk-sources
  - $HOME/.ccache
+  - build/releases


That cache should be covered by .ccache, no?

will look into ccache behavior.

Forget what I said. It is probably better to cache the resulting binary than the individual obj files, since we are building specific tags that don't change. For master or other branch build, the commits often only change parts of the code, so the other obj files can be cached and re-used. For tag builds everything is static, so caching the binaries works as well.

maflcko

Concept ACK. Just noting that every change to the rpc interface is a breaking change in compatibility tests, that potentially needs attention. So we need to run those tests in the pull request travis run.

maflcko · 2018-01-10T16:46:07Z

test/functional/backwards_compatibility.py

+        """Override this method to customize test node setup"""
+        self.add_nodes(self.num_nodes, self.extra_args, None, None, [
+            None,
+            "build/releases/v0.15.1/src/bitcoind",


Would be nice to be able to pass those paths in. Could overwrite add_options in test_framework.

Do you mean making --srcdir accept multiple paths? That's probably a useful PR by itself, I made a note.

maflcko · 2018-01-10T16:46:29Z

test/functional/backwards_compatibility.py

+        ]
+
+    def setup_nodes(self):
+        """Override this method to customize test node setup"""


Comment seems misplaced

maflcko · 2018-01-10T16:47:35Z

test/functional/backwards_compatibility.py

+
+    def setup_nodes(self):
+        """Override this method to customize test node setup"""
+        self.add_nodes(self.num_nodes, self.extra_args, None, None, [


Use named args for those unnamed arguments to provide documentation and robustness against future interface changes.

jonasschnelli · 2018-01-10T19:01:59Z

Concept ACK, haven't looked at the code.

I think not testing with older versions is the biggest lack in our test env.
Things like the UTXO migration (now done) or testing how "older" nodes act on the (new) p2p network.
Also, up- and downgrading wallets is not covered by our tests and could be with something like this.

meshcollider · 2018-01-11T09:15:40Z

Concept ACK, cross version testing will be a very nice addition to the testing suite IMO

Sjors · 2020-02-11T19:05:54Z

Rebased. Reminder to self: cherry-pick Sjors@bacf559 after #18067 is merged.

maflcko · 2020-02-11T19:34:04Z

contrib/devtools/previous_release.sh

+    fi
+  done
+}
+popd || exit 1


I don't think we have anyone interested in tests who can also review bash scripts. Maybe rewrite this in python?

Added to my todo list, but equally happy to (see someone else) do this in a followup.

ci/test/05_before_script.sh

ci/test/04_install.sh

maflcko

ACK, some style nits

maflcko · 2020-02-11T19:46:28Z

test/functional/feature_backwards_compatibility.py

+        if not os.path.isdir(releases_path):
+            if os.getenv("TEST_PREVIOUS_RELEASES") == "true":
+                raise AssertionError("TEST_PREVIOUS_RELEASES=1 but releases missing: " + releases_path)
+            raise SkipTest("This test requires binaries for previous releases")


I'd rather have both of these an Assertion instead of a silent pass

That seems inconsistent with the way we skip wallet tests, i.e. the test runner calls each test file and each test file decides if needs to be skipped.

test/functional/feature_backwards_compatibility.py

test/functional/test_framework/test_framework.py

maflcko · 2020-02-12T14:19:31Z

ACK c456145 🔨

Show signature and timestamp

Signature:

Timestamp of file with hash 9f2d9c2708f034d9a99d437d02cbd47fb5e0b16b9e7cfc969396186bdeabfbbf -

c456145 [test] add 0.19 backwards compatibility tests (Sjors Provoost) b769cd1 [test] add v0.17.1 wallet upgrade test (Sjors Provoost) 9d9390d [tests] add wallet backwards compatility tests (Sjors Provoost) c7ca630 [scripts] support release candidates of earlier releases (Sjors Provoost) 8b1460d [tests] check v0.17.1 and v0.18.1 backwards compatibility (Sjors Provoost) ae379cf [scripts] build earlier releases (Sjors Provoost) Pull request description: This PR adds binaries for 0.17, 0.18 and 0.19 to Travis and runs a basic block propagation test. Includes test for upgrading v0.17.1 wallets and opening master wallets with older versions. Usage: ```sh contrib/devtools/previous_release.sh -f -b v0.19.0.1 v0.18.1 v0.17.1 test/functional/backwards_compatibility.py ``` Travis caches these earlier releases, so it should be able to run these tests with little performance impact. Additional scenarios where it might be useful to run tests against earlier releases: * creating a wallet with #11403's segwit implementation, copying it to an older node and making sure the user didn't lose any funds (although this PR doesn't support `v0.15.1`) * future consensus changes * P2P changes (e.g. to make sure we don't accidentally ban old nodes) ACKs for top commit: MarcoFalke: ACK c456145 🔨 Tree-SHA512: 360bd870603f95b14dc0cd629532cc147344f632b808617c18e1b585dfb1f082b401e5d493a48196b719e0aeaee533ae0a773dfc9f217f704aae898576c19232

Sjors force-pushed the previous-release branch 14 times, most recently from cf362b5 to f46bc95 Compare January 10, 2018 16:11

maflcko reviewed Jan 10, 2018

View reviewed changes

fanquake added the Tests label Jan 11, 2018

Sjors force-pushed the previous-release branch from fef4581 to bcdd3ea Compare December 11, 2019 12:41

DrahtBot added Needs rebase and removed Needs rebase labels Dec 11, 2019

Sjors force-pushed the previous-release branch from bcdd3ea to 0beb2a3 Compare January 9, 2020 04:20

DrahtBot removed the Needs rebase label Jan 9, 2020

DrahtBot added the Needs rebase label Jan 22, 2020

Sjors force-pushed the previous-release branch from 0beb2a3 to 842e4e0 Compare January 23, 2020 18:08

DrahtBot removed the Needs rebase label Jan 23, 2020

ryanofsky mentioned this pull request Feb 5, 2020

wallet: #17261 regression: addmultisigaddress addresses not treated as solvable #18075

Closed

[scripts] build earlier releases

ae379cf

Sjors force-pushed the previous-release branch from 842e4e0 to 6c530f5 Compare February 11, 2020 18:39

maflcko reviewed Feb 11, 2020

View reviewed changes

maflcko added this to the 0.20.0 milestone Feb 11, 2020

maflcko reviewed Feb 11, 2020

View reviewed changes

ci/test/05_before_script.sh Outdated Show resolved Hide resolved

maflcko reviewed Feb 11, 2020

View reviewed changes

ci/test/04_install.sh Show resolved Hide resolved

maflcko approved these changes Feb 11, 2020

View reviewed changes

Sjors added 5 commits February 11, 2020 21:46

[tests] check v0.17.1 and v0.18.1 backwards compatibility

8b1460d

[scripts] support release candidates of earlier releases

c7ca630

[tests] add wallet backwards compatility tests

9d9390d

[test] add v0.17.1 wallet upgrade test

b769cd1

[test] add 0.19 backwards compatibility tests

c456145

Sjors force-pushed the previous-release branch from 6c530f5 to c456145 Compare February 11, 2020 20:49

maflcko merged commit c456145 into bitcoin:master Feb 12, 2020

Sjors deleted the previous-release branch February 12, 2020 14:36

ryanofsky mentioned this pull request Feb 12, 2020

wallet: Improve LegacyScriptPubKeyMan::CanProvide script recognition #18067

Merged

MarkLTZ mentioned this pull request Apr 4, 2020

Bitcoin PR tracking litecoinz-core/litecoinz#14

Closed

bitcoin locked as resolved and limited conversation to collaborators Feb 15, 2022

Build previous releases and run functional tests #12134

Build previous releases and run functional tests #12134

Uh oh!

Conversation

Sjors commented Jan 9, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ryanofsky commented Jan 9, 2018

Uh oh!

Sjors commented Jan 9, 2018

Uh oh!

ryanofsky commented Jan 9, 2018

Uh oh!

sdaftuar commented Jan 9, 2018

Uh oh!

ryanofsky commented Jan 9, 2018

Uh oh!

sdaftuar commented Jan 9, 2018

Uh oh!

jnewbery commented Jan 9, 2018

Uh oh!

sdaftuar commented Jan 9, 2018

Uh oh!

sdaftuar commented Jan 9, 2018

Uh oh!

jnewbery commented Jan 9, 2018

Uh oh!

Sjors commented Jan 10, 2018

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Sjors Jan 11, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

maflcko left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Sjors Jan 11, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jonasschnelli commented Jan 10, 2018

Uh oh!

meshcollider commented Jan 11, 2018

Uh oh!

Sjors commented Feb 11, 2020

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

maflcko left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

maflcko commented Feb 12, 2020

Uh oh!

Uh oh!

Sjors commented Jan 9, 2018 •

edited

Loading

Sjors Jan 11, 2018 •

edited

Loading

Sjors Jan 11, 2018 •

edited

Loading