[test] Add aborttrescan tests #10225

kallewoof · 2017-04-18T08:13:16Z

This PR adds tests for the new abortrescan RPC command.

A new function ref_node was added to util.py, which works like start_node except it never actually launches the process. This is used to get two node objects in python which work separately from each other, in order to make two simultaneous requests (importprivkey and abortrescan).

jonasschnelli · 2017-04-18T08:16:18Z

test/functional/test_runner.py

@@ -108,6 +108,7 @@
    'rpcnamedargs.py',
    'listsinceblock.py',
    'p2p-leaktests.py',
+    'import-abort-rescan.py',           # ~17s


I think we should remove the time hint, otherwise this will become "the standard" and, with that, the "keep-it-updated" problem will follow.

Hm, yeah, you're right. Removing.

jonasschnelli · 2017-04-18T08:17:20Z

Concept ACK.
Nice way with the ref node & threading.

jnewbery · 2017-04-18T16:43:40Z

Thanks for opening this PR to cover abortrescan. Definitely worth doing.

I don't like the change you've made to start_node(). Adding an optional parameter to a function called start_node() which causes the function to not start a node is really an abuse of the function.

I'd much prefer us to move toward having an encapsulated class for a test node, where we could have multiple rpc connections, including asynchronous rpc connections if required. I've been trying to push towards that model in #10082, but I haven't had much luck in attracting reviewers for that PR or the linked PR. I think adding random functionality into util.py is unmaintainable and is moving in the wrong direction.

So concept ACK for covering this with a functional test, but please lets not add more to util.py.

kallewoof · 2017-04-19T00:35:30Z

@jnewbery

I don't like the change you've made to start_node(). Adding an optional parameter to a function called start_node() which causes the function to not start a node is really an abuse of the function.

Yeah, I was wondering if I should have reversed it (i.e. ref_node takes a launch bool and start_node just proxies).

As for encapsulate with multiple rpc connections, that sounds great but the functionality is, as you say, not in there. My change is minimal and works, so maybe it's an okay for now until #10082 is ready? (I'll gladly review btw. I just gotta wake up first..)

Edit: Alternatively I could base this PR on top of #10082. Never done that kind of thing before though.

kallewoof · 2017-04-19T02:20:20Z

#10082 seems like a big project (I'd love to help btw), so I am proposing a solution until that is resolved here. start_node now calls ref_node which now has an optional spawnproc flag. This means start_node always starts a node, and ref_node can do both, depending on the flag.

Unsquashed history: 1 → 2 → 3⊱1 → 4⊱2

jnewbery · 2017-04-19T21:05:30Z

@kallewoof thanks for being so accepting of my feedback! I don't like NACKing PRs, but I really want to try to not put any additional complexity in util.py if we can help it.

Have you had a look at getblocktemplate_longpoll.py ? That's doing something similar to this test where an additional asynchronous RPC thread is required. I've tried to rewrite your testcase in the same style here: https://github.com/jnewbery/bitcoin/tree/pr10225. Can you take a look and tell me what you think? I'd prefer to do it this way than adding more complexity to the mainline start_node() function. If we can get #10082 merged, then we can then look at adding an asynchronous RPC thread to the TestNode class so it's available more generally.

EDIT: I've found that this test fails intermittently when I run it locally. I think perhaps the importprivkey() is completing very quickly so it's a race condition and the abortrescan() call needs to arrive at the right moment.

kallewoof · 2017-04-20T01:44:22Z

@jnewbery not a worry at all -- you don't have to hold back the punches with me. I am learning a lot from the feedback I get from you guys. :)

Gotcha on the no-touching-util.py. I will look at getblocktemplate_longpoll and see if I can adapt. Worst case I have two options: I can drop this PR until #10082 or I can put the ref_node code into the test itself with a # TODO and we simply rip it out when it's time to replace.

As for the intermittent failures, yes, I am trying to keep the time to run as low as possible and I may have put it a bit too low (upping the range in one or both of the top for loops should stabilize it, I think).

kallewoof · 2017-04-20T02:16:11Z

@jnewbery Wow, the solution in getblocktemplate_longpoll.py was so much cleaner. I switched to that and dropped some commits. I also upped the range to hopefully address the intermittent fails you experienced. Edit: beginning to suspect problem is in fact in the abortres aborting too early. Added small sleep (7''⊱2).

Unsquashed history: 1 → 2 → ~~3⊱1~~ → 4⊱2 → 5⊱2 → 6⊱2 → 7''⊱2

jnewbery · 2017-04-20T13:52:29Z

Looks better, but the test is still failing more often than not for me. I ran the test 20 times (with 4 tests running in parallel):

TEST                                 | STATUS    | DURATION

import-abort-rescan.py --portseed=1  | ✓ Passed  | 11 s
import-abort-rescan.py --portseed=10 | ✖ Failed  | 11 s
import-abort-rescan.py --portseed=11 | ✖ Failed  | 11 s
import-abort-rescan.py --portseed=12 | ✖ Failed  | 11 s
import-abort-rescan.py --portseed=13 | ✖ Failed  | 11 s
import-abort-rescan.py --portseed=14 | ✖ Failed  | 11 s
import-abort-rescan.py --portseed=15 | ✖ Failed  | 11 s
import-abort-rescan.py --portseed=16 | ✖ Failed  | 11 s
import-abort-rescan.py --portseed=17 | ✖ Failed  | 11 s
import-abort-rescan.py --portseed=18 | ✓ Passed  | 11 s
import-abort-rescan.py --portseed=19 | ✖ Failed  | 11 s
import-abort-rescan.py --portseed=2  | ✖ Failed  | 12 s
import-abort-rescan.py --portseed=20 | ✖ Failed  | 11 s
import-abort-rescan.py --portseed=3  | ✖ Failed  | 12 s
import-abort-rescan.py --portseed=4  | ✖ Failed  | 13 s
import-abort-rescan.py --portseed=5  | ✖ Failed  | 11 s
import-abort-rescan.py --portseed=6  | ✖ Failed  | 11 s
import-abort-rescan.py --portseed=7  | ✖ Failed  | 11 s
import-abort-rescan.py --portseed=8  | ✖ Failed  | 11 s
import-abort-rescan.py --portseed=9  | ✖ Failed  | 11 s

ALL                                  | ✖ Failed  | 224 s (accumulated)

(ignore the portseed argument - that's just a hack to allow instances of the same test to be run by the test_runner. The value of portseed is ignored).

I'm not why this fails so much for me, but seems to pass for you and Travis.

kallewoof · 2017-04-21T00:30:09Z

@jnewbery Is there an easy way to run the test like that? E.g. 20 times 4 in parallel?

Edit: tests keep succeeding for me on a MacBook Pro. Running them on a linux machine (lubuntu) resulted in sporadic failures. Looking into it now.

Edit: there are two cases where the test will fail; one is when the abortres loop sleeps right over the importprivkey time-to-finish (for 0.01), and one, I realized, is when the importprivkey doesn't actually start up before the abortres loop ends. I increased the range from 200 to 2000, with sleep kept at 0.001, which means a total approximate time of 2 seconds in the loop. This almost fixed it on my end but test started failing on the next assertion (aborted, so should not have balance). I bumped block chain size and that seems to have done it. This all feels very flakey though. :(

[...]: → 8⊱2 → 9⊱2

kallewoof · 2017-04-23T23:31:17Z

~~Still seeing intermittent failures. Marking this WIP until this is fully resolved.~~

Edit: Actually, the failures I were seeing were related to a debug line that triggered #10265 so I am removing the WIP part.

ed60970 [test] Test abortrescan command. (Karl-Johan Alm) Tree-SHA512: 7f617adba65a6df8fdc4b01432992926a06c4a05da4e657653436f7716301fa5d6249d77894a097737e7fb9e118925883f2425c639058b8973680339bb8e61b6

jnewbery · 2017-04-25T14:55:34Z

Is there an easy way to run the test like that? E.g. 20 times 4 in parallel?

You can do this by changing the BASE_SCRIPTS list in test_runner.py to be the same test multiple times. test_runner will automatically remove duplicates, but if you add --portseed=x to the test name, then it will run them as separate tests. The dummy portseed parameter is overridden by an actual portseed further down in test_runner.

This all feels very flakey though. :(

Indeed! Is it true to say there's a tradeoff between making the test case last longer and making it more robust? If that's the case I think you should err towards making it run longer and perhaps add it as an extended script rather than a base script.

jnewbery · 2017-05-02T19:10:43Z

This test is still failing intermittently for me in two different ways:

2017-05-02 18:51:29.922000 TestFramework (ERROR): Assertion failed
Traceback (most recent call last):
  File "/home/ubuntu/bitcoin/test/functional/test_framework/test_framework.py", line 146, in main
    self.run_test()
  File "./import-abort-rescan.py", line 50, in run_test
    assert abortres # if false, we failed to abort
AssertionError

and:

Traceback (most recent call last):
  File "/home/ubuntu/bitcoin/test/functional/test_framework/test_framework.py", line 146, in main
    self.run_test()
  File "./import-abort-rescan.py", line 58, in run_test
    assert_equal(self.nodes[1].getbalance(), 0.0)
  File "/home/ubuntu/bitcoin/test/functional/test_framework/util.py", line 408, in assert_equal
    raise AssertionError("not(%s)" % " == ".join(str(arg) for arg in (thing1, thing2) + args))
AssertionError: not(0.12300000 == 0.0)

This is also causing our Jenkins build machine to fail occasionally (2 times out of 30 builds)

@laanwj - was this merged accidentally? There haven't been any reviews/ACKs. Can we revert it and get it reviewed before remerging?

laanwj · 2017-05-03T13:51:06Z

@laanwj - was this merged accidentally?

I think so, sorry @kallewoof, needs a new PR now.

jnewbery · 2017-05-03T13:53:04Z

@kallewoof - this has been backed out by #10327 . Please open a new PR so this test can be reviewed before being merged back in. A few suggestions for making this less flakey:

increase the number of generated blocks by at least an order of magnitude. Run the test many times on a fast machine, with bitcoind's datadir in /dev/shm. I think this passes on Travis because bitcoind runs slowly so you have a larger window for the abortrescan RPC to hit.
check the debug logs to see how long the rescan actually takes so we're not guessing on what a safe window is.
I think this test should be in the extended_test list rather than the base_tests list.
alternatively, tack this test onto a longer test script which already has a long chain, for example pruning.py. Seems a bit untidy, but should guarantee that the rescan takes a long time.

kallewoof · 2017-05-04T12:13:49Z

No problem - I will do proper testing and make a new PR once done. Sorry for the trouble!

ed60970 [test] Test abortrescan command. (Karl-Johan Alm) Tree-SHA512: 7f617adba65a6df8fdc4b01432992926a06c4a05da4e657653436f7716301fa5d6249d77894a097737e7fb9e118925883f2425c639058b8973680339bb8e61b6

jonasschnelli reviewed Apr 18, 2017

View reviewed changes

kallewoof force-pushed the abort-rescan-tests branch from e68c168 to 28c2abf Compare April 18, 2017 08:21

fanquake added the Tests label Apr 18, 2017

kallewoof force-pushed the abort-rescan-tests branch from 28c2abf to 296bf4a Compare April 19, 2017 02:16

kallewoof force-pushed the abort-rescan-tests branch 3 times, most recently from 995105b to 7e3157a Compare April 19, 2017 05:23

kallewoof force-pushed the abort-rescan-tests branch from 7e3157a to 3dc232f Compare April 20, 2017 02:05

kallewoof force-pushed the abort-rescan-tests branch 4 times, most recently from 96dd997 to 6b8d2ea Compare April 20, 2017 04:33

kallewoof force-pushed the abort-rescan-tests branch from 6b8d2ea to 761a753 Compare April 21, 2017 02:50

[test] Test abortrescan command.

ed60970

kallewoof force-pushed the abort-rescan-tests branch from 761a753 to ed60970 Compare April 21, 2017 03:51

kallewoof changed the title ~~[test] Add aborttrescan tests~~ [WIP] [test] Add aborttrescan tests Apr 23, 2017

kallewoof changed the title ~~[WIP] [test] Add aborttrescan tests~~ [test] Add aborttrescan tests Apr 24, 2017

laanwj merged commit ed60970 into bitcoin:master Apr 25, 2017

kallewoof deleted the abort-rescan-tests branch April 25, 2017 14:22

jnewbery mentioned this pull request May 3, 2017

[tests] remove import-abort-rescan.py #10327

Merged

kallewoof mentioned this pull request May 9, 2017

[WIP] [test] Test abortrescan command. #10367

Closed

bitcoin locked as resolved and limited conversation to collaborators Sep 8, 2021

[test] Add aborttrescan tests #10225

[test] Add aborttrescan tests #10225

Uh oh!

Conversation

kallewoof commented Apr 18, 2017

Uh oh!

jonasschnelli Apr 18, 2017

Choose a reason for hiding this comment

Uh oh!

kallewoof Apr 18, 2017

Choose a reason for hiding this comment

Uh oh!

jonasschnelli commented Apr 18, 2017

Uh oh!

jnewbery commented Apr 18, 2017

Uh oh!

kallewoof commented Apr 19, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kallewoof commented Apr 19, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jnewbery commented Apr 19, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kallewoof commented Apr 20, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kallewoof commented Apr 20, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jnewbery commented Apr 20, 2017

Uh oh!

kallewoof commented Apr 21, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kallewoof commented Apr 23, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jnewbery commented Apr 25, 2017

Uh oh!

jnewbery commented May 2, 2017

Uh oh!

laanwj commented May 3, 2017

Uh oh!

jnewbery commented May 3, 2017

Uh oh!

kallewoof commented May 4, 2017

Uh oh!

Uh oh!

kallewoof commented Apr 19, 2017 •

edited

Loading

kallewoof commented Apr 19, 2017 •

edited

Loading

jnewbery commented Apr 19, 2017 •

edited

Loading

kallewoof commented Apr 20, 2017 •

edited

Loading

kallewoof commented Apr 20, 2017 •

edited

Loading

kallewoof commented Apr 21, 2017 •

edited

Loading

kallewoof commented Apr 23, 2017 •

edited

Loading