@taratorio taratorio commented Feb 26, 2025

integration tests caught several bugs with the initial implementation

gist of testing approach:

  • we have an integration test which starts up Erigon as EL only
  • we drive Erigon with a MockCl via the engine api json rpc
  • we interact with Erigon via the "eth_" namespace json rpc api (i.e. submit transactions, deploy shutter contracts, check transaction inclusion, etc.)
  • this way we can build blocks and verify logic
  • this is a black box test where we feed Erigon inputs via the public api surface and check the end result again via the public api surface

test scenarios:

  • deploy initial keyper set
  • build a block with 1 shutter txn and 1 non-shutter txn
  • build a block after new eon change
  • built block does not exceed shutter encryption txn gas limit
  • built block does not include blob txns (if they accidentally make their way into the sequencer contract) - pending
  • build block after an unwind has happened (to cover unwind logic in our encrypted txn pool) - pending (probably in a follow up PR)

bug fixes caught and fixed by this:

  1. block tracker waiting timeout - fixed in this PR
[WARN] [02-26|21:59:15.010] Failed to build a block                  err="[3/4 MiningExecution] issue while waiting for parent block 14520965: context deadline exceeded"
  2. local rpc subscription panic - fixed in this PR
[EROR] [02-27|19:45:55.076] catch panic                              err="can't Notify before subscription is created" stack="[log_panic.go:41 panic.go:785 subscription.go:137 eth_filters.go:289 asm_amd64.s:1700]"
  3. recent logs notifications not sent via the "flush extending fork" path - fix merged in a separate PR: execution: fix missing log notifications when flushing extending fork #14192
  4. log notifications are not sent upon unwind - PENDING (will provide a better solution in a follow-up PR as this got quite big)
  5. issue with deployment of new eons and tracking of future eons - PENDING (will provide a better solution in a follow-up PR as this got quite big)

@taratorio taratorio changed the title txnprovider/shutter: fix block tracker txnprovider/shutter: fix block tracker, rpc subscription local notifier Feb 27, 2025
taratorio added a commit that referenced this pull request Mar 18, 2025
used in #13983
taking a cohesive unit of logic out of the bigger PR for ease of
reviewing
adds a json rpc client for interacting with the engine api

the gist is:
- we have an integration test which starts up Erigon as EL only
- we drive Erigon with a MockCl
- in order to be able to drive Erigon we need to access its engine api
json rpc (for this we need an engine api client)
- analogous to that, we need a json rpc api client for the rpcdaemon
publicly exposed apis (e.g. the "eth_" namespace, "debug_", etc.)
- this is a black box test where we feed Erigon inputs via the public
api surface and check the end result again via the public api surface
taratorio added a commit that referenced this pull request Mar 18, 2025
…#14192)

found in #13983

we weren't getting any new log notifications (rpc log subscriptions api)
when the EL was going via the `e.forkValidator.FlushExtendingFork` code
path which does an in-memory flush of the previously cached execution of
`EthereumExecutionModule.ValidateChain`
@taratorio taratorio marked this pull request as ready for review March 19, 2025 09:45
```go
	}
	iterations++
	if iterations > 1024 {
		return 0, nil, fmt.Errorf("failed to find a free port after %d iterations", iterations)
	}
```
@somnathb1 (Contributor) commented Mar 19, 2025
Will this ever hit? If it does, doesn't it mean it's being used in the wrong way? I think the statefulness of this method might not be a good thing here

@taratorio (Member, Author) replied:
I've never hit it as of now; it assumes it won't take us more than 1024 tries to find an unconsumed port (so if, say, 1 test uses 10 ports, then 100 tests running at the same time could potentially cause us to hit this).

You are right though. Maybe I can switch to an approach where we have 1 global atomic port counter which we increment: we try the port, and if it is free we give it back to the port consumer; if not, we increment and try the next port, and so on. It can be circular, so if it ever reaches port 65535 it starts from 1024 again. It still needs a global atomic counter, but it is a bit less stateful and less likely to be exhausted, I guess. Let me try.

@somnathb1 (Contributor) commented Mar 20, 2025

My concern is that the "consumed" ports are not actually consumed, just reserved in the map. Maybe the ports were closed but the map wasn't updated, or the ports were never opened. But that's probably an edge case not applicable here. One way would be to just use a free port, just in time, and if it didn't get opened, try again (just in time).

The code to get a free port (stateless) and the management of the map (a global test counter used for convenience so tests don't overlap each other) could be kept segregated. Your randomly returned port might just be 8080, and it could be in use by another service.

@taratorio (Member, Author) replied:
ok, what about this: 98c2671

it is close to what you are suggesting as "request on demand" and still caters for the problem I mentioned here #13983 (comment) (we just need to make sure all big tests like this that open ports in our repo use this approach - right now there are only a few tests which open ports, so we should be fine - I may also move this func to a top-level package in a follow-up PR so it is visible)

Contributor:
Looks good. And yes, I think that will fix a few of those tests that fail because of insistence on specific ports. I made a mental note to tackle one on the next failure.

@taratorio taratorio merged commit 18af7e1 into main Mar 21, 2025
13 checks passed
@taratorio taratorio deleted the shutter-fixes-to-block-tracker branch March 21, 2025 08:26
Giulio2002 pushed a commit that referenced this pull request Apr 12, 2025
Giulio2002 added a commit that referenced this pull request Apr 14, 2025
…xtending fork… (#14578)

@taratorio you forgot to cherry-pick this

---------

Co-authored-by: milen <94537774+taratorio@users.noreply.github.com>