AS transaction isolation tweaks #12032

Fizzadar · 2022-02-18T14:57:37Z

Probably fixes #11620.

Let me know if you'd prefer this in two PRs!

Signed-off-by: Nick Mills-Barrett nick@beeper.com

Pull Request Checklist

Pull request is based on the develop branch
Pull request includes a changelog file.
Pull request includes a sign off
Code style is correct
(run the linters)

…level.

synapse/storage/databases/main/appservice.py

squahtx · 2022-02-18T15:53:34Z

synapse/storage/databases/main/appservice.py

+            "complete_appservice_txn",
+            _complete_appservice_txn,
+            isolation_level=IsolationLevel.READ_COMMITTED,


I didn't realise one of the transactions in question was non-trivial! That complicates things a little since last_txn can now change out from under us in between _get_last_txn and the upsert. Previously, we'd get a SerializationFailure and retry until we get a consistent view throughout the transaction.

I think this is okay though since last_txn_id is only used for a debug check, which will now be less reliable - if two complete_appservice_txn transactions run simultaneously with the same txn_id, neither of them may trip the debug check (whereas before, one of them would have to retry and trip the check). It's best to leave a comment noting that last_txn_id can change out from under us before the upsert because we're in READ COMMITTED mode, or something to that effect.

An alternative is to use a RETURNING clause on the upsert to get the last_txn_id. The downside is you'd need a different path for SQLite (which doesn't support RETURNING).

If we then also replace the DELETE with a DELETE WHERE txn_id <= ? then I don't think this even needs to be a transaction, and we can do the two queries with db_autocommit.

Another thought - should txn_id be generated using a sequence? The debug check and manual generation of TXN ID seems unncessary to me (this would potentially open up running multiple appservice workers with additional work I think?).

Currently I believe this is safe but only because there is only a single process-per-AS working on transactions (enforced by https://github.com/matrix-org/synapse/blob/develop/synapse/appservice/scheduler.py#L181).

For some reason the debug check trips on matrix.org from time to time, perhaps around the time synapse workers are restarting(?). It's not immediately clear to me why it happens.

I can't see why we don't use a sequence for txn_id. Something to do with sqlite support maybe?

Possibly historical reasons? The AS codepaths look old on blame to me! Not super familiar with the rest of synapse but sequences in use/defined in https://github.com/matrix-org/synapse/blob/develop/synapse/storage/util/sequence.py so potentially would be beneficial to switch to those. Need more context to make that call though, and probably beyond the scope of this PR!

Fizzadar · 2022-04-26T12:43:27Z

This has now been fixed in #12209!

Fizzadar added 2 commits February 18, 2022 14:53

Use simple_update_one when updating AS state stream positions.

83eaeb0

Run the complete AS txn transaction using READ COMMITTED isolation …

91485b7

…level.

Fizzadar requested a review from a team as a code owner February 18, 2022 14:57

Add changelog file.

e891764

squahtx reviewed Feb 18, 2022

View reviewed changes

squahtx self-assigned this Feb 18, 2022

erikjohnston added the X-Awaiting-Changes A contributed PR which needs changes and re-review before it can be merged label Feb 23, 2022

Fizzadar mentioned this pull request Mar 11, 2022

Use a sequence to generate AS transaction IDs, drop last_txn AS state #12209

Merged

4 tasks

Fizzadar closed this Apr 26, 2022

Fizzadar deleted the as-transaction-isolation branch April 26, 2022 12:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

AS transaction isolation tweaks #12032

AS transaction isolation tweaks #12032

Uh oh!

Fizzadar commented Feb 18, 2022 •

edited by squahtx

Loading

Uh oh!

Uh oh!

squahtx Feb 18, 2022 •

edited

Loading

Uh oh!

erikjohnston Feb 21, 2022

Uh oh!

Fizzadar Feb 21, 2022

Uh oh!

squahtx Feb 22, 2022

Uh oh!

Fizzadar Feb 22, 2022

Uh oh!

Fizzadar commented Apr 26, 2022

Uh oh!

Uh oh!

Uh oh!

AS transaction isolation tweaks #12032

AS transaction isolation tweaks #12032

Uh oh!

Conversation

Fizzadar commented Feb 18, 2022 • edited by squahtx Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Checklist

Uh oh!

Uh oh!

squahtx Feb 18, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

erikjohnston Feb 21, 2022

Choose a reason for hiding this comment

Uh oh!

Fizzadar Feb 21, 2022

Choose a reason for hiding this comment

Uh oh!

squahtx Feb 22, 2022

Choose a reason for hiding this comment

Uh oh!

Fizzadar Feb 22, 2022

Choose a reason for hiding this comment

Uh oh!

Fizzadar commented Apr 26, 2022

Uh oh!

Uh oh!

Fizzadar commented Feb 18, 2022 •

edited by squahtx

Loading

squahtx Feb 18, 2022 •

edited

Loading