Refactor MDL BF aborting through ha_abort_transaction() #307

temeo · 2023-04-05T09:51:31Z

BF aborting through ha_abort_transaction() has proven to be
problematic. The InnoDB implementation of wsrep_abort_transaction
is racy with respect to THD/trx object access because THD mutexes
cannot be held over whole BF abort process to avoid deadlocks when
wsrep_abort_transaction calls back to server.

Change the logic for BF aborts which originate from MDL in the
following way:

First check if the BF abort should progress by calling
wsrep-lib bf_abort() which checks the wsrep transaction
and Galera side transaction states.
If the BF abort should progress, call ha_abort_transaction()
without releasing LOCK_thd_data in between.
Remove call back to server in InnoDB wsrep_abort_transaction().

This makes the wsrep transaction state check and InnoDB
transaction kill atomic, while simultaneously removing the
need to call back to server from InnoDB.

TODO: Is THD::wsrep_aborter needed anymore? Always calling
wsrep-lib bf_abort() guards against multiple BF aborts.
Maybe still needed for InnoDB.

Change for MDEV-25717: Introduce separate sync point in
wsrep_abort_thd() to control thread execution during MDL
induced BF abort.

Change for galera_create_table_as_select: CTAS may now fail
also with ER_QUERY_INTERRUPTED due to use of THD::awake().

janlindstrom · 2023-04-05T11:07:31Z

storage/innobase/handler/ha_innodb.cc

+		{
+			victim_trx->lock.set_wsrep_victim();
+		}
+		victim_trx->mutex_unlock();


This is the point where:
Can victim transaction move from ACTIVE to COMMITTED_IN_MEMORY so that flag is still there ? or it is cleared during commit ?

The transaction state is changed to TRX_STATE_COMMITTED_IN_MEMORY in trx_t::commit_state(). This is protected by TMTrxGuard and happens before the end of trx_t::commit_in_memory() where the lock.was_chose_as_deadlock_victim is set to false.

If the lock.was_chosen_as_deadlock_victim is set just before commit, the commit can still proceed as the flag is not checked in commit code path, and the flag is cleared at the end of trx_t::commit_in_memory(). With this fix the flag cannot be set to true after the transaction has passed trx_t::commit_in_memory().

TODO: Add comment in code which clarifies this.

After code reading I can see that you are correct.

There is one more possible race when autocommit non-locking transaction changes its state to TRX_STATE_NOT_STARTED in commit_in_memory(). Not sure what to do about it...

janlindstrom

I just have one question but overall this looks very promising and better what we have now.

temeo · 2023-04-05T12:33:59Z

--thread-handling=pool-of-threads must be tested.

BF aborting through ha_abort_transaction() has proven to be problematic. The InnoDB implementation of wsrep_abort_transaction is racy with respect to THD/trx object access because THD mutexes cannot be held over whole BF abort process to avoid deadlocks when wsrep_abort_transaction calls back to server. Change the logic for BF aborts which originate from MDL in the following way: * First check if the BF abort should progress by calling wsrep-lib bf_abort() which checks the wsrep transaction and Galera side transaction states. * If the BF abort should progress, call ha_abort_transaction() without releasing LOCK_thd_data in between. * Remove call back to server in InnoDB wsrep_abort_transaction(). This makes the wsrep transaction state check and InnoDB transaction kill atomic, while simultaneously removing the need to call back to server from InnoDB. MTR test changes: - MDEV-25717: Introduce separate sync point in wsrep_abort_thd() to control thread execution during MDL induced BF abort. - galera_create_table_as_select: CTAS may now fail also with ER_QUERY_INTERRUPTED due to use of THD::awake().

temeo marked this pull request as draft April 5, 2023 09:52

janlindstrom reviewed Apr 5, 2023

View reviewed changes

janlindstrom approved these changes Apr 5, 2023

View reviewed changes

temeo force-pushed the 10.6-refact-mdl-bf-abort branch from 0143cde to 0d8dc5d Compare April 6, 2023 10:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor MDL BF aborting through ha_abort_transaction() #307

Refactor MDL BF aborting through ha_abort_transaction() #307

temeo commented Apr 5, 2023

janlindstrom Apr 5, 2023

temeo Apr 5, 2023

temeo Apr 5, 2023

janlindstrom Apr 5, 2023

temeo Apr 6, 2023

janlindstrom left a comment

temeo commented Apr 5, 2023

Refactor MDL BF aborting through ha_abort_transaction() #307

Are you sure you want to change the base?

Refactor MDL BF aborting through ha_abort_transaction() #307

Conversation

temeo commented Apr 5, 2023

janlindstrom Apr 5, 2023

Choose a reason for hiding this comment

temeo Apr 5, 2023

Choose a reason for hiding this comment

temeo Apr 5, 2023

Choose a reason for hiding this comment

janlindstrom Apr 5, 2023

Choose a reason for hiding this comment

temeo Apr 6, 2023

Choose a reason for hiding this comment

janlindstrom left a comment

Choose a reason for hiding this comment

temeo commented Apr 5, 2023