Retrying transactions with backoff #698

thebrianchen · 2019-08-09T23:53:17Z

Good call on recommending that I try to implement Android first before merging web. This has been a painful but educational experience.

A couple things about Android:

The recursive method seemed like the only viable option here, since we cannot await backoffs in the same thread as the one calling it (something that I realized after a bit too much struggling). That being said, I still think the web approach is cleaner and more readable compared to Android. How do you feel about keeping the implementations different? I'm also not sure how I would port the whole TaskCompletionSource strategy used by Android for recursion over to web.
The skipDelaysForTimerIds approach of web doesn't port over to Android because it relies on runDelayedTasksUntil, which blocks until the delay is completed. This means that we must call delays from outside the AsyncQueue, or else it deadlocks. Overriding the backoff is the only other approach that works, aside from calling runDelayedTasksUntil on a 10ms interval on a separate thread. Thoughts on this approach?

mikelehen

Mostly LGTM but I'm not psyched about removeTransactionBackoffs(). I think we can make skipDelaysForTimerId() work.

Also, I think this code is portable to JS, and it would be worthwhile to change JS to run transaction code on the AsyncQueue anyway (other than the user's updateFunction) in order to make the platforms more consistent. As-is, it's not clear what is an intentional deviation and what is accidental.

If the recursive transaction update code is getting too gnarly, I think we could probably simplify things by introducing a TransactionRunner class that encapsulates the logic of running / retrying with backoff. Right now, SyncEngine.transaction() is getting a bit unwieldy with 5 arguments, when outside callers (i.e. FirestoreClient.transaction()) really only need to provide asyncQueue and updateFunction. The rest are just there to enable the method to call itself recursively, I think.
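To make the suggestion concrete, here is a minimal sketch of a runner object that holds the retry count and backoff delay as fields, so the retry method no longer has to pass them to itself as arguments. It is not the SDK's TransactionRunner: the class name `RetryingRunner`, its constructor parameters, and the use of `CompletableFuture` and a single-threaded `ScheduledExecutorService` in place of Task and AsyncQueue are all assumptions made for illustration.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;
import java.util.function.Supplier;

/** Hypothetical stand-in for a transaction runner; assumes a single-threaded queue. */
final class RetryingRunner<T> {
  private final ScheduledExecutorService queue;
  private final Supplier<CompletableFuture<T>> attempt;
  private final CompletableFuture<T> result = new CompletableFuture<>();
  private int retriesLeft;
  private long nextDelayMs;

  RetryingRunner(
      ScheduledExecutorService queue,
      Supplier<CompletableFuture<T>> attempt,
      int maxRetries,
      long initialDelayMs) {
    this.queue = queue;
    this.attempt = attempt;
    this.retriesLeft = maxRetries;
    this.nextDelayMs = initialDelayMs;
  }

  /** Schedules the first attempt and returns the future for the final outcome. */
  CompletableFuture<T> run() {
    queue.execute(this::runAttempt);
    return result;
  }

  private void runAttempt() {
    CompletableFuture<T> attemptFuture;
    try {
      attemptFuture = attempt.get();
    } catch (RuntimeException e) {
      // A throwing attempt fails the whole run instead of leaving it pending.
      result.completeExceptionally(e);
      return;
    }
    // Handle the outcome back on the queue so the fields are only touched there.
    attemptFuture.whenCompleteAsync(this::handleOutcome, queue);
  }

  private void handleOutcome(T value, Throwable error) {
    if (error == null) {
      result.complete(value);
    } else if (retriesLeft > 0) {
      retriesLeft--;
      long delayMs = nextDelayMs;
      nextDelayMs *= 2; // Plain doubling; a real backoff would likely add jitter and a cap.
      queue.schedule(this::runAttempt, delayMs, TimeUnit.MILLISECONDS);
    } else {
      result.completeExceptionally(error);
    }
  }
}
```

With this shape, an outside caller would only supply the queue and the attempt function, for example `new RetryingRunner<>(queue, attemptFn, 5, 1000).run()`, and the retry bookkeeping stays private to the runner.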

firebase-firestore/src/main/java/com/google/firebase/firestore/util/ExponentialBackoff.java

firebase-firestore/src/main/java/com/google/firebase/firestore/core/FirestoreClient.java

firebase-firestore/src/androidTest/java/com/google/firebase/firestore/TransactionTest.java

firebase-firestore/src/main/java/com/google/firebase/firestore/core/SyncEngine.java

firebase-firestore/src/main/java/com/google/firebase/firestore/util/AsyncQueue.java

thebrianchen

Thanks for the review! Quick question: from a code health and maintainability perspective, why is it bad to introduce new test hooks, like I did with removeTransactionBackoffs? I can see how skipDelaysForTimerId is a more elegant solution, but when is it appropriate/preferred to add new test hooks?

mikelehen

Thanks for reworking! I have some minor feedback, but overall I'm liking this approach better.

firebase-firestore/src/main/java/com/google/firebase/firestore/core/TransactionRunner.java

mikelehen · 2019-08-12T22:39:15Z

firebase-firestore/src/main/java/com/google/firebase/firestore/core/TransactionRunner.java

+  }
+
+  /** Returns the result of the transaction after it has been run. */
+  public Task<TResult> getTask() {


Is there a reason for this to be a separate function? I think I'd just make runTransaction() return the Task<TResult>.

This will also probably port better to iOS (runTransaction() will just accept a completion callback instead of returning a Task).

firebase-firestore/src/main/java/com/google/firebase/firestore/core/TransactionRunner.java

firebase-firestore/src/main/java/com/google/firebase/firestore/util/AsyncQueue.java

firebase-firestore/src/main/java/com/google/firebase/firestore/core/SyncEngine.java

mikelehen · 2019-08-12T23:44:17Z

Oh! By the way, I realized I forgot to answer your question:

from a code health and maintainability perspective, why is it bad to introduce new test hooks, like I did with removeTransactionBackoffs? I can see how skipDelaysForTimerId is a more elegant solution, but when is it appropriate/preferred to add new test hooks?

Uh, this is pretty subjective. I guess my general philosophy is that test hooks are bad for a variety of reasons, including:

You end up shipping test-only code as part of your product which means it's essentially "dead code."
It means your tests are modifying the behavior of the client which necessarily means it's not a fully realistic test so there's risk of masking bugs or confusing future devs working on the code / tests.

And so in general, it's good to have as few test hooks as possible, keep the scope as limited as possible, and make them reusable so you don't have to invent more test hooks later.

In our case, we've already chosen the AsyncQueue as a useful place to allow tests to hook in to deal with time-related issues in tests (i.e. fast-forwarding time via runDelayedOperationsEarly()). So adding new test hooks to the AsyncQueue is preferable to a new transaction-specific test hook. It's more consistent and is more likely to be reusable in the future.
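As a rough illustration of that argument, the sketch below puts a single reusable test hook on a queue that schedules delayed work by timer ID. It is an assumption-laden stand-in, not the SDK's AsyncQueue: `DemoDelayQueue`, `DemoTimerId`, and the method signatures are hypothetical, and only the idea of skipping delays per timer ID comes from the discussion above.

```java
import java.util.Collections;
import java.util.EnumSet;
import java.util.Set;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.ScheduledFuture;
import java.util.concurrent.TimeUnit;

/** Hypothetical timer IDs; each kind of delayed operation gets its own ID. */
enum DemoTimerId {
  RETRY_TRANSACTION,
  STREAM_IDLE
}

/** Hypothetical queue that runs delayed tasks and lets tests skip selected delays. */
final class DemoDelayQueue {
  private final ScheduledExecutorService executor =
      Executors.newSingleThreadScheduledExecutor();
  private final Set<DemoTimerId> skippedTimers =
      Collections.synchronizedSet(EnumSet.noneOf(DemoTimerId.class));

  /** Test-only hook: tasks scheduled later under this ID run with zero delay. */
  void skipDelaysForTimerId(DemoTimerId timerId) {
    skippedTimers.add(timerId);
  }

  /** Schedules the task after delayMs, or immediately if its timer ID is skipped. */
  ScheduledFuture<?> enqueueAfterDelay(DemoTimerId timerId, long delayMs, Runnable task) {
    long effectiveDelayMs = skippedTimers.contains(timerId) ? 0 : delayMs;
    return executor.schedule(task, effectiveDelayMs, TimeUnit.MILLISECONDS);
  }

  void shutdown() {
    executor.shutdownNow();
  }
}
```

A test for transaction retries could call `skipDelaysForTimerId(DemoTimerId.RETRY_TRANSACTION)` once during setup, and a later test for some other timer could reuse the same hook with a different ID, without any transaction-specific code shipping in the product.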

mikelehen

A few remaining nits, but this basically LGTM.

firebase-firestore/src/main/java/com/google/firebase/firestore/core/SyncEngine.java

firebase-firestore/src/main/java/com/google/firebase/firestore/core/TransactionRunner.java

thebrianchen · 2019-08-13T01:13:32Z

/test new-smoke-tests

mikelehen

LGTM except for typo.

mikelehen · 2019-08-13T14:52:18Z

firebase-firestore/src/main/java/com/google/firebase/firestore/core/TransactionRunner.java

@@ -27,8 +26,7 @@
 import com.google.firebase.firestore.util.ExponentialBackoff;

 /**
- * TransactionRunner encapsulates the logic needed to run and retry transactions so that the caller
- * does not have to manage the backoff and retry count through recursive calls.
+ * TransactionRunner encapsulates the logic needed to run and retry transactions without backoff.


without => with ? 😄

Brian Chen added 3 commits August 9, 2019 16:39

working everything

abb8367

lint

fdbd0a5

rename txTaskSource to transactionSource

3d8eda2

thebrianchen added the api: firestore label Aug 9, 2019

thebrianchen requested a review from mikelehen August 9, 2019 23:53

thebrianchen assigned mikelehen Aug 9, 2019

googlebot added the cla: yes Override cla label Aug 9, 2019

google-oss-bot added the size/L label Aug 9, 2019

lint

8e3bc3d

thebrianchen changed the title ~~Bc/tx backoff~~ Retrying transactions with backoff Aug 12, 2019

mikelehen suggested changes Aug 12, 2019


mikelehen assigned thebrianchen and unassigned mikelehen Aug 12, 2019

Added TransactionRunner

a90317c

thebrianchen force-pushed the bc/tx-backoff branch from ee1e768 to a90317c Compare August 12, 2019 21:50

add comments to AsyncQueue

6c3264d

thebrianchen commented Aug 12, 2019


thebrianchen requested a review from mikelehen August 12, 2019 21:58

thebrianchen assigned mikelehen and unassigned thebrianchen Aug 12, 2019

mikelehen suggested changes Aug 12, 2019


mikelehen assigned thebrianchen and unassigned mikelehen Aug 12, 2019

resolve michael's comments

1e25218

thebrianchen requested a review from mikelehen August 12, 2019 23:33

thebrianchen assigned mikelehen and unassigned thebrianchen Aug 12, 2019

mikelehen suggested changes Aug 13, 2019


really resolving the comments

7d290c6

mikelehen approved these changes Aug 13, 2019


mikelehen assigned schmidt-sebastian and thebrianchen and unassigned mikelehen and schmidt-sebastian Aug 13, 2019

Brian Chen added 2 commits August 13, 2019 10:26

typo machine pls turn off

3b1ecfd

lint

c7864a1

thebrianchen removed the cla: yes Override cla label Aug 13, 2019

googlebot added the cla: yes Override cla label Aug 13, 2019

changelog

a7c8d1c

thebrianchen merged commit 137fb97 into master Aug 13, 2019

thebrianchen deleted the bc/tx-backoff branch August 13, 2019 23:08

thebrianchen mentioned this pull request Aug 14, 2019

Retrying transactions with backoff firebase/firebase-ios-sdk#3599

Merged

firebase locked and limited conversation to collaborators Oct 8, 2019
