refactor: move multiplexed session handling to separate class #3063

Merged

olavloite merged 52 commits into main from mux-benchmark-experiments

May 2, 2024

Collaborator

olavloite commented Apr 25, 2024

Refactors the multiplexed session implementation to use its own class instead of being combined with the existing session pool. This simplifies the code path for multiplexed sessions, and makes it possible to entirely remove the SessionPool class when all operations can be executed on multiplexed sessions.

olavloite added 17 commits

April 24, 2024 10:10


          chore: try with random channel hint

e2dde6f


          chore: add option for random channel

e4b585f


          chore: actually use random channel option

0cf711d


          chore: only lock the specific wrapper

fa1c7a1


          chore: simplify creation and assignment

66763e5


          chore: make more variables final

78fe4f7


          Merge branch 'main' into mux-benchmark-experiments

1ea4c28


          chore: use separate pool

3eab7fa


          chore: use a separate mux client

571f8a6


          chore: make init blocking

ef9939b


          chore: disable pending tx check

7afe362


          chore: add call durations to client lib

3e7a6e4


          chore: add call_durations

648fe58


          chore: use session pool for mux session


          chore: use mux database client

14f9fc9


          chore: make mux client optional


          refactor: move multiplexed session handling to separate class

b17215d

product-auto-label bot added size: l api: spanner labels

olavloite added 4 commits

April 25, 2024 16:50


          chore: cleanup

5cb343b


          feat: add maintainer

f7394d1


          chore: add more tests


          chore: fix test failures

92795df

product-auto-label bot added size: xl and removed size: l labels

olavloite added 5 commits

April 26, 2024 13:35


          fix: ChannelUsageTest should keep session in use for longer

9162a2b


          test: skip ChannelUsageTest in all cases

b71ef15


          chore: keep track of DatabaseDeleted errors

e027e2e


          fix: freeze server to prevent flakiness

8258ce7


          fix: freeze server to prevent flakiness

9bbb402

rahul2393 approved these changes

olavloite and others added 2 commits

April 30, 2024 14:32


          Merge branch 'main' into mux-benchmark-experiments

9e3a3f5


          🦉 Updates from OwlBot post-processor

fe27c6a

See https://github.com/googleapis/repo-automation-bots/blob/main/packages/owl-bot/README.md

olavloite requested a review from a team as a code owner

April 30, 2024 12:35

olavloite and others added 14 commits

April 30, 2024 14:57


          chore: add random channel hint as option

16b3578


          Merge branch 'mux-benchmark-experiments' of github.com:googleapis/java-spanner into mux-benchmark-experiments

7a042fc


          chore: add single-use channel hint

00088a5


          🦉 Updates from OwlBot post-processor

2ea7c0a

See https://github.com/googleapis/repo-automation-bots/blob/main/packages/owl-bot/README.md


          chore: single-use hint

b26ed58


          Merge branch 'mux-benchmark-experiments' of github.com:googleapis/java-spanner into mux-benchmark-experiments

f3b1519


          🦉 Updates from OwlBot post-processor

c39148e

See https://github.com/googleapis/repo-automation-bots/blob/main/packages/owl-bot/README.md


          chore: use next available channel

45b2c0d


          Merge branch 'mux-benchmark-experiments' of github.com:googleapis/java-spanner into mux-benchmark-experiments

3cacd70


          chore: keep track of num transactions and channels in use

9cac61a


          chore: remove println

5bc5040


          chore: remove option for using session pool for mux

807120d


          feat: add UNIMPLEMENTED handler

55daa50


          chore: cleanup

5979f2d

olavloite added the kokoro:force-run label

yoshi-kokoro removed kokoro:force-run labels

arpan14 approved these changes

...e-cloud-spanner/src/main/java/com/google/cloud/spanner/MultiplexedSessionDatabaseClient.java

+                 * It is enough with one executor to maintain the multiplexed sessions in all the clients, as they
+                 * do not need to be updated often, and the maintenance task is light.
+                 */
+                private static final ScheduledExecutorService MAINTAINER_SERVICE =

Contributor

arpan14 May 1, 2024

Will this be a new thread that maintains the multiplexed sessions? If so, was re-using the existing thread harder to implement?

Collaborator Author

olavloite May 1, 2024

It would have been slightly harder, but that was not my main reason for not using the same thread. My reasoning for creating a separate maintainer and executor was:

1. This separates multiplexed sessions completely from the session pool. That means that once all operations work on multiplexed sessions, we can just remove the entire session pool implementation.
2. This maintainer uses a ScheduledThreadPoolExecutor with zero core threads, and only runs the task once every 10 minutes. The additional resource usage is therefore minimal.
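
For readers who want to see the shape of such a maintainer, here is a minimal sketch of a single shared ScheduledThreadPoolExecutor with zero core threads that runs a periodic task. The class name, thread name, and helper methods are hypothetical and are not taken from this PR; only the zero-core-thread executor and the fixed-rate schedule come from the comment above.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.ScheduledFuture;
import java.util.concurrent.ScheduledThreadPoolExecutor;
import java.util.concurrent.ThreadFactory;
import java.util.concurrent.TimeUnit;

/** Hypothetical sketch of a shared maintainer executor; not the code from this PR. */
class SharedMaintainerSketch {
  // One executor shared by all clients, created with zero core threads.
  static final ScheduledExecutorService MAINTAINER_SERVICE = createExecutor();

  static ScheduledExecutorService createExecutor() {
    // Daemon threads so that the executor never keeps the JVM alive (an assumption of this sketch).
    ThreadFactory daemonFactory =
        runnable -> {
          Thread thread = new Thread(runnable, "mux-session-maintainer-sketch");
          thread.setDaemon(true);
          return thread;
        };
    return new ScheduledThreadPoolExecutor(0, daemonFactory);
  }

  /** Schedules the maintenance task to run once per period, starting after one period. */
  static ScheduledFuture<?> scheduleMaintenance(Runnable task, long period, TimeUnit unit) {
    return MAINTAINER_SERVICE.scheduleAtFixedRate(task, period, period, unit);
  }

  /** Demo helper: returns true if a task scheduled with the given period ran `times` times in time. */
  static boolean runsAtLeast(int times, long periodMillis, long waitMillis) {
    CountDownLatch latch = new CountDownLatch(times);
    ScheduledFuture<?> future =
        scheduleMaintenance(latch::countDown, periodMillis, TimeUnit.MILLISECONDS);
    try {
      return latch.await(waitMillis, TimeUnit.MILLISECONDS);
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      return false;
    } finally {
      future.cancel(false);
    }
  }
}
```

With the interval mentioned in the comment, the call would look like scheduleMaintenance(task, 10, TimeUnit.MINUTES).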

...e-cloud-spanner/src/main/java/com/google/cloud/spanner/MultiplexedSessionDatabaseClient.java

+                 * This flag is set to true if the server return UNIMPLEMENTED when we try to create a multiplexed
+                 * session.
+                 */
+                private final AtomicBoolean unimplemented = new AtomicBoolean(false);

Contributor

arpan14 May 1, 2024

Nit: The formatting looks off. Also, should we add a TODO that this is a temporary measure and we can remove it in future when this has a bit of increased usage?

Collaborator Author

olavloite May 1, 2024

I added a TODO for removing.

I'm not sure what you mean by the formatting being off. The code is formatted using the normal formatter, and I don't see anything strange with this line or the ones directly above.
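
As an illustration of how such a flag can be used, the sketch below remembers an UNIMPLEMENTED response in an AtomicBoolean and stops attempting multiplexed session creation afterwards. Falling back to a regular session, the exception type, and the use of String values as sessions are all assumptions made for this example; the PR only states that the flag is set when the server returns UNIMPLEMENTED.

```java
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.function.Supplier;

/** Hypothetical sketch of an UNIMPLEMENTED handler; not the code from this PR. */
class UnimplementedFallbackSketch {
  /** Stand-in for the error raised when the server returns UNIMPLEMENTED (hypothetical type). */
  static class UnimplementedException extends RuntimeException {}

  // Set to true once the server has returned UNIMPLEMENTED for multiplexed session creation.
  private final AtomicBoolean unimplemented = new AtomicBoolean(false);
  private final Supplier<String> createMultiplexedSession;
  private final Supplier<String> regularSessionFallback;

  UnimplementedFallbackSketch(
      Supplier<String> createMultiplexedSession, Supplier<String> regularSessionFallback) {
    this.createMultiplexedSession = createMultiplexedSession;
    this.regularSessionFallback = regularSessionFallback;
  }

  /** Returns a session name, skipping multiplexed creation once the flag has been set. */
  String getSession() {
    if (unimplemented.get()) {
      return regularSessionFallback.get();
    }
    try {
      return createMultiplexedSession.get();
    } catch (UnimplementedException e) {
      // Assumption of this sketch: remember the failure and use a regular session instead.
      unimplemented.set(true);
      return regularSessionFallback.get();
    }
  }

  boolean isUnimplemented() {
    return unimplemented.get();
  }
}
```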

...e-cloud-spanner/src/main/java/com/google/cloud/spanner/MultiplexedSessionDatabaseClient.java


		package com.google.cloud.spanner;

		import static com.google.cloud.spanner.SessionImpl.NO_CHANNEL_HINT;

Contributor

arpan14 May 1, 2024

Given we are initializing this to -1, can there be an edge case where the value gets decremented to -1? Would using a boxed integer and setting null be a better default than -1?

Collaborator Author

olavloite May 1, 2024

I think that there's a bit of confusion between this value and the numCurrentSingleUseTransactions:

1. The channel hint that is generated will never be -1. It is always calculated by getting the first clear bit from index 0. That means that it will always be >= 0.
2. The numCurrentSingleUseTransactions gets incremented and decremented based on transactions being started and ended. That value should also never get to -1. In theory it could if we were 'over-closing' single-use transactions, but even then we have a safeguard in the onReadDone() method that only decrements it the first time the method is called, and ignores any second call.

One other scenario that is a bit more probable (but still not very likely) is that reads never call onReadDone. This could cause numCurrentSingleUseTransactions to get higher than it should. The effect of this would be that we would fall back to using a random channel hint more often than we ideally should. (The same method is also used to end spans, which means that it would also cause spans not to be ended. That is already the case in the current regular session implementation.)
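
The behaviour described in this reply can be sketched roughly as follows. This is a simplified illustration, not the PR's code: the class names, the locking strategy, and the rule of returning NO_CHANNEL_HINT when every channel is taken are assumptions. What it does take from the reply is the first-clear-bit lookup, the transaction counter, and an onReadDone() that only takes effect once.

```java
import java.util.BitSet;
import java.util.concurrent.atomic.AtomicBoolean;

/** Hypothetical sketch of bitset-based channel hints; not the code from this PR. */
class ChannelHintSketch {
  static final int NO_CHANNEL_HINT = -1;

  private final int numChannels;
  private final BitSet channelsInUse;
  private int numCurrentSingleUseTransactions;

  ChannelHintSketch(int numChannels) {
    this.numChannels = numChannels;
    this.channelsInUse = new BitSet(numChannels);
  }

  /** Handle for one single-use read. */
  final class SingleUseRead {
    final int channelHint;
    private final AtomicBoolean done = new AtomicBoolean(false);

    SingleUseRead(int channelHint) {
      this.channelHint = channelHint;
    }

    /** Only the first call releases the channel and decrements the counter. */
    void onReadDone() {
      if (done.compareAndSet(false, true)) {
        release(channelHint);
      }
    }
  }

  synchronized SingleUseRead startSingleUseRead() {
    numCurrentSingleUseTransactions++;
    // nextClearBit(0) never returns a negative value, so a generated hint is always >= 0.
    int hint = channelsInUse.nextClearBit(0);
    if (hint >= numChannels) {
      // Assumption of this sketch: all channels are taken, so the caller picks a random channel.
      return new SingleUseRead(NO_CHANNEL_HINT);
    }
    channelsInUse.set(hint);
    return new SingleUseRead(hint);
  }

  private synchronized void release(int hint) {
    numCurrentSingleUseTransactions--;
    if (hint != NO_CHANNEL_HINT) {
      channelsInUse.clear(hint);
    }
  }

  synchronized int getNumCurrentSingleUseTransactions() {
    return numCurrentSingleUseTransactions;
  }
}
```

In this sketch a read that never calls onReadDone() keeps its bit set, which is how the counter drifting upwards leads to more random hints than ideal.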

...e-cloud-spanner/src/main/java/com/google/cloud/spanner/MultiplexedSessionDatabaseClient.java

+                static class MultiplexedSessionTransaction extends SessionImpl {
+                  private final MultiplexedSessionDatabaseClient client;
+                  private final boolean singleUse;

Contributor

arpan14 May 1, 2024

1. How is the channel hint managed for multi-use transactions?
2. Given we will always set the options at SessionImpl, the random hint logic that was introduced in AbstractReadContext becomes dead code. Do we clean it up here itself? Or do that in a separate PR? I'm fine either way.

Collaborator Author

olavloite May 1, 2024

1. Channel hints for multi-use read-only transactions are always random. In theory we could use the same strategy for them as for single-use read-only transactions, but I think we should not, because it would make the 'channel-hint-bitset' depend on read-only transactions always being closed. Failure to close read-only transactions has been a common source of session leaks in the past, and that is exactly what we want to get rid of with multiplexed sessions. Adding a new dependency on closing all read-only transactions here could in theory give us slightly better latency if someone is running 1qps multi-use read-only transactions, but I don't think that benefit outweighs the potential downside.
2. The code for assigning a random hint in AbstractReadContext is still used. We only create an options instance if the channel hint is not the special value NO_CHANNEL_HINT. See java-spanner/google-cloud-spanner/src/main/java/com/google/cloud/spanner/SessionImpl.java, line 125 in 5979f2d:

if (channelHint == NO_CHANNEL_HINT) {

What happens in this case is the following:
2.1. MultiplexedSessionDatabaseClient returns NO_CHANNEL_HINT if we should just use a random channel. See java-spanner/google-cloud-spanner/src/main/java/com/google/cloud/spanner/MultiplexedSessionDatabaseClient.java, line 315 in 5979f2d:

return NO_CHANNEL_HINT;

2.2. That value is then passed to the constructor of SessionImpl.
2.3. SessionImpl checks if the channel hint is NO_CHANNEL_HINT, and if so does not create an options instance, but instead calls sessionReference.getOptions().
2.4. sessionReference.getOptions() returns null for multiplexed sessions and the affiliated channel for regular sessions.
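
Steps 2.1 to 2.4 can be condensed into the following sketch. The class names, the String-keyed map, and the "CHANNEL_HINT" key are hypothetical stand-ins chosen for illustration; the real SessionImpl and session reference types are not reproduced here. The part that follows the description is the branch: an explicit hint produces a new options instance, while NO_CHANNEL_HINT defers to the session reference, which returns null for multiplexed sessions.

```java
import java.util.Collections;
import java.util.Map;

/** Hypothetical sketch of the option selection described above; not the code from this PR. */
class SessionOptionsSketch {
  static final int NO_CHANNEL_HINT = -1;

  /** Stand-in for the session reference: holds the options of the affiliated channel, or null. */
  static class SessionReferenceSketch {
    private final Map<String, Object> options;

    SessionReferenceSketch(Map<String, Object> options) {
      this.options = options;
    }

    Map<String, Object> getOptions() {
      return options;
    }
  }

  /** Returns the options that a session created with the given hint would use. */
  static Map<String, Object> resolveOptions(int channelHint, SessionReferenceSketch reference) {
    if (channelHint == NO_CHANNEL_HINT) {
      // No explicit hint: use whatever the session reference provides (null for multiplexed
      // sessions), which leaves the random hint logic further down the call chain in charge.
      return reference.getOptions();
    }
    // Explicit hint: create a dedicated options instance ("CHANNEL_HINT" is a hypothetical key).
    return Collections.<String, Object>singletonMap("CHANNEL_HINT", channelHint);
  }
}
```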

olavloite and others added 5 commits

May 1, 2024 20:37


          chore: add TODO for removing the unimplemented handling

9d6845c


          🦉 Updates from OwlBot post-processor

00c9a44

See https://github.com/googleapis/repo-automation-bots/blob/main/packages/owl-bot/README.md


          chore: run formatter

b3aee68


          Merge branch 'mux-benchmark-experiments' of github.com:googleapis/java-spanner into mux-benchmark-experiments

d18b834


          test: fix flaky tests

794cbd8

Fixes #3050
Fixes #3081
Fixes #3080

olavloite merged commit b2795a7 into main

olavloite deleted the mux-benchmark-experiments branch

May 2, 2024 08:41

psinghbay1 reviewed

google-cloud-spanner/src/main/java/com/google/cloud/spanner/ForwardingAsyncResultSet.java

                 }
                 @Override
                 public CursorState tryNext() throws SpannerException {
-                  return delegate.tryNext();
+                  return getDelegate().tryNext();

psinghbay1 May 24, 2024

👍

Labels

api: spanner size: xl