Index-Free (2/6): Add update_time to SQLRemoteDocumentCache schema #615

schmidt-sebastian · 2019-07-15T08:57:24Z

This is the second of 6 PRs for "index-free" querying.

This PR modifies the SQL schema to add the update time to the RemoteDocumentCache. There is currently no migration that populates this information; instead, the consuming code will handle NULL values.

We can discuss a migration strategy later.

wilhuff · 2019-07-18T02:04:14Z

firebase-firestore/src/main/java/com/google/firebase/firestore/model/SnapshotVersion.java

@@ -36,6 +37,12 @@ public Timestamp getTimestamp() {
    return timestamp;
  }

+  /** Returns the microseconds since EPOCH that this snapshot version represents. */
+  public long toMicroseconds() {


Historically we've tried to avoid baking in this assumption about backend timestamp resolution into our system. This is because the backend currently truncates this time but they have an open bug for fixing this.

Meanwhile we've already established a precedent for how to represent timestamps literally in our schema. For example, the targets table includes the following:

+ "snapshot_version_seconds INTEGER, " + "snapshot_version_nanos INTEGER, "

It seems like we lose nothing by representing these accurately--even if we only query on seconds we'll be fine for the purposes of index-free querying--so it seems like we should follow the established precedent and represent timestamps at full resolution.

I updated this PR to use the same seconds/nanos approach as the targets table. I didn't want to do this originally because it would result in some overselection, but I assume that you are correct when you say that we don't need to worry about this here.
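To make the trade-off in this thread concrete, here is a minimal sketch, not code from this PR. The class `UpdateTimeEncoding` and its methods are hypothetical names chosen for illustration. It shows that collapsing a timestamp into microseconds drops the sub-microsecond digits, while a (seconds, nanos) pair keeps full resolution and still orders correctly.

```java
import java.util.concurrent.TimeUnit;

/** Hypothetical helper (not part of the SDK) comparing two ways of storing a timestamp. */
final class UpdateTimeEncoding {
  private UpdateTimeEncoding() {}

  /**
   * Collapses seconds and nanoseconds into microseconds since the epoch.
   * Integer division discards anything finer than one microsecond.
   */
  static long toMicroseconds(long seconds, int nanos) {
    return TimeUnit.SECONDS.toMicros(seconds) + nanos / 1000;
  }

  /**
   * Orders two timestamps stored as separate (seconds, nanos) values:
   * seconds first, nanos as the tie-breaker. No resolution is lost.
   */
  static int compare(long seconds1, int nanos1, long seconds2, int nanos2) {
    int bySeconds = Long.compare(seconds1, seconds2);
    return bySeconds != 0 ? bySeconds : Integer.compare(nanos1, nanos2);
  }
}
```

Two timestamps that differ only below the microsecond, such as (1 s, 1500 ns) and (1 s, 1999 ns), map to the same microsecond value but remain distinguishable when stored as two columns.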

wilhuff · 2019-07-20T17:36:51Z

firebase-firestore/src/main/java/com/google/firebase/firestore/local/SQLiteSchema.java

@@ -351,6 +355,16 @@ private void addSequenceNumber() {
    }
  }

+  private void addUpdateTime() {
+    if (!tableContainsColumn("remote_documents", "snapshot_version_seconds")) {


Since there are multiple versions of a document, it'd probably be better to follow the name used on the backend and call this the update_time_seconds -- that way it's clear what the correspondence is with the proto fields.

Done. Future me will thank you for this :)
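For reference, a minimal sketch of the column check after the rename, written so it runs without Android. Assumptions: `UpdateTimeMigration` and `statementsFor` are hypothetical names, the nanos column is assumed to be called `update_time_nanos` by analogy with `update_time_seconds`, and the existing column names are passed in as a set instead of being looked up with `tableContainsColumn`. The real method executes the statements through `db.execSQL`; this sketch only returns them.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Set;

/** Hypothetical, Android-free sketch of the add-column step for remote_documents. */
final class UpdateTimeMigration {
  private UpdateTimeMigration() {}

  /**
   * Returns the ALTER TABLE statements still needed, given the columns that
   * remote_documents already has. Column names are assumptions (see lead-in).
   */
  static List<String> statementsFor(Set<String> existingColumns) {
    boolean hasSeconds = existingColumns.contains("update_time_seconds");
    boolean hasNanos = existingColumns.contains("update_time_nanos");

    // The two columns are always added together, so finding only one is a bug.
    if (hasSeconds != hasNanos) {
      throw new IllegalStateException(
          "Table contained only one of update_time_seconds and update_time_nanos");
    }
    if (hasSeconds) {
      return Collections.emptyList(); // Already migrated; nothing to do.
    }

    List<String> statements = new ArrayList<>();
    statements.add("ALTER TABLE remote_documents ADD COLUMN update_time_seconds INTEGER");
    statements.add("ALTER TABLE remote_documents ADD COLUMN update_time_nanos INTEGER");
    return statements;
  }
}
```

Because the new columns have no default, rows that existed before the upgrade read back as NULL in both.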

wilhuff · 2019-07-20T17:41:28Z

firebase-firestore/src/main/java/com/google/firebase/firestore/local/SQLiteSchema.java

+          "Table contained snapshot_version_seconds, but is missing snapshot_version_nanos");
+      db.execSQL("ALTER TABLE remote_documents ADD COLUMN snapshot_version_seconds INTEGER");
+      db.execSQL("ALTER TABLE remote_documents ADD COLUMN snapshot_version_nanos INTEGER");
+    }


If all the documents in the cache have a zero version doesn't that mean the query to find recently updated documents has to also check for zero? Doesn't that nearly nullify the benefits of index-free on any device that has pre-existing data?

Consider an existing device containing two targets, A updated at T1 matching D1 and D2. There's also a second target, listened to later: B updated at T2 matching D3. D3 actually matches the query for target A. All of this happened before this schema upgrade, so after the upgrade they'll all have update_time of zero.

If the user queries on A again we need to find D3, but it's going to have a zero update_time.

I think we're better off either:

- actually populating update_time based on the value from inside the document, or
- dropping the contents of the remote_documents table.

Though the latter is pretty unfriendly so I think we should only consider it if all other possibilities are exhausted.

Alternatively, maybe you have something up your sleeve to handle this case that I'm not seeing :-).

I tried to describe my strategy for this in the PR comment, but added it as a comment in the code itself for more visibility. Basically, I haven't quite decided what the best way forward is, as migrating the contents of the entire RemoteDocumentCache could be quite slow. I decided to punt on this until later (when we actually want to enable this).
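A minimal sketch of the concern raised in this thread, under stated assumptions: `LegacyUpdateTimeScan`, `Row`, and `candidatesSince` are hypothetical names, the lookup runs over an in-memory list rather than SQL, and a missing update time is modelled as `null` (the PR description mentions NULL, the review comment speaks of zero). It shows why a "recently updated" lookup cannot skip rows written before the upgrade: every such row has to be treated as a candidate.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical in-memory model of scanning remote_documents by update time. */
final class LegacyUpdateTimeScan {
  private LegacyUpdateTimeScan() {}

  /** A cached row; updateTimeSeconds is null for rows written before the schema upgrade. */
  static final class Row {
    final String key;
    final Long updateTimeSeconds;

    Row(String key, Long updateTimeSeconds) {
      this.key = key;
      this.updateTimeSeconds = updateTimeSeconds;
    }
  }

  /**
   * Returns keys of rows that may have changed at or after sinceSeconds.
   * Rows without an update time cannot be ruled out, so they are always included.
   * Comparing on whole seconds also overselects within the boundary second.
   */
  static List<String> candidatesSince(List<Row> rows, long sinceSeconds) {
    List<String> keys = new ArrayList<>();
    for (Row row : rows) {
      if (row.updateTimeSeconds == null || row.updateTimeSeconds >= sinceSeconds) {
        keys.add(row.key);
      }
    }
    return keys;
  }
}
```

On a device where D1, D2, and D3 were all cached before the upgrade, this lookup returns all three regardless of `sinceSeconds`, which is the lost benefit described above unless the values are populated by a migration.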

schmidt-sebastian · 2019-07-22T21:06:02Z

/test device-check-changed

schmidt-sebastian · 2019-07-22T23:17:16Z

/test device-check-changed

wilhuff

LGTM

schmidt-sebastian added 3 commits July 14, 2019 20:22

Adding LocalStoreTestHelper for NO_CHANGE event

cce8fd0

Add update_time to SQLRemoteDocumentCache schema

3d2d896

Fix comment

9de2df8

googlebot added the cla: yes Override cla label Jul 15, 2019

google-oss-bot added the size/S label Jul 15, 2019

schmidt-sebastian requested a review from wilhuff July 15, 2019 09:02

schmidt-sebastian assigned wilhuff Jul 15, 2019

schmidt-sebastian added api: firestore and removed api: firestore labels Jul 15, 2019

schmidt-sebastian added 2 commits July 15, 2019 16:29

Update ktx's test-util

2bf12c2

Merge branch 'mrschmidt/indexfree-1' into mrschmidt/indexfree-2

b9a3664

schmidt-sebastian changed the title ~~Add update_time to SQLRemoteDocumentCache schema~~ Index-Free (2/6): Add update_time to SQLRemoteDocumentCache schema Jul 15, 2019

wilhuff suggested changes Jul 18, 2019

wilhuff assigned schmidt-sebastian and unassigned wilhuff Jul 18, 2019

Use seconds and nanos

ddee17b

schmidt-sebastian assigned wilhuff and unassigned schmidt-sebastian Jul 18, 2019

wilhuff reviewed Jul 20, 2019

schmidt-sebastian changed the base branch from mrschmidt/indexfree-1 to master July 22, 2019 18:18

google-oss-bot added size/L and removed size/S labels Jul 22, 2019

Merge

5679784

google-oss-bot added size/M and removed size/L labels Jul 22, 2019

Address comments

b77194a

wilhuff approved these changes Jul 23, 2019

wilhuff merged commit 6135c4a into master Jul 23, 2019

schmidt-sebastian mentioned this pull request Jul 31, 2019

Revert "Add update_time to SQLRemoteDocumentCache schema" #674

Merged

schmidt-sebastian deleted the mrschmidt/indexfree-2 branch August 27, 2019 23:45

schmidt-sebastian mentioned this pull request Aug 29, 2019

Index-Free: Track readTime in the RemoteDocument store firebase/firebase-js-sdk#2125

Merged

firebase locked and limited conversation to collaborators Oct 9, 2019
