PYTHON-4669 - Update More APIs for Motor Compatibility #1815

NoahStapp · 2024-08-27T13:49:42Z

No description provided.

blink1073

LGTM!

ShaneHarvey · 2024-08-27T21:02:37Z

gridfs/asynchronous/grid_file.py

@@ -1484,6 +1486,17 @@ def __init__(
    _file: Any
    _chunk_iter: Any

+    async def __anext__(self) -> bytes:
+        return super().__next__()


This is incorrect. We can't call super().__next__() because that does blocking I/O.

Hmm, good catch. We don't have an async equivalent here unless we write one ourselves.

IOBase implements next using readline:

IOBase (and its subclasses) supports the iterator protocol, meaning that an IOBase object can be iterated over yielding the lines in a stream. Lines are defined slightly differently depending on whether the stream is a binary stream (yielding bytes), or a text stream (yielding character strings). See readline() below.

https://docs.python.org/3/library/io.html#io.IOBase

There isn't an asyncio version of readline, so we'd need to write our own. The canonical way to do so appears to be with threads (https://stackoverflow.com/questions/34699948/does-asyncio-supports-asynchronous-i-o-for-file-operations), at which point I question if the performance gained by not blocking the loop is more than the cost of thread overhead. The official CPython forums have similar concerns at the OS level: https://discuss.python.org/t/asyncio-for-files/31077/15.

ShaneHarvey · 2024-08-27T21:04:05Z

gridfs/synchronous/grid_file.py

+    def __next__(self) -> bytes:
+        return super().__next__()
+
+    def __next__(self) -> bytes:  # noqa: F811, RUF100


Any way to avoid the duplicate def __next__(self) definitions?

This is a limitation of the synchro script: it will translate the async __anext__ into __next__, but we want to have a separate __next__ for the async class that raises an error. That explicit __next__ will also get ported to the synchronous class unfortunately, giving us the duplicate defs.

Right, but can we workaround that? The duplicate code is strange the read. There's also a runtime perf cost to overriding a method just to call the super class.

There isn't a simple way to workaround it, no. We could change the definition to be less confusing, like this:

async def __anext__(self) -> bytes: return super().__next__() if not _IS_SYNC: def __next__(self) -> bytes: # noqa: F811, RUF100 raise TypeError( "AsyncGridOut does not support synchronous iteration. Use `async for` instead" )

Which would synchronize to

def __next__(self) -> bytes: return super().__next__() if not _IS_SYNC: def __next__(self) -> bytes: # noqa: F811, RUF100 raise TypeError("GridOut does not support synchronous iteration. Use `for` instead")

We can also add a comment explaining why the duplicate def exists.

Good idea, but how about:

if not _IS_SYNC: async def __anext__(self) -> bytes: return await self.readline() def __next__(self) -> bytes: # noqa: F811, RUF100 raise TypeError( "AsyncGridOut does not support synchronous iteration. Use `async for` instead" )

I forgot we had our own async readline, good catch. This looks like we'd read a single line or every byte if the file wasn't line-delimited. Is that the intended behavior for iteration here?

Yeah that's what IOBase is supposed to do and AsyncGridOut iteration should match the sync version. We also need to remove IOBase from the async.

Done, follow-up PR: #1821.

PYTHON-4669 - Update More APIs for Motor Compatibility

00f569b

mongodb-drivers-pr-bot bot requested a review from Jibola August 27, 2024 14:07

Motor compat changes

54d333a

blink1073 requested review from blink1073 and removed request for Jibola August 27, 2024 17:26

blink1073 approved these changes Aug 27, 2024

View reviewed changes

NoahStapp merged commit 81ea92b into mongodb:master Aug 27, 2024
34 checks passed

NoahStapp deleted the PYTHON-4669 branch August 27, 2024 17:38

ShaneHarvey reviewed Aug 27, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

PYTHON-4669 - Update More APIs for Motor Compatibility #1815

PYTHON-4669 - Update More APIs for Motor Compatibility #1815

Uh oh!

NoahStapp commented Aug 27, 2024

Uh oh!

blink1073 left a comment

Uh oh!

Uh oh!

ShaneHarvey Aug 27, 2024

Uh oh!

NoahStapp Aug 28, 2024

Uh oh!

ShaneHarvey Aug 28, 2024

Uh oh!

NoahStapp Aug 29, 2024 •

edited

Loading

Uh oh!

ShaneHarvey Aug 27, 2024

Uh oh!

NoahStapp Aug 28, 2024

Uh oh!

ShaneHarvey Aug 28, 2024

Uh oh!

NoahStapp Aug 29, 2024

Uh oh!

ShaneHarvey Aug 29, 2024

Uh oh!

NoahStapp Aug 29, 2024

Uh oh!

ShaneHarvey Aug 29, 2024

Uh oh!

NoahStapp Aug 29, 2024 •

edited

Loading

Uh oh!

Uh oh!

PYTHON-4669 - Update More APIs for Motor Compatibility #1815

PYTHON-4669 - Update More APIs for Motor Compatibility #1815

Uh oh!

Conversation

NoahStapp commented Aug 27, 2024

Uh oh!

blink1073 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

NoahStapp Aug 29, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

NoahStapp Aug 29, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

NoahStapp Aug 29, 2024 •

edited

Loading

NoahStapp Aug 29, 2024 •

edited

Loading