Add logical operator support (`__and__`/`__or__` methods on dataframe and column) #171

MarcoGorelli · 2023-05-18T13:18:11Z

Also a typo fix for __add__.

rgommers

This really was meant to be __add__ I suspect, not __and__, since it came in with the update to Column which matched DataFrame methods, and lives right above __sub__ - and __add__ was part of the arithmetic methods in gh-94. Also, the docstring says "dataframe" and that should be "column".

Adding __and__ and __or__ too seems very reasonable though, I think we can have both __add__ and __and__ here.

MarcoGorelli · 2023-05-18T16:58:31Z

good catch, thanks, have updated

rgommers

LGTM, thanks Marco. The other thing that I thought for a minute should perhaps be different is the error type, however ValueError is also used by any/all when the dtype isn't bool. So this should be uncontroversial I think.
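To illustrate the error behaviour discussed here, the following is a minimal sketch, not the specification's code. The `Column` class, its string `dtype` attribute, and the helper `_bool_operand` are hypothetical stand-ins; only the choice of `ValueError` for non-bool inputs comes from the discussion above. Nulls are ignored in this sketch.

```python
from __future__ import annotations


class Column:
    """Hypothetical stand-in for a column object (illustration only)."""

    def __init__(self, values: list, dtype: str) -> None:
        self.values = list(values)
        self.dtype = dtype  # assumed to be a plain string such as "bool" or "int64"

    def _bool_operand(self, other: Column | bool) -> list:
        # Both sides must be boolean; otherwise raise ValueError,
        # mirroring what any/all do for non-bool dtypes.
        if self.dtype != "bool":
            raise ValueError(f"expected a bool column, got dtype {self.dtype!r}")
        if isinstance(other, Column):
            if other.dtype != "bool":
                raise ValueError(f"expected a bool column, got dtype {other.dtype!r}")
            if len(other.values) != len(self.values):
                raise ValueError("columns must have the same length")
            return other.values
        if isinstance(other, bool):
            # Broadcast a scalar bool to the column length.
            return [other] * len(self.values)
        raise ValueError(f"expected a bool column or bool, got {type(other).__name__}")

    def __and__(self, other: Column | bool) -> Column:
        rhs = self._bool_operand(other)
        return Column([a and b for a, b in zip(self.values, rhs)], "bool")

    def __or__(self, other: Column | bool) -> Column:
        rhs = self._bool_operand(other)
        return Column([a or b for a, b in zip(self.values, rhs)], "bool")
```

With this sketch, `Column([True, False], "bool") & True` yields a new bool column, while `Column([1, 2], "int64") & True` raises `ValueError`.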

rgommers · 2023-05-22T09:15:28Z

I plan to merge this within the next couple of days, unless there is a concern about adding __and__ and __or__.

kkraus14 · 2023-05-22T17:55:57Z

Should we clarify whether Kleene logic is used for these or not (to clarify null behavior)?

Also, it looks like for __and__ we're requiring bools, but for __or__ it's more generically Scalar. Should these be consistent?

MarcoGorelli · 2023-05-22T18:13:18Z

> but for __or__ it's more generically Scalar.

I think it's also bools there?

def __or__(self, other: Column[bool] | bool) -> Column:

it's __add__ which can more generically take a Scalar

EDIT: nvm, the docstring did in fact say 'Scalar' - have updated, thanks!

spec/API_specification/dataframe_api/dataframe_object.py

rgommers · 2023-05-25T11:54:21Z

> Should we clarify whether Kleene logic is used for these or not (to clarify null behavior)?

That would be good to add indeed. With the answer being that yes, Kleene logic is used I'd think - I assume that's true for all libraries?

Related: we probably should add __bool__ and make it raise, because truthiness of a column/dataframe is not defined.
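A rough sketch of the `__bool__` idea follows. The `Column` class here is a hypothetical stand-in, and the choice of `ValueError` is an assumption; the comment above only says the method should raise. The point is that Python's `and`/`or` keywords call `bool()` on their left operand, so raising there steers users towards `&` and `|`.

```python
class Column:
    """Hypothetical stand-in for a column object (illustration only)."""

    def __init__(self, values: list) -> None:
        self.values = list(values)

    def __bool__(self) -> bool:
        # Assumption: ValueError is used here; the discussion does not fix the error type.
        raise ValueError(
            "the truthiness of a column is not defined; "
            "use `&` / `|` for element-wise logic"
        )


def is_truthy_defined(col: Column) -> bool:
    """Return False if evaluating the column in a boolean context raises."""
    try:
        bool(col)
    except ValueError:
        return False
    return True
```

Under this sketch, `if col:` and `col and other` both fail loudly instead of silently picking one operand.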

jorisvandenbossche · 2023-05-25T13:11:29Z

> I assume that's true for all libraries?

Not for the default (numpy based) data types in pandas. But I would say that's a problem to tackle for pandas / the package implementing this standard on top of pandas. I agree we should specify that it is expected to use Kleene logic.

BTW, we should also specify the behaviour of nulls in the comparison methods (__eq__, __lt__, and friends)

MarcoGorelli · 2023-06-15T16:13:44Z

I've noted that nulls should follow Kleene logic - any further requests?
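For readers unfamiliar with Kleene (three-valued) logic, here is a small element-wise sketch. It uses `None` to represent null, which is an assumption made for illustration; the function names `kleene_and` and `kleene_or` are hypothetical and not part of the specification.

```python
from typing import Optional


def kleene_and(a: Optional[bool], b: Optional[bool]) -> Optional[bool]:
    """Three-valued AND: False dominates, otherwise null propagates."""
    if a is False or b is False:
        return False
    if a is None or b is None:
        return None
    return True


def kleene_or(a: Optional[bool], b: Optional[bool]) -> Optional[bool]:
    """Three-valued OR: True dominates, otherwise null propagates."""
    if a is True or b is True:
        return True
    if a is None or b is None:
        return None
    return False


# Element-wise use over two columns represented as plain lists.
left = [True, False, None, None]
right = [None, None, True, False]
and_result = [kleene_and(x, y) for x, y in zip(left, right)]
or_result = [kleene_or(x, y) for x, y in zip(left, right)]
```

The key property is that a null only survives when the other operand cannot decide the result on its own: `null & False` is `False` and `null | True` is `True`, while `null & True` and `null | False` stay null.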

spec/API_specification/dataframe_api/column_object.py

spec/API_specification/dataframe_api/dataframe_object.py

spec/API_specification/dataframe_api/column_object.py

MarcoGorelli · 2023-06-20T15:40:57Z

merging then - thanks for your reviews!

MarcoGorelli added 3 commits May 18, 2023 13:37

fix __and__ docstring

696640a

fix __and__ docs, add __or__

4d7122a

more fixups

200c6b4

rgommers reviewed May 18, 2023

MarcoGorelli marked this pull request as draft May 18, 2023 15:10

MarcoGorelli marked this pull request as ready for review May 18, 2023 16:56

rgommers approved these changes May 20, 2023

rgommers added the API design label May 20, 2023

rgommers changed the title ~~Fix __and__ docsting, add __or__~~ And logical operator support (__and__/__or__ methods on dataframe and column) May 20, 2023

MarcoGorelli commented May 22, 2023

spec/API_specification/dataframe_api/dataframe_object.py (outdated, resolved)

Update spec/API_specification/dataframe_api/dataframe_object.py

fdd56e5

note that nulls should follow kleene logic

e065b81

MarcoGorelli changed the title ~~And logical operator support (__and__/__or__ methods on dataframe and column)~~ Add logical operator support (__and__/__or__ methods on dataframe and column) Jun 15, 2023

type return

0d01891

kkraus14 approved these changes Jun 16, 2023

spec/API_specification/dataframe_api/column_object.py (outdated, resolved)

spec/API_specification/dataframe_api/dataframe_object.py (outdated, resolved)

MarcoGorelli commented Jun 20, 2023

spec/API_specification/dataframe_api/column_object.py (outdated, resolved)

Update spec/API_specification/dataframe_api/column_object.py

d305da6

MarcoGorelli merged commit 30b242d into data-apis:main Jun 20, 2023
