Skip to content

PDEP-18: Nullable Object Dtype #61599

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

simonjayhawkins
Copy link
Member

as per PDEP-1

The initial status of a PDEP will be Status: Draft. This will be changed to Status: Under discussion by the author(s), when they are ready to proceed with the decision making process.

but comments are surely welcome in the meantime.

@simonjayhawkins simonjayhawkins added the PDEP pandas enhancement proposal label Jun 7, 2025
@jbrockmendel
Copy link
Member

“Object” analogous to “Float64”?

@datapythonista
Copy link
Member

At least to me the PDEP will be easier to read (and comment on) if you limit the line width, to 80 or similar.

The idea sounds good, it'd be good if you can provide information on how using a boolean mask compares to having pandas.NA inside the main array.

@simonjayhawkins
Copy link
Member Author

At least to me the PDEP will be easier to read (and comment on) if you limit the line width, to 80 or similar.

sure.

it'd be good if you can provide information on how using a boolean mask compares to having pandas.NA inside the main array.

using a sentinel as opposed to a mask is an implementation detail that I can expand on. I'm assuming that this would still be a separate dtype from the traditional numpy dtype, a pandas nullable dtype? We have the string array backed by a masked object array that I was effectively proposing reusing/refactoring as a base class.

There is also another option which is maybe what you are proposing: making a breaking change to the exisiting numpy object dtype to handle pd.NA differently? This is perhaps what is in the rejected ideas section and that needs clarification?

@simonjayhawkins
Copy link
Member Author

“Object” analogous to “Float64”?

That's the obvious choice but IIRC the capitalization was considered as confusing/non intuitive by some when discussed with respect to the string dtype.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
PDEP pandas enhancement proposal
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants