BUG: Fix problem with SparseDataFrame not persisting to csv #19441

Merged: 10 commits merged into pandas-dev:master on Feb 1, 2018

Conversation

@hexgnu
Contributor

hexgnu commented Jan 29, 2018
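
The PR description itself is not captured in this excerpt. For context, here is a minimal reproduction of the underlying report (GH 19384), sketched on the assumption of a pandas version from before this fix; SparseDataFrame was later removed from pandas entirely.

import pandas as pd

# On affected versions, writing a SparseDataFrame to CSV raised
# "IndexError: too many indices for array"; after this fix it produces the
# same output as the equivalent dense DataFrame.
sdf = pd.SparseDataFrame({'a': [1, 2]})
sdf.to_csv('sparse.csv')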

@@ -704,7 +704,9 @@ def to_native_types(self, slicer=None, na_rep='nan', quoting=None,
                        **kwargs):
        """ convert to our native types format, slicing if desired """

        values = self.values
        # values = self.values
Contributor

you can remove the comment
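
The hunk above only shows a line being commented out, and the merged change itself is not part of this excerpt. As a rough sketch of the underlying idea (the reported IndexError points at 2-D slicing of 1-D sparse values, so the formatter needs a dense ndarray first), here is an illustration using the modern pd.arrays.SparseArray API; the reshape and slice mimic what values[:, slicer] does in to_native_types and are not the actual patch.

import numpy as np
import pandas as pd

# A sparse column is backed by a 1-D SparseArray, so 2-D indexing such as
# values[:, slicer] fails with "too many indices for array".
sparse_values = pd.arrays.SparseArray([1.0, np.nan, 3.0])

# Densifying first yields a plain ndarray that can be reshaped and sliced
# the way the CSV formatting path expects.
dense_values = np.asarray(sparse_values)
print(dense_values.reshape(1, -1)[:, :2])  # [[ 1. nan]]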

@@ -253,3 +253,12 @@ def test_to_csv_string_array_utf8(self):
            df.to_csv(path, encoding='utf-8')
            with open(path, 'r') as f:
                assert f.read() == expected_utf8

    def test_to_csv_sparse_dataframe(self):
        sdf = pd.SparseDataFrame({'a': [1, 2]})
Contributor

add a comment here for the issue number. I would rather have this test in pandas/tests/sparse/frame/test_to_csv.py

can you add several variations, e.g. using fill value and with some nulls.
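
As an illustration of what such variations could cover (a hedged sketch only: these are not the exact cases that were merged, and they assume a pandas version that still ships SparseDataFrame):

import numpy as np
import pandas as pd

# A sparse frame with NaN holes and the default fill value...
with_nulls = pd.SparseDataFrame({'a': [1.0, np.nan, 2.0]})
# ...and one where the "missing" entries are an explicit fill value of 0.
with_fill = pd.SparseDataFrame({'a': [0, 1, 0, 2]}, default_fill_value=0)

# Both should round-trip through to_csv without raising.
print(with_nulls.to_csv())
print(with_fill.to_csv())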

@jreback added the IO CSV and Sparse labels Jan 30, 2018
@pep8speaks

pep8speaks commented Jan 30, 2018

Hello @hexgnu! Thanks for updating the PR.

Cheers ! There are no PEP8 issues in this Pull Request. 🍻

Comment last updated on February 01, 2018 at 13:26 Hours UTC

@hexgnu
Contributor Author

hexgnu commented Jan 30, 2018

K updated thanks @jreback

@TomAugspurger
Contributor

@hexgnu do the tests you've written pass for you locally?

@hexgnu
Contributor Author

hexgnu commented Jan 30, 2018

oi sorry, should pass now.

@TomAugspurger
Contributor

Still one lint failure: https://travis-ci.org/pandas-dev/pandas/jobs/335405859#L3035

I'm looking into the (unrelated) S3 failures later this morning.

@codecov

codecov bot commented Jan 31, 2018

Codecov Report

Merging #19441 into master will increase coverage by 0.05%.
The diff coverage is 100%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master   #19441      +/-   ##
==========================================
+ Coverage   91.62%   91.67%   +0.05%     
==========================================
  Files         150      148       -2     
  Lines       48724    48553     -171     
==========================================
- Hits        44642    44513     -129     
+ Misses       4082     4040      -42
Flag Coverage Δ
#multiple 90.04% <100%> (+0.05%) ⬆️
#single 41.72% <100%> (-0.03%) ⬇️
Impacted Files Coverage Δ
pandas/core/internals.py 95.47% <100%> (ø) ⬆️
pandas/plotting/_converter.py 65.22% <0%> (-1.74%) ⬇️
pandas/core/frame.py 97.42% <0%> (-0.16%) ⬇️
pandas/core/indexes/multi.py 95.06% <0%> (-0.09%) ⬇️
pandas/io/parsers.py 95.49% <0%> (ø) ⬆️
pandas/core/resample.py 96.43% <0%> (ø) ⬆️
pandas/plotting/_core.py 82.41% <0%> (ø) ⬆️
pandas/io/formats/format.py 96.24% <0%> (ø) ⬆️
pandas/core/dtypes/generic.py 100% <0%> (ø) ⬆️
... and 10 more

Continue to review full report at Codecov.

Legend
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 238499a...87946d2. Read the comment docs.

@hexgnu
Contributor Author

hexgnu commented Feb 1, 2018

Ok for real this time it passed @jreback @TomAugspurger thanks 😄

def test_to_csv_sparse_dataframe():
    fill_values = [np.nan, 0, None, 1]

    for fill_value in fill_values:
Contributor

can you parameterize
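
A sketch of how the loop could be turned into a pytest parametrization. The round-trip assertion and file name are illustrative rather than the exact test that was merged, and it assumes a pandas version that still provides SparseDataFrame.

import numpy as np
import pandas as pd
import pytest

import pandas.util.testing as tm


@pytest.mark.parametrize('fill_value', [np.nan, 0, None, 1])
def test_to_csv_sparse_dataframe(fill_value):
    # GH 19384: SparseDataFrame.to_csv raised "too many indices for array"
    sdf = pd.SparseDataFrame({'a': [fill_value, 1, 2]},
                             default_fill_value=fill_value)

    with tm.ensure_clean('sparse.csv') as path:
        sdf.to_csv(path, index=False)  # raised IndexError before the fix
        with open(path) as fh:
            # the sparse frame should write the same CSV as its dense form
            assert fh.read() == sdf.to_dense().to_csv(index=False)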

@jreback added this to the 0.23.0 milestone Feb 1, 2018


def test_to_csv_sparse_dataframe():
    fill_values = [np.nan, 0, None, 1]
Contributor

can you add the issue number here as well

@@ -497,7 +497,7 @@ I/O
- Bug in :func:`DataFrame.to_parquet` where an exception was raised if the write destination is S3 (:issue:`19134`)
- :class:`Interval` now supported in :func:`DataFrame.to_excel` for all Excel file types (:issue:`19242`)
- :class:`Timedelta` now supported in :func:`DataFrame.to_excel` for xls file type (:issue:`19242`, :issue:`9155`)
-
- Bug in :class:`SparseDataFrame.to_csv` where too many indices for values (:issue:`19384`)
Contributor

move to sparse, and don't need this kind of detail, just say was raising.

@jreback
Contributor

jreback commented Feb 1, 2018

a lint error. ping when pushed and green.

@jreback
Contributor

jreback commented Feb 1, 2018

great. ping on green.

@TomAugspurger merged commit 78ba063 into pandas-dev:master Feb 1, 2018
@TomAugspurger
Contributor

Thanks @hexgnu!

harisbal pushed a commit to harisbal/pandas that referenced this pull request Feb 28, 2018
…ev#19441)

* BUG: Fix problem with SparseDataFrame not persisting to csv

* FIX: Remove comment and move test with more coverage

* FIX: Flake8 issues cleanup

* Fix failing test due to blank lines

* FIX: linting errors on whitespace

* Use parametrize on test

* Move bug description to sparse header

* Add GH issue to test

* Fix linting error
Labels
IO CSV (read_csv, to_csv), Sparse (Sparse Data Type)
Development

Successfully merging this pull request may close these issues.

SparseDataFrame to_csv returns IndexError: too many indices for array
4 participants