Skip to content

Releases: googleapis/python-bigquery-dataframes

v0.19.2

22 Jan 19:08
ae43905
Compare
Choose a tag to compare

0.19.2 (2024-01-22)

Bug Fixes

  • Read_gbq large response issue (#332) (b8178b9)
  • Use object dtype for ARRAY columns in to_pandas() with pandas 1.x (#329) (374ddb5)

Documentation

v0.19.1

17 Jan 01:20
f3cd2f4
Compare
Choose a tag to compare

0.19.1 (2024-01-17)

Bug Fixes

  • Handle multi-level columns for df aggregates properly (#305) (5bb45ba)
  • Update max_output_token limitation. (#308) (5cccd36)

Documentation

v0.19.0

10 Jan 00:29
aac35a3
Compare
Choose a tag to compare

0.19.0 (2024-01-09)

Features

  • Add 'columns' as an alias for 'col_order' (#298) (a01b271)
  • Add Series dt.tz and dt.unit properties (#303) (2e1a403)
  • Add to_gbq() method for LLM models (#299) (dafbc1b)
  • Allow manually set clustering_columns in dataframe.to_gbq (#302) (9c21323)
  • Support assigning to columns like a property (#304) (f645c56)
  • Support upcasting numeric columns in concat (#294) (e3a056a)

Bug Fixes

  • DF.drop tuple input as multi-index (#301) (21391a9)
  • Fix bug converting non-string labels to sql ids (#296) (a61c5fe)

Documentation

  • Add code samples for Series.ffill and DataFrame.ffill (#307) (1c63b45)

v0.18.0

03 Jan 19:29
655178a
Compare
Choose a tag to compare

0.18.0 (2024-01-02)

Features

  • Add dataframe.to_html (#259) (2cd6489)
  • Add IntervalIndex support to bigframes.pandas.cut (#254) (6c1969a)
  • Add replace method to DataFrame (#261) (5092215)
  • Specific pyarrow mappings for decimal, bytes types (#283) (a1c0631)

Bug Fixes

  • Dataframes.to_gbq now creates dataset if it doesn't exist (#222) (bac62f7)
  • Exclude pandas 2.2.0rc0 to unblock prerelease tests (#292) (ac1a745)
  • Fix DataFrameGroupby.agg() issue with as_index=False (#273) (ab49350)
  • Make Series.str.replace work for simple strings (#285) (ad67465)
  • Update dataframe.to_gbq to dedup column names. (#286) (746115d)
  • Use setuptools.find_namespace_packages (#246) (9ec352a)

Dependencies

  • Migrate to ibis-framework >= "7.1.0" (#53) (9798a2b)

Documentation

  • Add code snippets for explore query result page (#278) (7cbbb7d)
  • Code samples for astype common to DataFrame and Series (#280) (95b673a)
  • Code samples for DataFrame.copy and Series.copy (#290) (7cbc2b0)
  • Code samples for drop and fillna (#284) (9c5012e)
  • Code samples for isna, isnull, dropna, isin (#289) (ad51035)
  • Code samples for rename , size (#293) (eb69f60)
  • Code samples for reset_index and sort_values (#282) (acc0eb7)
  • Code samples for sample, get, Series.round (#295) (c2b1892)
  • Code samples for Series.{add, replace, unique, T, transpose} (#287) (0e1bbfc)
  • Code samples for Series.{map, to_list, count} (#290) (7cbc2b0)
  • Code samples for Series.{name, std, agg} (#293) (eb69f60)
  • Code samples for Series.groupby and Series.{sum,mean,min,max} (#280) (95b673a)
  • Code samples for DataFrame set_index, items (#295) (c2b1892)
  • Fix the rendering for get_dummies (#291) (252f3a2)

v0.17.0

14 Dec 06:21
31e1741
Compare
Choose a tag to compare

0.17.0 (2023-12-14)

Features

  • Add filters argument to read_gbq for enhanced data querying (#198) (034f71f)
  • Add module/class level api tracking (#272) (4f3db3d)
  • Deprecate use_regional_endpoints (#199) (319a1f2)

Bug Fixes

  • Increase recursion limit, cache compilation tree hashes (#184) (b54791c)
  • Replaced raise NotImplementedError with return NotImplemented (#258) (a133822)

Documentation

  • Add code samples for values and value_counts (#249) (f247d95)
  • Add sample for getting started with BQML (#141) (fb14f54)

v0.16.0

12 Dec 04:59
9cde708
Compare
Choose a tag to compare

0.16.0 (2023-12-12)

Features

  • Add ARIMAPlus.predict parameters (#264) (99598c7)
  • Add DataFrame from_dict and from_records methods (#244) (8d81e24)
  • Add DataFrame.select_dtypes method (#242) (1737acc)
  • Add nunique method to Series/DataFrameGroupby (#256) (c8ec245)
  • Support dataframe.loc with conditional columns selection (#233) (3febea9)

Bug Fixes

  • Enfore pandas version requirement <2.1.4 (#265) (9dd63f6)
  • Exclude pandas 2.1.4 from prerelease tests to unblock e2e tests (b02fc2c)
  • Fix value_counts column label for normalize=True (#245) (d3fa6f2)
  • Migrate e2e tests to bigframes-load-testing project (8766ac6)
  • Ml.sql logic (#262) (68c6fdf)
  • Update the llm_kmeans notebook (#247) (66d1839)

Documentation

  • Add code samples for shape and head (#257) (5bdcc65)
  • Add example for dataframe.melt, dataframe.pivot, dataframe.stac… (#252) (8c63697)
  • Add example to dataframe.nlargest, dataframe.nsmallest, datafra… (#234) (e735412)
  • Add examples for dataframe.cummin, dataframe.cummax, dataframe.cumsum, dataframe.cumprod (#243) (0523a31)
  • Add examples for dataframe.nunique, dataframe.diff, dataframe.a… (#251) (77074ec)
  • Correct the docs for option_context (#263) (d21c6dd)
  • Correct the params rendering for ml.remote and ml.ensemble modules (#248) (c2829e3)
  • Fix return annotation in API docstrings (#253) (89a1c67)

v0.15.0

29 Nov 22:46
8089b15
Compare
Choose a tag to compare

0.15.0 (2023-11-29)

⚠ BREAKING CHANGES

  • model.predict returns all the columns (#204)

Features

  • Add info and memory_usage methods to dataframe (#219) (9d6613d)
  • Add remote vertex model support (#237) (0bfc4fb)
  • Add the recent api method for ML component (#225) (ed8876d)
  • Model.predict returns all the columns (#204) (416171a)
  • Send warnings on LLM prediction partial failures (#216) (81125f9)

Bug Fixes

  • Add df snapshots lookup for read_gbq (#229) (d0d9b84)
  • Avoid unnecessary row_number() on sort key for io (#211) (a18d40e)
  • Dedup special character (#209) (dd78acb)
  • Invalid JSON type of the notebook (#215) (a729831)
  • Make to_pandas override enable_downsampling when sampling_method is manually set. (#200) (ae03756)
  • Polish the llm+kmeans notebook (#208) (e8532b1)
  • Update the llm+kmeans notebook with recent change (#236) (f8917ab)
  • Use anonymous dataset to create remote_function (#205) (69b016e)

Documentation

  • Add code samples for index and column properties (#212) (c88d38e)
  • Add code samples for df reshaping, function, merge, and join methods (#203) (010486c)
  • Add examples for dataframe.kurt, dataframe.std, dataframe.count (#232) (f9c6e72)
  • Add examples for dataframe.mean, dataframe.median, dataframe.va… (#228) (edd0522)
  • Add examples for dataframe.min, dataframe.max and dataframe.sum (#227) (3a375e8)
  • Code samples for Series.dot and DataFrame.dot (#226) (b62a07a)
  • Code samples for Series.where and Series.mask (#217) (52dfad2)
  • Code samples for dataframe.any, dataframe.all and dataframe.prod (#223) (d7957fa)
  • Make the code samples reflect default bq connection usage (#206) (71844b0)

Miscellaneous Chores

v0.14.1

16 Nov 04:26
5ab5059
Compare
Choose a tag to compare

0.14.1 (2023-11-16)

Bug Fixes

  • Correctly handle null values when initializing fingerprint ordering (#210) (8324f13)

Documentation

  • Add an example notebook about line graphs (#197) (f957b27)

v0.14.0

15 Nov 00:26
e2b7df5
Compare
Choose a tag to compare
v0.14.0 Pre-release
Pre-release

0.14.0 (2023-11-14)

Warning: This release has a known bug that row ordering unstable.

Features

  • Add 'cross' join support (#176) (765446a)
  • Add 'index', 'pad', 'nearest' interpolate methods (#162) (6a28403)
  • Add series.sample (identical to existing dataframe.sample) (#187) (37914a4)
  • Add unordered sql compilation (#156) (58f420c)
  • Log most recent API calls as recent-bigframes-api-xx labels on BigQuery jobs (#145) (4ea33b7)
  • Read_gbq creates order deterministically without table copy (#191) (8ab81de)
  • Support date_series.astype("string[pyarrow]") to cast DATE to STRING (#186) (aee0e8e)
  • Support series.at[row_label] = scalar (#173) (0c8bd33)
  • Temporary resources no longer use BigQuery Sessions (#194) (4a02cac)

Bug Fixes

  • All sort operation are now stable (#195) (3a2761f)
  • Default to 7 days expiration for read_csv, read_json, read_parquet (#193) (03606cd)
  • Deprecate the remote_service_type in llm model (#180) (a8a409a)
  • For reset_index on unnamed multiindex, always use level_[n] label (#182) (f95000d)
  • Match pandas behavior when assigning listlike to empty dfs (#172) (c1d1f42)
  • Use anonymous dataset instead of session dataset for temp tables (#181) (800d44e)
  • Use random table for read_pandas (#192) (741c75e)
  • Use random table when loading data for read_csv, read_json, read_parquet (#175) (9d2e6dc)

Documentation

  • Add code samples for read_gbq_function using community UDFs (#188) (7506eab)
  • Add docstring code samples for Series.apply and DataFrame.map (#185) (c816d84)
  • Add llm kmeans notebook as an included example (#177) (d49ae42)
  • Use head() to get top n results, not to preview results (#190) (87f84c9)

v0.13.0

07 Nov 16:57
8b6b1c6
Compare
Choose a tag to compare

0.13.0 (2023-11-07)

Features

  • to_gbq without a destination table writes to a temporary table (#158) (e1817c9)
  • Add DataFrame.__iter__, DataFrame.iterrows, DataFrame.itertuples, and DataFrame.keys methods (#164) (c065071)
  • Add Series.__iter__ method (#164) (c065071)
  • Add interpolate() to series and dataframe (#157) (b9cb55c)
  • Support 32k text-generation and multilingual embedding models (#161) (5f0ea37)

Bug Fixes

  • Update default temp table expiration to 7 days (#174) (4ff26cd)