Releases: googleapis/python-bigquery-dataframes
Releases · googleapis/python-bigquery-dataframes
v0.19.2
v0.19.1
v0.19.0
0.19.0 (2024-01-09)
Features
- Add 'columns' as an alias for 'col_order' (#298) (a01b271)
- Add Series dt.tz and dt.unit properties (#303) (2e1a403)
- Add to_gbq() method for LLM models (#299) (dafbc1b)
- Allow manually set clustering_columns in dataframe.to_gbq (#302) (9c21323)
- Support assigning to columns like a property (#304) (f645c56)
- Support upcasting numeric columns in concat (#294) (e3a056a)
Bug Fixes
- DF.drop tuple input as multi-index (#301) (21391a9)
- Fix bug converting non-string labels to sql ids (#296) (a61c5fe)
Documentation
v0.18.0
0.18.0 (2024-01-02)
Features
- Add
dataframe.to_html
(#259) (2cd6489) - Add
IntervalIndex
support tobigframes.pandas.cut
(#254) (6c1969a) - Add
replace
method to DataFrame (#261) (5092215) - Specific pyarrow mappings for decimal, bytes types (#283) (a1c0631)
Bug Fixes
Dataframes.to_gbq
now creates dataset if it doesn't exist (#222) (bac62f7)- Exclude pandas 2.2.0rc0 to unblock prerelease tests (#292) (ac1a745)
- Fix
DataFrameGroupby.agg()
issue with as_index=False (#273) (ab49350) - Make
Series.str.replace
work for simple strings (#285) (ad67465) - Update
dataframe.to_gbq
to dedup column names. (#286) (746115d) - Use setuptools.find_namespace_packages (#246) (9ec352a)
Dependencies
Documentation
- Add code snippets for explore query result page (#278) (7cbbb7d)
- Code samples for
astype
common to DataFrame and Series (#280) (95b673a) - Code samples for
DataFrame.copy
andSeries.copy
(#290) (7cbc2b0) - Code samples for
drop
andfillna
(#284) (9c5012e) - Code samples for
isna
,isnull
,dropna
,isin
(#289) (ad51035) - Code samples for
rename
,size
(#293) (eb69f60) - Code samples for
reset_index
andsort_values
(#282) (acc0eb7) - Code samples for
sample
,get
,Series.round
(#295) (c2b1892) - Code samples for
Series.{add, replace, unique, T, transpose}
(#287) (0e1bbfc) - Code samples for
Series.{map, to_list, count}
(#290) (7cbc2b0) - Code samples for
Series.{name, std, agg}
(#293) (eb69f60) - Code samples for
Series.groupby
andSeries.{sum,mean,min,max}
(#280) (95b673a) - Code samples for DataFrame
set_index
,items
(#295) (c2b1892) - Fix the rendering for
get_dummies
(#291) (252f3a2)
v0.17.0
0.17.0 (2023-12-14)
Features
- Add
filters
argument toread_gbq
for enhanced data querying (#198) (034f71f) - Add module/class level api tracking (#272) (4f3db3d)
- Deprecate
use_regional_endpoints
(#199) (319a1f2)
Bug Fixes
- Increase recursion limit, cache compilation tree hashes (#184) (b54791c)
- Replaced raise
NotImplementedError
with returnNotImplemented
(#258) (a133822)
Documentation
v0.16.0
0.16.0 (2023-12-12)
Features
- Add ARIMAPlus.predict parameters (#264) (99598c7)
- Add DataFrame from_dict and from_records methods (#244) (8d81e24)
- Add DataFrame.select_dtypes method (#242) (1737acc)
- Add nunique method to Series/DataFrameGroupby (#256) (c8ec245)
- Support dataframe.loc with conditional columns selection (#233) (3febea9)
Bug Fixes
- Enfore pandas version requirement <2.1.4 (#265) (9dd63f6)
- Exclude pandas 2.1.4 from prerelease tests to unblock e2e tests (b02fc2c)
- Fix value_counts column label for normalize=True (#245) (d3fa6f2)
- Migrate e2e tests to bigframes-load-testing project (8766ac6)
- Ml.sql logic (#262) (68c6fdf)
- Update the llm_kmeans notebook (#247) (66d1839)
Documentation
- Add code samples for
shape
andhead
(#257) (5bdcc65) - Add example for dataframe.melt, dataframe.pivot, dataframe.stac… (#252) (8c63697)
- Add example to dataframe.nlargest, dataframe.nsmallest, datafra… (#234) (e735412)
- Add examples for dataframe.cummin, dataframe.cummax, dataframe.cumsum, dataframe.cumprod (#243) (0523a31)
- Add examples for dataframe.nunique, dataframe.diff, dataframe.a… (#251) (77074ec)
- Correct the docs for
option_context
(#263) (d21c6dd) - Correct the params rendering for
ml.remote
andml.ensemble
modules (#248) (c2829e3) - Fix return annotation in API docstrings (#253) (89a1c67)
v0.15.0
0.15.0 (2023-11-29)
⚠ BREAKING CHANGES
- model.predict returns all the columns (#204)
Features
- Add info and memory_usage methods to dataframe (#219) (9d6613d)
- Add remote vertex model support (#237) (0bfc4fb)
- Add the recent api method for ML component (#225) (ed8876d)
- Model.predict returns all the columns (#204) (416171a)
- Send warnings on LLM prediction partial failures (#216) (81125f9)
Bug Fixes
- Add df snapshots lookup for
read_gbq
(#229) (d0d9b84) - Avoid unnecessary row_number() on sort key for io (#211) (a18d40e)
- Dedup special character (#209) (dd78acb)
- Invalid JSON type of the notebook (#215) (a729831)
- Make to_pandas override enable_downsampling when sampling_method is manually set. (#200) (ae03756)
- Polish the llm+kmeans notebook (#208) (e8532b1)
- Update the llm+kmeans notebook with recent change (#236) (f8917ab)
- Use anonymous dataset to create
remote_function
(#205) (69b016e)
Documentation
- Add code samples for
index
andcolumn
properties (#212) (c88d38e) - Add code samples for df reshaping, function, merge, and join methods (#203) (010486c)
- Add examples for dataframe.kurt, dataframe.std, dataframe.count (#232) (f9c6e72)
- Add examples for dataframe.mean, dataframe.median, dataframe.va… (#228) (edd0522)
- Add examples for dataframe.min, dataframe.max and dataframe.sum (#227) (3a375e8)
- Code samples for
Series.dot
andDataFrame.dot
(#226) (b62a07a) - Code samples for
Series.where
andSeries.mask
(#217) (52dfad2) - Code samples for dataframe.any, dataframe.all and dataframe.prod (#223) (d7957fa)
- Make the code samples reflect default bq connection usage (#206) (71844b0)
Miscellaneous Chores
v0.14.1
v0.14.0
0.14.0 (2023-11-14)
Warning: This release has a known bug that row ordering unstable.
Features
- Add 'cross' join support (#176) (765446a)
- Add 'index', 'pad', 'nearest' interpolate methods (#162) (6a28403)
- Add series.sample (identical to existing dataframe.sample) (#187) (37914a4)
- Add unordered sql compilation (#156) (58f420c)
- Log most recent API calls as
recent-bigframes-api-xx
labels on BigQuery jobs (#145) (4ea33b7) - Read_gbq creates order deterministically without table copy (#191) (8ab81de)
- Support
date_series.astype("string[pyarrow]")
to cast DATE to STRING (#186) (aee0e8e) - Support
series.at[row_label] = scalar
(#173) (0c8bd33) - Temporary resources no longer use BigQuery Sessions (#194) (4a02cac)
Bug Fixes
- All sort operation are now stable (#195) (3a2761f)
- Default to 7 days expiration for
read_csv
,read_json
,read_parquet
(#193) (03606cd) - Deprecate the
remote_service_type
in llm model (#180) (a8a409a) - For reset_index on unnamed multiindex, always use level_[n] label (#182) (f95000d)
- Match pandas behavior when assigning listlike to empty dfs (#172) (c1d1f42)
- Use anonymous dataset instead of session dataset for temp tables (#181) (800d44e)
- Use random table for
read_pandas
(#192) (741c75e) - Use random table when loading data for
read_csv
,read_json
,read_parquet
(#175) (9d2e6dc)
Documentation
v0.13.0
0.13.0 (2023-11-07)
Features
to_gbq
without a destination table writes to a temporary table (#158) (e1817c9)- Add
DataFrame.__iter__
,DataFrame.iterrows
,DataFrame.itertuples
, andDataFrame.keys
methods (#164) (c065071) - Add
Series.__iter__
method (#164) (c065071) - Add interpolate() to series and dataframe (#157) (b9cb55c)
- Support 32k text-generation and multilingual embedding models (#161) (5f0ea37)