Index | index by Group | index by Distribution | index by Vendor | index by creation date | index by Name | Mirrors | Help | Search |
Name: python312-pandas-performance | Distribution: openSUSE Tumbleweed |
Version: 2.3.2 | Vendor: openSUSE |
Release: 1.2 | Build date: Sun Sep 14 23:48:11 2025 |
Group: Unspecified | Build host: reproducible |
Size: 13171 | Source RPM: python-pandas-2.3.2-1.2.src.rpm |
Packager: http://bugs.opensuse.org | |
Url: https://pandas.pydata.org/ | |
Summary: The python pandas[performance] extra |
This package provides the [performance] extra for python-pandas It is highly recommended to install this subpackage, as its dependencies provide speed improvements, especially when working with large data sets.
BSD-3-Clause
* Sun Sep 14 2025 Dirk Müller <dmueller@suse.com> - update to 2.3.2: * Fix to_json() with orient="table" to correctly use the “string” type in the JSON Table Schema for StringDtype columns * Boolean operations (|, &, ^) with bool-dtype objects on the left and StringDtype objects on the right now cast the string to bool, with a deprecation warning * Fixed match(), fullmatch() and contains() string methods with compiled regex for the Arrow-backed string dtype * Bug in Series.replace() and DataFrame.replace() inconsistently replacing matching values when missing values are present for string dtypes * Fri Jul 11 2025 Ben Greiner <code@bnavigator.de> - Update to 2.3.1 * This release includes some improvements and fixes to the future string data type (preview feature for the upcoming pandas 3.0) [#]# Improvements and fixes for the StringDtype * Comparisons between different string dtypes * Index set operations ignore empty RangeIndex and object dtype Index [#]# Bug fixes * Bug in DataFrameGroupBy.min(), DataFrameGroupBy.max(), Resampler.min(), Resampler.max() where all NA values of string dtype would return float instead of string dtype (GH 60810) * Bug in DataFrame.join() incorrectly downcasting object-dtype indexes (GH 61771) * Bug in DataFrame.sum() with axis=1, DataFrameGroupBy.sum() or SeriesGroupBy.sum() with skipna=True, and Resampler.sum() with all NA values of StringDtype resulted in 0 instead of the empty string "" (GH 60229) * Fixed bug in DataFrame.explode() and Series.explode() where methods would fail with dtype="str" (GH 61623) * Fixed bug in unpickling objects pickled in pandas versions pre-2.3.0 that used StringDtype (GH 61763) - Release 2.3.0 [#]# Enhancements * The semantics for the copy keyword in __array__ methods (i.e. called when using np.array() or np.asarray() on pandas objects) has been updated to work correctly with NumPy >= 2 (GH 57739) * Series.str.decode() result now has StringDtype when future.infer_string is True (GH 60709) * to_hdf() and to_hdf() now round-trip with StringDtype (GH 60663) * Improved repr of NumpyExtensionArray to account for NEP51 (GH 61085) * The Series.str.decode() has gained the argument dtype to control the dtype of the result (GH 60940) * The cumsum(), cummin(), and cummax() reductions are now implemented for StringDtype columns (GH 60633) * The sum() reduction is now implemented for StringDtype columns (GH 59853) [#]# Deprecations * Deprecated allowing non-bool values for na in str.contains(), str.startswith(), and str.endswith() for dtypes that do not already disallow these (GH 59615) * Deprecated the "pyarrow_numpy" storage option for StringDtype (GH 60152) * The deprecation of setting the argument include_groups to True in DataFrameGroupBy.apply() has been promoted from a DeprecationWarning to FutureWarning; only False will be allowed (GH 7155) [#]# Bug fixes [#]## Numeric * Bug in Series.mode() and DataFrame.mode() with dropna=False where not all dtypes would sort in the presence of NA values (GH 60702) * Bug in Series.round() where a TypeError would always raise with object dtype (GH 61206) [#]## Strings * Bug in Series.__pos__() and DataFrame.__pos__() where an Exception was not raised for StringDtype with storage="pyarrow" (GH 60710) * Bug in Series.rank() for StringDtype with storage="pyarrow" that incorrectly returned integer results with method="average" and raised an error if it would truncate results (GH 59768) * Bug in Series.replace() with StringDtype when replacing with a non-string value was not upcasting to object dtype (GH 60282) * Bug in Series.str.center() with StringDtype with storage="pyarrow" not matching the python behavior in corner cases with an odd number of fill characters (GH 54792) * Bug in Series.str.replace() when n < 0 for StringDtype with storage="pyarrow" (GH 59628) * Bug in Series.str.slice() with negative step with ArrowDtype and StringDtype with storage="pyarrow" giving incorrect results (GH 59710) [#]## Indexing Bug in Index.get_indexer() round-tripping through string dtype when infer_string is enabled (GH 55834) [#]## I/O * Bug in DataFrame.to_excel() which stored decimals as strings instead of numbers (GH 49598) [#]## Other * Fixed usage of inspect when the optional dependencies pyarrow or jinja2 are not installed (GH 60196) - Drop patches: * timedelta.patch * pandas-pr60545-arrow-exception.patch * pandas-pr60584-60586-mpl-vert.patch - Refresh dropna.patch to pandas-pr61132-dropna.patch * gh#pandas-dev/pandas#61132 * Thu Mar 20 2025 Markéta Machová <mmachova@suse.com> - Add dropna.patch and timedelta.patch to fix tests with Numpy 2.2 * Thu Feb 20 2025 Ben Greiner <code@bnavigator.de> - Support pyarrow 19 * Add pandas-pr60545-arrow-exception.patch gh#pandas-dev/pandas#60545 * Skip TestParquetPyArrow.test_roundtrip_decimal: Would require gh#pandas-dev/pandas#60755 and all other backports of the new string type system. * Fri Feb 07 2025 Ben Greiner <code@bnavigator.de> - Add pandas-pr60584-60586-mpl-vert.patch * Fixes matplotlib deprecation errors in the test suite * gh#pandas-dev/pandas#60584 * backported in gh#pandas-dev/pandas#60586 * Fri Dec 13 2024 Steve Kowalik <steven.kowalik@suse.com> - Change skipped tests to also support Python 3.13. * Wed Nov 27 2024 Markéta Machová <mmachova@suse.com> - Drop tests-nomkl.patch and tests-wasm.patch, not needed anymore - Skip a test failing with new xarray * Fri Oct 25 2024 Steve Kowalik <steven.kowalik@suse.com> - Skip two tests that fail with Numpy 2.1. * Fri Oct 11 2024 Steve Kowalik <steven.kowalik@suse.com> - Prepare for Python 3.13, by skipping it if we aren't building for it. * Tue Oct 01 2024 John Paul Adrian Glaubitz <adrian.glaubitz@suse.com> - Update to 2.2.3 * Bug in eval() on complex including division / discards imaginary part. (GH 21374) * Minor fixes for numpy 2.1 compatibility. (GH 59444) * Missing licenses for 3rd party dependencies were added back into the wheels. (GH 58632) - Drop pandas-pr58269-pyarrow16xpass.patch, merged upstream - Drop pandas-pr58484-matplotlib.patch, merged upstream - Drop pandas-pr59175-matplotlib.patch, merged upstream - Drop pandas-pr59353-np2eval.patch, merged upstream - Drop tests-npdev.patch, merged upstream - Drop tests-timedelta.patch, merged upstream - Refresh tests-nomkl.patch - Renumber remaining patches * Mon Sep 16 2024 Markéta Machová <mmachova@suse.com> - Add bunch of patches to fix the testsuite with NumPy 2.1 * tests-wasm.patch * tests-nomkl.patch * tests-timedelta.patch * tests-npdev.patch - Skip one test failing with new timezone, the patch would be too big * Sun Sep 08 2024 Ben Greiner <code@bnavigator.de> - Drop pandas-pr58720-xarray-dp.patch: It does no longer xfail * Wed Aug 28 2024 Ben Greiner <code@bnavigator.de> - Skip overflowing tests on 32-bit * Sun Aug 25 2024 Ben Greiner <code@bnavigator.de> - Add pandas-pr59353-np2eval.patch * gh#pandas-dev/pandas#59353 * gh#pandas-dev/pandas#58548 * Thu Jul 11 2024 Ben Greiner <code@bnavigator.de> - Add pandas-pr59175-matplotlib.patch -- gh#pandas-dev/pandas#59175 * Sun May 12 2024 Matej Cepl <mcepl@cepl.eu> - Add pandas-pr58269-pyarrow16xpass.patch (gh#pandas-dev/pandas!58269) - Add pandas-pr58720-xarray-dp.patch (gh#pandas-dev/pandas!58720), which makes pandas compatible with the modern xarray - Add pandas-pr58484-matplotlib.patch (gh#pandas-dev/pandas!58484), which makes pandas compatible with the modern matplotlib - Skip also test_plot_scatter_shape (gh#pandas-dev/pandas#58851) * Thu May 09 2024 Matej Cepl <mcepl@cepl.eu> - Skip build on Python 3.10 ... too many dependencies are missing. * Tue Apr 30 2024 Ben Greiner <code@bnavigator.de> - Update to 2.2.2 * Pandas 2.2.2 is now compatible with numpy 2.0 * Pandas 2.2.2 is the first version of pandas that is generally compatible with the upcoming numpy 2.0 release, and wheels for pandas 2.2.2 will work with both numpy 1.x and 2.x. One major caveat is that arrays created with numpy 2.0’s new StringDtype will convert to object dtyped arrays upon Series/DataFrame creation. Full support for numpy 2.0’s StringDtype is expected to land in pandas 3.0. * As usual please report any bugs discovered to our issue tracker [#]# Fixed regressions * DataFrame.__dataframe__() was producing incorrect data buffers when the a column’s type was a pandas nullable on with missing values (GH 56702) * DataFrame.__dataframe__() was producing incorrect data buffers when the a column’s type was a pyarrow nullable on with missing values (GH 57664) * Avoid issuing a spurious DeprecationWarning when a custom DataFrame or Series subclass method is called (GH 57553) * Fixed regression in precision of to_datetime() with string and unit input (GH 57051) [#]# Bug fixes * DataFrame.__dataframe__() was producing incorrect data buffers when the column’s type was nullable boolean (GH 55332) * DataFrame.__dataframe__() was showing bytemask instead of bitmask for 'string[pyarrow]' validity buffer (GH 57762) * DataFrame.__dataframe__() was showing non-null validity buffer (instead of None) 'string[pyarrow]' without missing values (GH 57761) * DataFrame.to_sql() was failing to find the right table when using the schema argument (GH 57539) - Remove obsolete python39 multibuild - Add pandas-pr58269-pyarrow16xpass.patch gh#pandas-dev/pandas#58269 * Mon Mar 04 2024 Ben Greiner <code@bnavigator.de> - No xarrary for python 3.9 anymore: Remove from pandas[all] and exclude pandas[computation]. Reenable testing to check it. It will be skipped automatically when python39 is dropped from Tumbleweed globally. - Fix 15.x build: requires newer compiler - Fix 15.x test skips: sle15_python_module_pythons needs to be declared earlier * Fri Feb 23 2024 Ben Greiner <code@bnavigator.de> - Update to 2.2.1 [#]# Enhancements * Added pyarrow pip extra so users can install pandas and pyarrow with pip with pip install pandas[pyarrow] (#54466) [#]# Fixed regressions * Fixed memory leak in `read_csv` (#57039) * Fixed performance regression in `Series.combine_first` (#55845) * Fixed regression causing overflow for near-minimum timestamps (#57150) * Fixed regression in `concat` changing long-standing behavior that always sorted the non-concatenation axis when the axis was a `DatetimeIndex` (#57006) * Fixed regression in `merge_ordered` raising TypeError for fill_method="ffill" and how="left" (#57010) * Fixed regression in `pandas.testing.assert_series_equal` defaulting to check_exact=True when checking the `Index` (#57067) * Fixed regression in `read_json` where an `Index` would be returned instead of a `RangeIndex` (#57429) * Fixed regression in `wide_to_long` raising an AttributeError for string columns (#57066) * Fixed regression in `.DataFrameGroupBy.idxmin`, `.DataFrameGroupBy.idxmax`, `.SeriesGroupBy.idxmin`, `.SeriesGroupBy.idxmax` ignoring the skipna argument (#57040) * Fixed regression in `.DataFrameGroupBy.idxmin`, `.DataFrameGroupBy.idxmax`, `.SeriesGroupBy.idxmin`, `.SeriesGroupBy.idxmax` where values containing the minimum or maximum value for the dtype could produce incorrect results (#57040) * Fixed regression in `CategoricalIndex.difference` raising KeyError when other contains null values other than NaN (#57318) * Fixed regression in `DataFrame.groupby` raising ValueError when grouping by a `Series` in some cases (#57276) * Fixed regression in `DataFrame.loc` raising IndexError for non-unique, masked dtype indexes where result has more than 10,000 rows (#57027) * Fixed regression in `DataFrame.loc` which was unnecessarily throwing "incompatible dtype warning" when expanding with partial row indexer and multiple columns (see PDEP6) (#56503) * Fixed regression in `DataFrame.map` with na_action="ignore" not being respected for NumPy nullable and `ArrowDtypes` (#57316) * Fixed regression in `DataFrame.merge` raising ValueError for certain types of 3rd-party extension arrays (#57316) * Fixed regression in `DataFrame.query` with all NaT column with object dtype (#57068) * Fixed regression in `DataFrame.shift` raising AssertionError for axis=1 and empty `DataFrame` (#57301) * Fixed regression in `DataFrame.sort_index` not producing a stable sort for a index with duplicates (#57151) * Fixed regression in `DataFrame.to_dict` with orient='list' and datetime or timedelta types returning integers (#54824) * Fixed regression in `DataFrame.to_json` converting nullable integers to floats (#57224) * Fixed regression in `DataFrame.to_sql` when method="multi" is passed and the dialect type is not Oracle (#57310) * Fixed regression in `DataFrame.transpose` with nullable extension dtypes not having F-contiguous data potentially causing exceptions when used (#57315) * Fixed regression in `DataFrame.update` emitting incorrect warnings about downcasting (#57124) * Fixed regression in `DataFrameGroupBy.idxmin`, `DataFrameGroupBy.idxmax`, `SeriesGroupBy.idxmin`, `SeriesGroupBy.idxmax` ignoring the skipna argument (#57040) * Fixed regression in `DataFrameGroupBy.idxmin`, `DataFrameGroupBy.idxmax`, `SeriesGroupBy.idxmin`, `SeriesGroupBy.idxmax` where values containing the minimum or maximum value for the dtype could produce incorrect results (#57040) * Fixed regression in `ExtensionArray.to_numpy` raising for non-numeric masked dtypes (#56991) * Fixed regression in `Index.join` raising TypeError when joining an empty index to a non-empty index containing mixed dtype values (#57048) * Fixed regression in `Series.astype` introducing decimals when converting from integer with missing values to string dtype (#57418) * Fixed regression in `Series.pct_change` raising a ValueError for an empty `Series` (#57056) * Fixed regression in `Series.to_numpy` when dtype is given as float and the data contains NaNs (#57121) * Fixed regression in addition or subtraction of `DateOffset` objects with millisecond components to datetime64 `Index`, `Series`, or `DataFrame` (#57529) [#]# Bug fixes * Fixed bug in `pandas.api.interchange.from_dataframe` which was raising for Nullable integers (#55069) * Fixed bug in `pandas.api.interchange.from_dataframe` which was raising for empty inputs (#56700) * Fixed bug in `pandas.api.interchange.from_dataframe` which wasn't converting columns names to strings (#55069) * Fixed bug in `DataFrame.__getitem__` for empty `DataFrame` with Copy-on-Write enabled (#57130) * Fixed bug in `PeriodIndex.asfreq` which was silently converting frequencies which are not supported as period frequencies instead of raising an error (#56945) [#]# Note * The DeprecationWarning that was raised when pandas was imported without PyArrow being installed has been removed. This decision was made because the warning was too noisy for too many users and a lot of feedback was collected about the decision to make PyArrow a required dependency. Pandas is currently considering the decision whether or not PyArrow should be added as a hard dependency in 3.0. Interested users can follow the discussion here. * Added the argument skipna to `DataFrameGroupBy.first`, `DataFrameGroupBy.last`, `SeriesGroupBy.first`, and `SeriesGroupBy.last`; achieving skipna=False used to be available via `DataFrameGroupBy.nth`, but the behavior was changed in pandas 2.0.0 (#57019) * Added the argument skipna to `Resampler.first`, `Resampler.last` (#57019) - Release notes for 2.2.0 * For full changelog see https://github.com/pandas-dev/pandas/blob/main/doc/source/whatsnew/v2.2.0.rst [#]# Enhancements * ADBC Driver support in to_sql and read_sql * Create a pandas Series based on one or more conditions * to_numpy for NumPy nullable and Arrow types converts to suitable NumPy dtype * Series.struct accessor for PyArrow structured data * Series.list accessor for PyArrow list data * Calamine engine for `read_excel` [#]# Notable bug fixes * `merge` and `DataFrame.join` now consistently follow documented sort behavior * `merge` and `DataFrame.join` no longer reorder levels when levels differ * Increased minimum versions for dependencies [#]# Deprecations * Chained assignment * Deprecate aliases M, Q, Y, etc. in favour of ME, QE, YE, etc. for offsets * Deprecated automatic downcasting - Simplify flavor test setup: obs can evaluate %{shrink:} now * Tue Feb 06 2024 Dirk Müller <dmueller@suse.com> - enable py312 testing, remove py39 testing * Fri Jan 12 2024 pgajdos@suse.com - have a possibility to not use pyarrow [bsc#1218592] * Wed Dec 13 2023 Ben Greiner <code@bnavigator.de> - Update to 2.1.4 [#]# Fixed regressions * Fixed regression when trying to read a pickled pandas DataFrame from pandas 1.3 (GH 55137) [#]# Bug fixes * Bug in Series constructor raising DeprecationWarning when index is a list of Series (GH 55228) * Bug in Series when trying to cast date-like string inputs to ArrowDtype of pyarrow.timestamp (GH 56266) * Bug in DataFrame.apply() where passing raw=True ignored args passed to the applied function (GH 55753) * Bug in Index.__getitem__() returning wrong result for Arrow dtypes and negative stepsize (GH 55832) * Fixed bug in to_numeric() converting to extension dtype for string[pyarrow_numpy] dtype (GH 56179) * Fixed bug in DataFrameGroupBy.min() and DataFrameGroupBy.max() not preserving extension dtype for empty object (GH 55619) * Fixed bug in DataFrame.__setitem__() casting Index with object-dtype to PyArrow backed strings when infer_string option is set (GH 55638) * Fixed bug in DataFrame.to_hdf() raising when columns have StringDtype (GH 55088) * Fixed bug in Index.insert() casting object-dtype to PyArrow backed strings when infer_string option is set (GH 55638) * Fixed bug in Series.__ne__() resulting in False for comparison between NA and string value for dtype="string[pyarrow_numpy]" (GH 56122) * Fixed bug in Series.mode() not keeping object dtype when infer_string is set (GH 56183) * Fixed bug in Series.reset_index() not preserving object dtype when infer_string is set (GH 56160) * Fixed bug in Series.str.split() and Series.str.rsplit() when pat=None for ArrowDtype with pyarrow.string (GH 56271) * Fixed bug in Series.str.translate() losing object dtype when string option is set (GH 56152) - Go back to Cython0, it has NOT been unpinned by upstream released version * https://github.com/pandas-dev/pandas/blob/v2.1.4/pyproject.toml#L8 * See also gh#jsonpickle/jsonpickle#460 * Fri Dec 01 2023 Steve Kowalik <steven.kowalik@suse.com> - Update to 2.1.3: * Reverted deprecation of fill_method=None in DataFrame.pct_change(), Series.pct_change(), DataFrameGroupBy.pct_change(), and SeriesGroupBy.pct_change(); the values 'backfill', 'bfill', 'pad', and 'ffill' are still deprecated * Fixed regressions + Fixed infinite recursion from operations that return a new object on some DataFrame subclasses + Fixed regression in DataFrame.join() where result has missing values and dtype is arrow backed string + Fixed regression in rolling() where non-nanosecond index or on column would produce incorrect results + Fixed regression in DataFrame.resample() which was extrapolating back to origin when origin was outside its bounds + Fixed regression in DataFrame.sort_index() which was not sorting correctly when the index was a sliced MultiIndex + Fixed regression in DataFrameGroupBy.agg() and SeriesGroupBy.agg() where if the option compute.use_numba was set to True, groupby methods not supported by the numba engine would raise a TypeError + Fixed performance regression with wide DataFrames, typically involving methods where all columns were accessed individually + Fixed regression in merge_asof() raising TypeError for by with datetime and timedelta dtypes + Fixed regression in read_parquet() when reading a file with a string column consisting of more than 2 GB of string data and using the "string" dtype + Fixed regression in DataFrame.to_sql() not roundtripping datetime columns correctly for sqlite when using detect_types + Fixed regression in construction of certain DataFrame or Series subclasses * Bug fixes + Bug in DatetimeIndex.diff() raising TypeError + Bug in Index.isin() raising for Arrow backed string and None value + Fix read_parquet() and read_feather() for CVE-2023-47248 + Fixed bug in DataFrameGroupBy reductions not preserving object dtype when infer_string is set + Fixed bug in SeriesGroupBy.value_counts() returning incorrect dtype for string columns + Fixed bug in Categorical.equals() if other has arrow backed string dtype + Fixed bug in DataFrame.__setitem__() not inferring string dtype for zero-dimensional array with infer_string=True + Fixed bug in DataFrame.idxmin() and DataFrame.idxmax() raising for arrow dtypes + Fixed bug in DataFrame.interpolate() raising incorrect error message + Fixed bug in Index.insert() raising when inserting None into Index with dtype="string[pyarrow_numpy]" + Fixed bug in Series.all() and Series.any() not treating missing values correctly for dtype="string[pyarrow_numpy]" + Fixed bug in Series.floordiv() for ArrowDtype + Fixed bug in Series.mode() not sorting values for arrow backed string dtype + Fixed bug in Series.rank() for string[pyarrow_numpy] dtype + Fixed bug in Series.str.extractall() for ArrowDtype dtype being converted to object + Fixed bug where PDEP-6 warning about setting an item of an incompatible dtype was being shown when creating a new conditional column + Silence Period[B] warnings introduced by GH 53446 during normal plotting activity + Fixed bug in Series constructor not inferring string dtype when NA is the first value and infer_string is set - Prepare for Python 3.12, include the flavor check. - Unpin Cython, upstream has moved onto 3. * Sat Oct 14 2023 Bernhard Wiedemann <bwiedemann@suse.com> - Fix random build failures * Sat Oct 07 2023 Ben Greiner <code@bnavigator.de> - Update to 2.1.1 [#]# Fixed regressions * Fixed regression in concat() when DataFrame ‘s have two different extension dtypes (GH 54848) * Fixed regression in merge() when merging over a PyArrow string index (GH 54894) * Fixed regression in read_csv() when usecols is given and dtypes is a dict for engine="python" (GH 54868) * Fixed regression in read_csv() when delim_whitespace is True (GH 54918, GH 54931) * Fixed regression in GroupBy.get_group() raising for axis=1 (GH 54858) * Fixed regression in DataFrame.__setitem__() raising AssertionError when setting a Series with a partial MultiIndex (GH 54875) * Fixed regression in DataFrame.filter() not respecting the order of elements for filter (GH 54980) * Fixed regression in DataFrame.to_sql() not roundtripping datetime columns correctly for sqlite (GH 54877) * Fixed regression in DataFrameGroupBy.agg() when aggregating a DataFrame with duplicate column names using a dictionary (GH 55006) * Fixed regression in MultiIndex.append() raising when appending overlapping IntervalIndex levels (GH 54934) * Fixed regression in Series.drop_duplicates() for PyArrow strings (GH 54904) * Fixed regression in Series.interpolate() raising when fill_value was given (GH 54920) * Fixed regression in Series.value_counts() raising for numeric data if bins was specified (GH 54857) * Fixed regression in comparison operations for PyArrow backed columns not propagating exceptions correctly (GH 54944) * Fixed regression when comparing a Series with datetime64 dtype with None (GH 54870) [#]# Bug fixes * Fixed bug for ArrowDtype raising NotImplementedError for fixed-size list (GH 55000) * Fixed bug in DataFrame.stack() with future_stack=True and columns a non-MultiIndex consisting of tuples (GH 54948) * Fixed bug in Series.dt.tz() with ArrowDtype where a string was returned instead of a tzinfo object (GH 55003) * Fixed bug in Series.pct_change() and DataFrame.pct_change() showing unnecessary FutureWarning (GH 54981) [#]# Other * Reverted the deprecation that disallowed Series.apply() returning a DataFrame when the passed-in callable returns a Series object (GH 52116) - Drop pandas-pr55073-pyarrow13.patch merged upstream * Sun Sep 10 2023 Ben Greiner <code@bnavigator.de> - Fix test failures with pyarrow 13 * Add pandas-pr55073-pyarrow13.patch * gh#pandas-dev/pandas#55073 * gh#pandas-dev/pandas#55048 * gh#pandas-dev/pandas#55020 * Tue Sep 05 2023 Ben Greiner <code@bnavigator.de> - Use git cloned archive gh#pandas-dev/pandas#54907 * Thu Aug 31 2023 Ben Greiner <code@bnavigator.de> - Update to 2.1.0 * https://pandas.pydata.org/pandas-docs/version/2.1.0/whatsnew/v2.1.0.html * Avoid NumPy object dtype for strings by default * DataFrame reductions preserve extension dtypes * Copy-on-Write improvements * New DataFrame.map() method and support for ExtensionArrays * New implementation of DataFrame.stack() * Other minor enhancements (see link above) [#]# Backwards incompatible API changes * pandas 2.1.0 supports Python 3.9 and higher * Increased minimum versions for numpy 1.22.3 and some optional dependencies * arrays.PandasArray has been renamed NumpyExtensionArray and the attached dtype name changed from PandasDtype to NumpyEADtype; importing PandasArray still works until the next major version (GH 53694) [#]# Deprecations * Deprecated silent upcasting in setitem-like Series operations * Deprecated parsing datetimes with mixed time zones * Other Deprecation (see link above) [#]# More * Performance Improvements (see link above) * Bug fixes (see linkl above) - Switch to meson build system * Sun Aug 13 2023 Dirk Müller <dmueller@suse.com> - update to 2.0.3: * Bug in Timestamp.weekday`() was returning incorrect results before '0000-02-29' * Fixed performance regression in merging on datetime-like columns * Fixed regression when DataFrame.to_string() creates extra space for string dtypes * Bug in DataFrame.convert_dtype() and Series.convert_dtype() when trying to convert ArrowDtype with dtype_backend="nullable_numpy" * Bug in RangeIndex.union() when using sort=True with another RangeIndex * Bug in Series.reindex() when expanding a non-nanosecond datetime or timedelta * Bug in read_csv() when defining dtype with bool[pyarrow] for the "c" and "python" engines * Bug in Series.str.split() and Series.str.rsplit() with expand=True * Bug in indexing methods (e.g. DataFrame.__getitem__()) where taking the entire DataFrame/Series would raise an OverflowError when Copy on Write was enabled the length of the array was over the maximum size a 32-bit integer can hold * Bug when constructing a DataFrame with columns of an ArrowDtype with a pyarrow.dictionary type that reindexes the data * Bug when indexing a DataFrame or Series with an Index with a timestamp ArrowDtype would raise an AttributeError - drop pandas-fix-tests.patch (upstream) * Thu Jun 22 2023 Guillaume GARDET <guillaume.gardet@opensuse.org> - Fix tests on aarch64: * pandas-fix-tests.patch * Sun Jun 11 2023 Johannes Kastl <kastl@b1-systems.de> - do not use %elif for python-numpy dependency condition * Wed Jun 07 2023 Ben Greiner <code@bnavigator.de> - Increase minimum memory constraints for tests * Sat May 27 2023 Ben Greiner <code@bnavigator.de> - Update to 2.0.2 [#]# Fixed regressions * Fixed performance regression in GroupBy.apply() (GH53195) * Fixed regression in merge() on Windows when dtype is np.intc (GH52451) * Fixed regression in read_sql() dropping columns with duplicated column names (GH53117) * Fixed regression in DataFrame.loc() losing MultiIndex name when enlarging object (GH53053) * Fixed regression in DataFrame.to_string() printing a backslash at the end of the first row of data, instead of headers, when the DataFrame doesn’t fit the line width (GH53054) * Fixed regression in MultiIndex.join() returning levels in wrong order (GH53093) [#]# Bug fixes * Bug in arrays.ArrowExtensionArray incorrectly assigning dict instead of list for .type with pyarrow.map_ and raising a NotImplementedError with pyarrow.struct (GH53328) * Bug in api.interchange.from_dataframe() was raising IndexError on empty categorical data (GH53077) * Bug in api.interchange.from_dataframe() was returning DataFrame’s of incorrect sizes when called on slices (GH52824) * Bug in api.interchange.from_dataframe() was unnecessarily raising on bitmasks (GH49888) * Bug in merge() when merging on datetime columns on different resolutions (GH53200) * Bug in read_csv() raising OverflowError for engine="pyarrow" and parse_dates set (GH53295) * Bug in to_datetime() was inferring format to contain "%H" instead of "%I" if date contained “AM” / “PM” tokens (GH53147) * Bug in DataFrame.convert_dtypes() ignores convert_* keywords when set to False dtype_backend="pyarrow" (GH52872) * Bug in DataFrame.convert_dtypes() losing timezone for tz-aware dtypes and dtype_backend="pyarrow" (GH53382) * Bug in DataFrame.sort_values() raising for PyArrow dictionary dtype (GH53232) * Bug in Series.describe() treating pyarrow-backed timestamps and timedeltas as categorical data (GH53001) * Bug in Series.rename() not making a lazy copy when Copy-on-Write is enabled when a scalar is passed to it (GH52450) * Bug in pd.array() raising for NumPy array and pa.large_string or pa.large_binary (GH52590) * Bug in DataFrame.__getitem__() not preserving dtypes for MultiIndex partial keys (GH51895) [#]# Other * Raised a better error message when calling Series.dt.to_pydatetime() with ArrowDtype with pyarrow.date32 or pyarrow.date64 type (GH52812) - Release to 2.0.1 [#]# Fixed regressions * Fixed regression for subclassed Series when constructing from a dictionary (GH52445) * Fixed regression in SeriesGroupBy.agg() failing when grouping with categorical data, multiple groupings, as_index=False, and a list of aggregations (GH52760) * Fixed regression in DataFrame.pivot() changing Index name of input object (GH52629) * Fixed regression in DataFrame.resample() raising on a DataFrame with no columns (GH52484) * Fixed regression in DataFrame.sort_values() not resetting index when DataFrame is already sorted and ignore_index=True (GH52553) * Fixed regression in MultiIndex.isin() raising TypeError for Generator (GH52568) * Fixed regression in Series.describe() showing RuntimeWarning for extension dtype Series with one element (GH52515) * Fixed regression when adding a new column to a DataFrame when the DataFrame.columns was a RangeIndex and the new key was hashable but not a scalar (GH52652) [#]# Bug fixes * Bug in Series.dt.days that would overflow int32 number of days (GH52391) * Bug in arrays.DatetimeArray constructor returning an incorrect unit when passed a non-nanosecond numpy datetime array (GH52555) * Bug in ArrowExtensionArray with duration dtype overflowing when constructed from data containing numpy NaT (GH52843) * Bug in Series.dt.round() when passing a freq of equal or higher resolution compared to the Series would raise a ZeroDivisionError (GH52761) * Bug in Series.median() with ArrowDtype returning an approximate median (GH52679) * Bug in api.interchange.from_dataframe() was unnecessarily raising on categorical dtypes (GH49889) * Bug in api.interchange.from_dataframe() was unnecessarily raising on large string dtypes (GH52795) * Bug in pandas.testing.assert_series_equal() where check_dtype=False would still raise for datetime or timedelta types with different resolutions (GH52449) * Bug in read_csv() casting PyArrow datetimes to NumPy when dtype_backend="pyarrow" and parse_dates is set causing a performance bottleneck in the process (GH52546) * Bug in to_datetime() and to_timedelta() when trying to convert numeric data with a ArrowDtype (GH52425) * Bug in to_numeric() with errors='coerce' and dtype_backend='pyarrow' with ArrowDtype data (GH52588) * Bug in ArrowDtype.__from_arrow__() not respecting if dtype is explicitly given (GH52533) * Bug in DataFrame.describe() not respecting ArrowDtype in include and exclude (GH52570) * Bug in DataFrame.max() and related casting different Timestamp resolutions always to nanoseconds (GH52524) * Bug in Series.describe() not returning ArrowDtype with pyarrow.float64 type with numeric data (GH52427) * Bug in Series.dt.tz_localize() incorrectly localizing timestamps with ArrowDtype (GH52677) * Bug in arithmetic between np.datetime64 and np.timedelta64 NaT scalars with units always returning nanosecond resolution (GH52295) * Bug in logical and comparison operations between ArrowDtype and numpy masked types (e.g. "boolean") (GH52625) * Fixed bug in merge() when merging with ArrowDtype one one and a NumPy dtype on the other side (GH52406) * Fixed segfault in Series.to_numpy() with null[pyarrow] dtype (GH52443) [#]# Other * DataFrame created from empty dicts had columns of dtype object. It is now a RangeIndex (GH52404) * Series created from empty dicts had index of dtype object. It is now a RangeIndex (GH52404) * Implemented Series.str.split() and Series.str.rsplit() for ArrowDtype with pyarrow.string (GH52401) * Implemented most str accessor methods for ArrowDtype with pyarrow.string (GH52401) * Supplying a non-integer hashable key that tests False in api.types.is_scalar() now raises a KeyError for RangeIndex.get_loc(), like it does for Index.get_loc(). Previously it raised an InvalidIndexError (GH52652). - Release to 2.0.0 [#]# Enhancements * Installing optional dependencies with pip extras * Index can now hold numpy numeric dtypes * Argument dtype_backend , to return pyarrow-backed or numpy-backed nullable dtypes * Copy-on-Write improvements * Other enhancements, see https://pandas.pydata.org/pandas-docs/version/2.0.2/whatsnew/v2.0.0.html#other-enhancements [#]# Notable bug fixes * DataFrameGroupBy.cumsum() and DataFrameGroupBy.cumprod() overflow instead of lossy casting to float * DataFrameGroupBy.nth() and SeriesGroupBy.nth() now behave as filtrations [#]# Backwards incompatible API changes * Construction with datetime64 or timedelta64 dtype with unsupported resolution * Value counts sets the resulting name to count * Disallow astype conversion to non-supported datetime64/timedelta64 dtypes * UTC and fixed-offset timezones default to standard-library tzinfo objects * Empty DataFrames/Series will now default to have a RangeIndex * DataFrame to LaTeX has a new render engine * Increased minimum versions for dependencies * Datetimes are now parsed with a consistent format * Other API changes, see https://pandas.pydata.org/pandas-docs/version/2.0.2/whatsnew/v2.0.0.html#other-api-changes [#]# Deprecations [#]# Removal of prior version deprecations/changes [#]# Performance improvements [#]# Bug fixes - Drop python38 test flavor and start testing python311 which has been missing since. * Mon May 08 2023 Johannes Kastl <kastl@b1-systems.de> - add sle15_python_module_pythons * Wed Feb 08 2023 Arun Persaud <arun@gmx.de> - specfile: * update copyright year * remove pandas-pr49886-fix-numpy-deprecations.patch, implemented upstreams - update to version 1.5.3: * Fixed regressions + Fixed performance regression in Series.isin() when values is empty (GH49839) + Fixed regression in DataFrame.memory_usage() showing unnecessary FutureWarning when DataFrame is empty (GH50066) + Fixed regression in DataFrameGroupBy.transform() when used with as_index=False (GH49834) + Enforced reversion of color as an alias for c and size as an alias for s in function DataFrame.plot.scatter() (GH49732) + Fixed regression in SeriesGroupBy.apply() setting a name attribute on the result if the result was a DataFrame (GH49907) + Fixed performance regression in setting with the at() indexer (GH49771) + Fixed regression in the methods apply, agg, and transform when used with NumPy functions that informed users to supply numeric_only=True if the operation failed on non-numeric dtypes; such columns must be dropped prior to using these methods (GH50538) + Fixed regression in to_datetime() raising ValueError when parsing array of float containing np.nan (GH50237) * Bug fixes + Bug in the Copy-on-Write implementation losing track of views when indexing a DataFrame with another DataFrame (GH50630) + Bug in Styler.to_excel() leading to error when unrecognized border-style (e.g. "hair") provided to Excel writers (GH48649) + Bug in Series.quantile() emitting warning from NumPy when Series has only NA values (GH50681) + Bug when chaining several Styler.concat() calls, only the last styler was concatenated (GH49207) + Fixed bug when instantiating a DataFrame subclass inheriting from typing.Generic that triggered a UserWarning on python 3.11 (GH49649) + Bug in pivot_table() with NumPy 1.24 or greater when the DataFrame columns has nested elements (GH50342) + Bug in pandas.testing.assert_series_equal() (and equivalent assert_ functions) when having nested data and using numpy >= 1.25 (GH50360) * Other + Note: If you are using DataFrame.to_sql(), read_sql(), read_sql_table(), or read_sql_query() with SQLAlchemy 1.4.46 or greater, you may see a sqlalchemy.exc.RemovedIn20Warning. These warnings can be safely ignored for the SQLAlchemy 1.4.x releases as pandas works toward compatibility with SQLAlchemy 2.0. + Reverted deprecation (GH45324) of behavior of Series.__getitem__() and Series.__setitem__() slicing with an integer Index; this will remain positional (GH49612) + A FutureWarning raised when attempting to set values inplace with DataFrame.loc() or DataFrame.iloc() has been changed to a DeprecationWarning (GH48673) * Fri Dec 23 2022 Ben Greiner <code@bnavigator.de> - Update to version 1.5.2 [#]# Fixed regressions * Fixed regression in MultiIndex.join() for extension array dtypes (GH49277) * Fixed regression in Series.replace() raising RecursionError with numeric dtype and when specifying value=None (GH45725) * Fixed regression in arithmetic operations for DataFrame with MultiIndex columns with different dtypes (GH49769) * Fixed regression in DataFrame.plot() preventing Colormap instance from being passed using the colormap argument if Matplotlib 3.6+ is used (GH49374) * Fixed regression in date_range() returning an invalid set of periods for CustomBusinessDay frequency and start date with timezone (GH49441) * Fixed performance regression in groupby operations (GH49676) * Fixed regression in Timedelta constructor returning object of wrong type when subclassing Timedelta (GH49579) [#]# Bug fixes * Bug in the Copy-on-Write implementation losing track of views in certain chained indexing cases (GH48996) * Fixed memory leak in Styler.to_excel() (GH49751) [#]# Other * Reverted color as an alias for c and size as an alias for s in function DataFrame.plot.scatter() (GH49732) - Add pandas-pr49886-fix-numpy-deprecations.patch * gh#pandas-dev/pandas#49887 - Move to PEP518 build * Sat Oct 22 2022 Arun Persaud <arun@gmx.de> - update to version 1.5.1: * Fixed regressions + Fixed Regression in Series.__setitem__() casting None to NaN for object dtype (GH48665) + Fixed Regression in DataFrame.loc() when setting values as a DataFrame with all True indexer (GH48701) + Regression in read_csv() causing an EmptyDataError when using an UTF-8 file handle that was already read from (GH48646) + Regression in to_datetime() when utc=True and arg contained timezone naive and aware arguments raised a ValueError (GH48678) + Fixed regression in DataFrame.loc() raising FutureWarning when setting an empty DataFrame (GH48480) + Fixed regression in DataFrame.describe() raising TypeError when result contains NA (GH48778) + Fixed regression in DataFrame.plot() ignoring invalid colormap for kind="scatter" (GH48726) + Fixed regression in MultiIndex.values`() resetting freq attribute of underlying Index object (GH49054) + Fixed performance regression in factorize() when na_sentinel is not None and sort=False (GH48620) + Fixed regression causing an AttributeError during warning emitted if the provided table name in DataFrame.to_sql() and the table name actually used in the database do not match (GH48733) + Fixed regression in to_datetime() when arg was a date string with nanosecond and format contained %f would raise a ValueError (GH48767) + Fixed regression in assert_frame_equal() raising for MultiIndex with Categorical and check_like=True (GH48975) + Fixed regression in DataFrame.fillna() replacing wrong values for datetime64[ns] dtype and inplace=True (GH48863) + Fixed DataFrameGroupBy.size() not returning a Series when axis=1 (GH48738) + Fixed Regression in DataFrameGroupBy.apply() when user defined function is called on an empty dataframe (GH47985) + Fixed regression in DataFrame.apply() when passing non-zero axis via keyword argument (GH48656) + Fixed regression in Series.groupby() and DataFrame.groupby() when the grouper is a nullable data type (e.g. Int64) or a PyArrow-backed string array, contains null values, and dropna=False (GH48794) + Fixed performance regression in Series.isin() with mismatching dtypes (GH49162) + Fixed regression in DataFrame.to_parquet() raising when file name was specified as bytes (GH48944) + Fixed regression in ExcelWriter where the book attribute could no longer be set; however setting this attribute is now deprecated and this ability will be removed in a future version of pandas (GH48780) + Fixed regression in DataFrame.corrwith() when computing correlation on tied data with method="spearman" (GH48826) * Bug fixes + Bug in Series.__getitem__() not falling back to positional for integer keys and boolean Index (GH48653) + Bug in DataFrame.to_hdf() raising AssertionError with boolean index (GH48667) + Bug in assert_index_equal() for extension arrays with non matching NA raising ValueError (GH48608) + Bug in DataFrame.pivot_table() raising unexpected FutureWarning when setting datetime column as index (GH48683) + Bug in DataFrame.sort_values() emitting unnecessary FutureWarning when called on DataFrame with boolean sparse columns (GH48784) + Bug in arrays.ArrowExtensionArray with a comparison operator to an invalid object would not raise a NotImplementedError (GH48833) * Other + Avoid showing deprecated signatures when introspecting functions with warnings about arguments becoming keyword-only (GH48692) * Mon Sep 19 2022 Arun Persaud <arun@gmx.de> - specfile: * update required versions - update to version 1.5.0: * long changelog, full version available at https://pandas.pydata.org/pandas-docs/stable/whatsnew/v1.5.0.html# * Sat Sep 10 2022 Arun Persaud <arun@gmx.de> - specfile: * update required version - update to version 1.4.4: * Fixed regressions + Fixed regression in DataFrame.fillna() not working on a DataFrame with a MultiIndex (GH47649) + Fixed regression in taking NULL objects from a DataFrame causing a segmentation violation. These NULL values are created by numpy.empty_like() (GH46848) + Fixed regression in concat() materializing the Index during sorting even if the Index was already sorted (GH47501) + Fixed regression in concat() or merge() handling of all-NaN ExtensionArrays with custom attributes (GH47762) + Fixed regression in calling bitwise numpy ufuncs (for example, np.bitwise_and) on Index objects (GH46769) + Fixed regression in cut() when using a datetime64 IntervalIndex as bins (GH46218) + Fixed regression in DataFrame.select_dtypes() where include="number" included BooleanDtype (GH46870) + Fixed regression in DataFrame.loc() raising error when indexing with a NamedTuple (GH48124) + Fixed regression in DataFrame.loc() not updating the cache correctly after values were set (GH47867) + Fixed regression in DataFrame.loc() not aligning index in some cases when setting a DataFrame (GH47578) + Fixed regression in DataFrame.loc() setting a length-1 array like value to a single value in the DataFrame (GH46268) + Fixed regression when slicing with DataFrame.loc() with DatetimeIndex with a DateOffset object for its freq (GH46671) + Fixed regression in setting None or non-string value into a string-dtype Series using a mask (GH47628) + Fixed regression in updating a DataFrame column through Series __setitem__ (using chained assignment) not updating column values inplace and using too much memory (GH47172) + Fixed regression in DataFrame.select_dtypes() returning a view on the original DataFrame (GH48090) + Fixed regression using custom Index subclasses (for example, used in xarray) with reset_index() or Index.insert() (GH47071) + Fixed regression in intersection() when the DatetimeIndex has dates crossing daylight savings time (GH46702) + Fixed regression in merge() throwing an error when passing a Series with a multi-level name (GH47946) + Fixed regression in DataFrame.eval() creating a copy when updating inplace (GH47449) + Fixed regression where getting a row using DataFrame.iloc() with SparseDtype would raise (GH46406) * Bug fixes + The FutureWarning raised when passing arguments (other than filepath_or_buffer) as positional in read_csv() is now raised at the correct stacklevel (GH47385) + Bug in DataFrame.to_sql() when method was a callable that did not return an int and would raise a TypeError (GH46891) + Bug in DataFrameGroupBy.value_counts() where subset had no effect (GH46383) + Bug when getting values with DataFrame.loc() with a list of keys causing an internal inconsistency that could lead to a disconnect between frame.at[x, y] vs frame[y].loc[x] (GH22372) + Bug in the Series.dt.strftime() accessor return a float instead of object dtype Series for all-NaT input, which also causes a spurious deprecation warning (GH45858) * Other + The minimum version of Cython needed to compile pandas is now 0.29.32 (GH47978) * Sat Jul 09 2022 Arun Persaud <arun@gmx.de> - update to version 1.4.3: * Behavior of concat with empty or all-NA DataFrame columns The behavior change in version 1.4.0 to stop ignoring the data type of empty or all-NA columns with float or object dtype in concat() (Ignoring dtypes in concat with empty or all-NA columns) has been reverted (GH45637). * Fixed regressions + Fixed regression in DataFrame.replace() when the replacement value was explicitly None when passed in a dictionary to to_replace also casting other columns to object dtype even when there were no values to replace (GH46634) + Fixed regression in DataFrame.to_csv() raising error when DataFrame contains extension dtype categorical column (GH46297, GH46812) + Fixed regression in representation of dtypes attribute of MultiIndex (GH46900) + Fixed regression when setting values with DataFrame.loc() updating RangeIndex when index was set as new column and column was updated afterwards (GH47128) + Fixed regression in DataFrame.fillna() and DataFrame.update() creating a copy when updating inplace (GH47188) + Fixed regression in DataFrame.nsmallest() led to wrong results when the sorting column has np.nan values (GH46589) + Fixed regression in read_fwf() raising ValueError when widths was specified with usecols (GH46580) + Fixed regression in concat() not sorting columns for mixed column names (GH47127) + Fixed regression in Groupby.transform() and Groupby.agg() failing with engine="numba" when the index was a MultiIndex (GH46867) + Fixed regression in NaN comparison for Index operations where the same object was compared (GH47105) + Fixed regression is Styler.to_latex() and Styler.to_html() where buf failed in combination with encoding (GH47053) + Fixed regression in read_csv() with index_col=False identifying first row as index names when header=None (GH46955) + Fixed regression in DataFrameGroupBy.agg() when used with list-likes or dict-likes and axis=1 that would give incorrect results; now raises NotImplementedError (GH46995) + Fixed regression in DataFrame.resample() and DataFrame.rolling() when used with list-likes or dict-likes and axis=1 that would raise an unintuitive error message; now raises NotImplementedError (GH46904) + Fixed regression in testing.assert_index_equal() when check_order=False and Index has extension or object dtype (GH47207) + Fixed regression in read_excel() returning ints as floats on certain input sheets (GH46988) + Fixed regression in DataFrame.shift() when axis is columns and fill_value is absent, freq is ignored (GH47039) + Fixed regression in DataFrame.to_json() causing a segmentation violation when DataFrame is created with an index parameter of the type PeriodIndex (GH46683) * Bug fixes + Bug in pandas.eval(), DataFrame.eval() and DataFrame.query() where passing empty local_dict or global_dict was treated as passing None (GH47084) + Most I/O methods no longer suppress OSError and ValueError when closing file handles (GH47136) * Other + The minimum version of Cython needed to compile pandas is now 0.29.30 (GH41935) * Tue Apr 05 2022 Ben Greiner <code@bnavigator.de> - Update to version 1.4.2 * Fixed regression in DataFrame.drop() and Series.drop() when Index had extension dtype and duplicates (GH45860) * Fixed regression in read_csv() killing python process when invalid file input was given for engine="c" (GH45957) * Fixed memory performance regression in Series.fillna() when called on a DataFrame column with inplace=True (GH46149) * Provided an alternative solution for passing custom Excel formats in Styler.to_excel(), which was a regression based on stricter CSS validation. Examples available in the documentation for Styler.format() (GH46152) * Fixed regression in DataFrame.replace() when a replacement value was also a target for replacement (GH46306) * Fixed regression in DataFrame.replace() when the replacement value was explicitly None when passed in a dictionary to to_replace (GH45601, GH45836) * Fixed regression when setting values with DataFrame.loc() losing MultiIndex names if DataFrame was empty before (GH46317) * Fixed regression when rendering boolean datatype columns with Styler() (GH46384) * Fixed regression in Groupby.rolling() with a frequency window that would raise a ValueError even if the datetimes within each group were monotonic (GH46061) * Fix some cases for subclasses that define their _constructor properties as general callables (GH46018) * Fixed “longtable” formatting in Styler.to_latex() when column_format is given in extended format (GH46037) * Fixed incorrect rendering in Styler.format() with hyperlinks="html" when the url contains a colon or other special characters (GH46389) * Improved error message in Rolling when window is a frequency and NaT is in the rolling axis (GH46087) - Copy back the installed package into the source tree * mimics upstreams test setup of an editable install * avoids conftest.py collection errors with pytest 7 * Sat Feb 12 2022 Arun Persaud <arun@gmx.de> - update to version 1.4.1: * Fixed regressions + Regression in Series.mask() with inplace=True and PeriodDtype and an incompatible other coercing to a common dtype instead of raising (GH45546) + Regression in assert_frame_equal() not respecting check_flags=False (GH45554) + Regression in DataFrame.loc() raising ValueError when indexing (getting values) on a MultiIndex with one level (GH45779) + Regression in Series.fillna() with downcast=False incorrectly downcasting object dtype (GH45603) + Regression in api.types.is_bool_dtype() raising an AttributeError when evaluating a categorical Series (GH45615) + Regression in DataFrame.iat() setting values leading to not propagating correctly in subsequent lookups (GH45684) + Regression when setting values with DataFrame.loc() losing Index name if DataFrame was empty before (GH45621) + Regression in join() with overlapping IntervalIndex raising an InvalidIndexError (GH45661) + Regression when setting values with Series.loc() raising with all False indexer and Series on the right hand side (GH45778) + Regression in read_sql() with a DBAPI2 connection that is not an instance of sqlite3.Connection incorrectly requiring SQLAlchemy be installed (GH45660) + Regression in DateOffset when constructing with an integer argument with no keywords (e.g. pd.DateOffset(n)) would behave like datetime.timedelta(days=0) (GH45643, GH45890) * Bug fixes + Fixed segfault in DataFrame.to_json() when dumping tz-aware datetimes in Python 3.10 (GH42130) + Stopped emitting unnecessary FutureWarning in DataFrame.sort_values() with sparse columns (GH45618) + Fixed window aggregations in DataFrame.rolling() and Series.rolling() to skip over unused elements (GH45647) + Fixed builtin highlighters in Styler to be responsive to NA with nullable dtypes (GH45804) + Bug in apply() with axis=1 raising an erroneous ValueError (GH45912) * Other + Reverted performance speedup of DataFrame.corr() for method=pearson to fix precision regression (GH45640, GH42761) * Tue Jan 25 2022 Ben Greiner <code@bnavigator.de> - Skip more tests on non-intel architectures boo#1167730 * Sun Jan 23 2022 Ben Greiner <code@bnavigator.de> - Update to version 1.4.0 * https://pandas.pydata.org/docs/whatsnew/v1.4.0.html * Enhancements - Improved warning messages - Index can hold arbitrary ExtensionArrays - Enhancements in Styler - Multi-threaded CSV reading with a new CSV Engine based on pyarrow - Rank function for rolling and expanding windows - Groupby positional indexing - DataFrame.from_dict and DataFrame.to_dict have new 'tight' option * Notable bug fixes - Inconsistent date string parsing - Ignoring dtypes in concat with empty or all-NA columns - Null-values are no longer coerced to NaN-value in value_counts and mode - mangle_dupe_cols in read_csv no longer renames unique columns conflicting with target names - unstack and pivot_table no longer raises ValueError for result that would exceed int32 limit - groupby.apply consistent transform detection * API changes - Index.get_indexer_for() no longer accepts keyword arguments (other than target); in the past these would be silently ignored if the index was not unique (GH42310) - Change in the position of the min_rows argument in DataFrame.to_string() due to change in the docstring (GH44304) - Reduction operations for DataFrame or Series now raising a ValueError when None is passed for skipna (GH44178) - read_csv() and read_html() no longer raising an error when one of the header rows consists only of Unnamed: columns (GH13054) - Changed the name attribute of several holidays in USFederalHolidayCalendar to match official federal holiday names. * Deprecations - Deprecated Int64Index, UInt64Index & Float64Index - Deprecated Frame.append and Series.append - Split out test runs into separate flavors, optimize memory usage in pytest-xdist runs * Tue Jan 04 2022 Ben Greiner <code@bnavigator.de> - Update to version 1.3.5 * Fixed regression in Series.equals() when comparing floats with dtype object to None (GH44190) * Fixed regression in merge_asof() raising error when array was supplied as join key (GH42844) * Fixed regression when resampling DataFrame with DateTimeIndex with empty groups and uint8, uint16 or uint32 columns incorrectly raising RuntimeError (GH43329) * Fixed regression in creating a DataFrame from a timezone-aware Timestamp scalar near a Daylight Savings Time transition (GH42505) * Fixed performance regression in read_csv() (GH44106) * Fixed regression in Series.duplicated() and Series.drop_duplicates() when Series has Categorical dtype with boolean categories (GH44351) * Fixed regression in GroupBy.sum() with timedelta64[ns] dtype containing NaT failing to treat that value as NA (GH42659) * Fixed regression in RollingGroupby.cov() and RollingGroupby.corr() when other had the same shape as each group would incorrectly return superfluous groups in the result (GH42915)
/usr/share/doc/packages/python312-pandas-performance /usr/share/doc/packages/python312-pandas-performance/README.md /usr/share/licenses/python312-pandas-performance /usr/share/licenses/python312-pandas-performance/LICENSE
Generated by rpm2html 1.8.1
Fabrice Bellet, Thu Oct 23 22:37:43 2025