v7.0

r-devulap released this 19 May 19:54
8a72081

Highlights

OpenMP Parallelization:

The qsort, argsort and keyvalue_qsort routines now support multi-threading with OpenMP. This speeds up sorting of medium to large arrays by 3-4x on both AVX-512 and AVX2. OpenMP is not enabled by default and needs to be configured with Meson at build time. Please refer to the README for details.

This feature has been contributed upstream to NumPy and is expected to be included in the upcoming 2.3.0 release. As with x86-simd-sort, OpenMP support in NumPy is not enabled by default; you will need to configure the Meson build system to enable OpenMP when building NumPy.
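These notes do not describe how the parallelization is structured internally. As a rough illustration of the general idea only, the sketch below sorts the two halves of a quicksort partition as independent OpenMP tasks and falls back to a serial sort for small ranges. The names `task_qsort`, `parallel_sort` and `serial_cutoff` are hypothetical and are not part of x86-simd-sort; `std::sort` stands in for the vectorized serial kernel.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical helper, NOT the x86-simd-sort implementation.
// Sorts [first, last); sub-ranges larger than serial_cutoff are split
// and handed to OpenMP tasks, smaller ones are sorted serially.
template <typename T>
void task_qsort(T *first, T *last, std::ptrdiff_t serial_cutoff)
{
    const std::ptrdiff_t n = last - first;
    if (n <= serial_cutoff) {
        std::sort(first, last); // stand-in for a SIMD serial sort
        return;
    }
    const T pivot = first[n / 2];
    // Three-way partition: [first, lt) < pivot, [lt, gt) == pivot, [gt, last) > pivot.
    // The middle block is never empty, so both recursive ranges shrink.
    T *lt = std::partition(first, last,
                           [&pivot](const T &x) { return x < pivot; });
    T *gt = std::partition(lt, last,
                           [&pivot](const T &x) { return !(pivot < x); });

    // Each half becomes a task; pointers are firstprivate by default.
    #pragma omp task
    task_qsort(first, lt, serial_cutoff);
    #pragma omp task
    task_qsort(gt, last, serial_cutoff);
    #pragma omp taskwait
}

// Hypothetical entry point: one thread seeds the recursion, the rest of
// the team picks up the generated tasks.
template <typename T>
void parallel_sort(std::vector<T> &v, std::ptrdiff_t serial_cutoff = 1 << 14)
{
    #pragma omp parallel
    #pragma omp single nowait
    task_qsort(v.data(), v.data() + v.size(), serial_cutoff);
}
```

When compiled without OpenMP support the pragmas are ignored and the code runs serially. The cutoff keeps task overhead from dominating on small ranges, which fits the note above that the gains apply to medium and large arrays.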

Miscellaneous

  • Accelerated qsort, qselect and partial_qsort for _Float16 on ICX.
  • Resolved a performance regression for 16-bit data types caused by the compiler dynamically allocating constant arrays.
  • Improved argsort performance for already sorted arrays by adding early detection.
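The early-detection item can be pictured with a minimal sketch, assuming the check is a single linear pass over the input before any sorting work starts. The function name `argsort_with_early_exit` is hypothetical and this is not the library's code; it also assumes the input contains no NaN values.

```cpp
#include <algorithm>
#include <cstddef>
#include <numeric>
#include <vector>

// Hypothetical helper, NOT the x86-simd-sort implementation.
// Returns the indices that would sort v in ascending order.
template <typename T>
std::vector<std::size_t> argsort_with_early_exit(const std::vector<T> &v)
{
    std::vector<std::size_t> idx(v.size());
    std::iota(idx.begin(), idx.end(), std::size_t{0}); // 0, 1, 2, ...

    // Early detection: if the input is already sorted, the identity
    // permutation is the answer and the sort can be skipped entirely.
    if (std::is_sorted(v.begin(), v.end())) {
        return idx;
    }

    // General case: sort the indices by the values they point to.
    std::sort(idx.begin(), idx.end(),
              [&v](std::size_t a, std::size_t b) { return v[a] < v[b]; });
    return idx;
}
```

The check costs one O(n) pass, which is small next to the sort it can avoid when the input is already in order.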

List of PRs merged

New Contributors

Full Changelog: v6.0...v7.0