doc/source/dev/roadmap.rst
.. _scipy-roadmap:
This roadmap page contains only the most important ideas and needs for SciPy
going forward. For a more detailed roadmap, including per-subpackage status,
many more ideas, API stability and more, see :ref:scipy-roadmap-detailed.
NumPy has split its API from its execution engine with
__array_function__ and __array_ufunc__. This will enable parts of SciPy
to accept distributed arrays (e.g. dask.array.Array) and GPU arrays (e.g.
cupy.ndarray) that implement the ndarray interface. At the moment it is
not yet clear which algorithms will work out of the box, and if there are
significant performance gains when they do. We want to create a map of which
parts of the SciPy API work, and improve support over time.
In addition to making use of NumPy protocols like __array_function__, we can
make use of these protocols in SciPy as well. That will make it possible to
(re)implement SciPy functions like, e.g., those in scipy.signal for Dask
or GPU arrays (see
NEP 18 - use outside of NumPy <http://www.numpy.org/neps/nep-0018-array-function-protocol.html#use-outside-of-numpy>). NumPy's features in this areas are still evolving,
see e.g. NEP 37 - A dispatch protocol for NumPy-like modules <https://numpy.org/neps/nep-0037-array-module.html>,
and SciPy is an important "client" for those features.
Speed improvements, lower memory usage and the ability to parallelize
algorithms are beneficial to most science domains and use cases. We have
established an API design pattern for multiprocessing - using the workers
keyword - that can be adopted in many more functions.
Making it easier for users to use Numba's @njit in their code that relies
on SciPy functionality would unlock a lot of performance gain. That needs a
strategy though, e.g. based on Transonic <https://transonic.readthedocs.io/>__
or custom code that supports multiple just-in-time and ahead-of-time compilers
(as is explored in gh-23447 <https://github.com/scipy/scipy/pull/23447>__).
Finally, many individual functions can be optimized for performance.
scipy.optimize and scipy.interpolate functions are particularly often
requested in this respect.
SciPy now has continuous integration for ARM64 (or aarch64) and POWER8/9
(or ppc64le), and binaries are available via
Miniforge <https://github.com/conda-forge/miniforge>__. Wheels on PyPI for
these platforms are now also possible (with the manylinux2014 standard),
and requests for those are becoming more frequent.
Additionally, having IBM Z (or s390x) in CI is now possible with TravisCI
but not yet done - and manylinux2014 wheels for that platform are also
possible then. Finally, resolving open AIX build issues would help users.
SciPy sparse matrices are being replaced by sparse arrays.
The sparse matrix formats are mostly feature-complete, however their main issue
is that they act like numpy.matrix (which will be deprecated in NumPy at
some point). What we want is sparse arrays that act like numpy.ndarray
(See discussion at gh-18915 <https://github.com/scipy/scipy/issues/18915>_).
Sparse arrays support all features of sparse matrices as of 1.15.
In addition to 2D arrays, 1D sparse arrays are supported in DOK, COO, CSR formats.
Further functionality e.g. nD array support and broadcasting for some operations
is being developed. The future plan is:
A + B.numpy.matrix.