docs/usage_guide/integrations.rst
Modin is a drop-in replacement for Pandas, so we want it to interoperate with third-party libraries just as Pandas does. To see where Modin performs well and where it needs to improve, we've selected a number of important machine learning + visualization + statistics libraries, and then looked at examples (from their documentation, if possible) about how they work with Pandas. Then we ran those same workflows with Modin, and tracked what worked, and what failed.
In the table below, you'll see, for each third-party library we tested, the number of successful test calls / total test calls, and a qualitative description of how both Pandas and Modin integrate with that library.
In the deeper dive, you can view the Jupyter notebook we have used to test API calls and the corresponding Github issues filed. If you come across other issues/ examples
in your own workflows we encourage you to file an issue <https://github.com/modin-project/modin/issues/new/choose>_ or contribute a PR <https://github.com/modin-project/modin/pulls>_!
.. note:: These interoperability metrics are preliminary and not all APIs for each library have been tested. Feel free to add more!
Modin Interoperability by Library ''''''''''''''''''''''''''''''''' .. list-table:: :widths: 5 5 20 :header-rows: 1
.. |br| raw:: html
A Deeper Dive ''''''''''''''
Jupyter Notebook <https://github.com/modin-project/modin/blob/main/examples/jupyter/integrations/seaborn.ipynb>__
Github Issues * https://github.com/modin-project/modin/issues/5435 * https://github.com/modin-project/modin/issues/5433
Jupyter Notebook <https://github.com/modin-project/modin/blob/main/examples/jupyter/integrations/plotly.ipynb>__
Github Issues * https://github.com/modin-project/modin/issues/5447 * https://github.com/modin-project/modin/issues/5445
Jupyter Notebook <https://github.com/modin-project/modin/blob/main/examples/jupyter/integrations/matplotlib.ipynb>__
Jupyter Notebook <https://github.com/lmodin-project/modin/blob/main/examples/jupyter/integrations/altair.ipynb>__
Github Issues * https://github.com/modin-project/modin/issues/5438
Jupyter Notebook <https://github.com/modin-project/modin/blob/main/examples/jupyter/integrations/bokeh.ipynb>__
Github Issues * https://github.com/modin-project/modin/issues/5437
Jupyter Notebook <https://github.com/modin-project/modin/blob/main/examples/jupyter/integrations/sklearn.ipynb>__
Jupyter Notebook <https://github.com/modin-project/modin/blob/main/examples/jupyter/integrations/huggingface.ipynb>__
Jupyter Notebook <https://github.com/modin-project/modin/blob/main/examples/jupyter/integrations/tensorflow.ipynb>__
Github Issues * https://github.com/modin-project/modin/issues/5439
Jupyter Notebook <https://github.com/modin-project/modin/blob/main/examples/jupyter/integrations/NLTK.ipynb>__
Jupyter Notebook <https://github.com/modin-project/modin/blob/main/examples/jupyter/integrations/xgboost.ipynb>__
Jupyter Notebook <https://github.com/modin-project/modin/blob/main/examples/jupyter/integrations/statsmodels.ipynb>__
Github Issues * https://github.com/modin-project/modin/issues/5440
Appendix: System Information ''''''''''''''''''''''''''''' The example scripts here were run on the following system: