Back to Modin

pandas on Python

docs/development/using_pandas_on_python.rst

0.37.11.0 KB
Original Source

pandas on Python

This section describes usage related documents for the pandas on Python component of Modin.

Modin uses pandas as the primary memory format of the underlying partitions and optimizes queries from the API layer in a specific way to this format. Since it is a default, you do not need to specify the pandas memory format, but we show how to explicitly set it below.

One of the execution engines that Modin uses is Python. This engine is sequential and used for debugging. To enable the pandas on Python execution you should set the following environment variables:

.. code-block:: bash

export MODIN_ENGINE=python export MODIN_STORAGE_FORMAT=pandas

or turn a debug mode on:

.. code-block:: bash

export MODIN_DEBUG=True export MODIN_STORAGE_FORMAT=pandas

or do the same in source code:

.. code-block:: python

import modin.config as cfg cfg.Engine.put('python') cfg.StorageFormat.put('pandas')

.. code-block:: python

import modin.config as cfg cfg.IsDebug.put(True) cfg.StorageFormat.put('pandas')