Back to Modin

Modin Usage Examples

docs/usage_guide/examples/index.rst

0.37.12.9 KB
Original Source

Modin Usage Examples

This section shows Modin usage examples in different scenarios like Modin on a local/remote cluster, the use of Modin spreadsheet.

Tutorials '''''''''

The following tutorials cover the basic usage of Modin. Here <https://www.youtube.com/watch?v=NglkafEmbhE>_ is a one hour video tutorial that walks through these basic exercises.

  • Exercise 1: Introduction to Modin [Source PandasOnRay <https://github.com/modin-project/modin/blob/main/examples/tutorial/jupyter/execution/pandas_on_ray/local/exercise_1.ipynb>, Source PandasOnDask <https://github.com/modin-project/modin/blob/main/examples/tutorial/jupyter/execution/pandas_on_dask/local/exercise_1.ipynb>]
  • Exercise 2: Speed Improvements with Modin [Source PandasOnRay <https://github.com/modin-project/modin/blob/main/examples/tutorial/jupyter/execution/pandas_on_ray/local/exercise_2.ipynb>, Source PandasOnDask <https://github.com/modin-project/modin/blob/main/examples/tutorial/jupyter/execution/pandas_on_dask/local/exercise_2.ipynb>]
  • Exercise 3: Defaulting to pandas with Modin [Source PandasOnRay <https://github.com/modin-project/modin/blob/main/examples/tutorial/jupyter/execution/pandas_on_ray/local/exercise_3.ipynb>, Source PandasOnDask <https://github.com/modin-project/modin/blob/main/examples/tutorial/jupyter/execution/pandas_on_dask/local/exercise_3.ipynb>]

The following tutorials covers more advanced features in Modin:

  • Exercise 4: Experimental Features in Modin (Spreadsheet, Progress Bar) [Source PandasOnRay <https://github.com/modin-project/modin/blob/main/examples/tutorial/jupyter/execution/pandas_on_ray/local/exercise_4.ipynb>, Source PandasOnDask <https://github.com/modin-project/modin/blob/main/examples/tutorial/jupyter/execution/pandas_on_dask/local/exercise_4.ipynb>]
  • Exercise 5: Setting up Modin in a Cluster Environment [Source PandasOnRay <https://github.com/modin-project/modin/blob/main/examples/tutorial/jupyter/execution/pandas_on_ray/cluster/exercise_5.py>__]

How to get required dependencies for the tutorial notebooks and to run them please refer to the respective README.md <https://github.com/modin-project/modin/tree/main/examples/tutorial/jupyter/README.md>__ file.

Data Science Benchmarks '''''''''''''''''''''''

  • Using Modin with the NYC Taxi Dataset [Source <https://github.com/modin-project/modin/blob/main/examples/jupyter/Modin_Taxi.ipynb>__]
  • Using Modin with the Census Dataset (coming soon...)
  • Using Modin with the Plasticc Dataset (coming soon...)

Modin Spreadsheets ''''''''''''''''''

  • Using Modin along with the Spreadsheets API [Source <https://github.com/modin-project/modin/blob/main/examples/spreadsheet/tutorial.ipynb>__]

Modin with scikit-learn '''''''''''''''''''''''

  • Modin for Machine Learning with scikit-learn [Source <https://github.com/modin-project/modin/blob/main/examples/modin-scikit-learn-example.ipynb>__]