examples/01_prepare_data/README.md
The notebooks in this directory illustrate utility functions for common data preparation tasks in recommendation system development, such as data import and export, data transformation, and data splitting.
| Notebook | Description |
|---|---|
| data_split | Details on splitting data (randomly, chronologically, etc.). |
| data_transform | Guidance on transforming implicit and explicit feedback data for building collaborative filtering recommenders. |
| wikidata knowledge graph | Details on how to create a knowledge graph using Wikidata. |
The data_split notebook demonstrates three methods of splitting data into training and test sets. Each method supports both Spark and pandas DataFrames.
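To make the idea of splitting concrete, here is a minimal pandas-only sketch of two of the strategies named in the table above: a random split and a per-user chronological split. It does not use the repository's own utilities. The function names (`random_split`, `chrono_split`), the column names, and the toy data are all assumptions made for illustration.

```python
import pandas as pd

# Toy interaction log; column names are illustrative assumptions.
ratings = pd.DataFrame({
    "userID": [1, 1, 1, 1, 2, 2, 2, 2],
    "itemID": [10, 11, 12, 13, 10, 12, 14, 15],
    "rating": [5.0, 3.0, 4.0, 2.0, 4.0, 1.0, 5.0, 3.0],
    "timestamp": [100, 200, 300, 400, 150, 250, 350, 450],
})


def random_split(df, ratio=0.75, seed=42):
    """Hypothetical helper: sample `ratio` of the rows for training."""
    train = df.sample(frac=ratio, random_state=seed)
    test = df.drop(train.index)  # everything not sampled goes to test
    return train, test


def chrono_split(df, ratio=0.75, col_user="userID", col_time="timestamp"):
    """Hypothetical helper: per user, the earliest `ratio` of rows go to training."""
    ordered = df.sort_values([col_user, col_time])
    rank = ordered.groupby(col_user).cumcount()  # 0-based position within each user
    size = ordered.groupby(col_user)[col_user].transform("size")
    in_train = rank < (size * ratio).round()
    return ordered[in_train], ordered[~in_train]


train_r, test_r = random_split(ratings)
train_c, test_c = chrono_split(ratings)
```

With the chronological split, every test interaction for a user is later than all of that user's training interactions, which mimics predicting future behavior from past behavior. The random split ignores time entirely.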
The data_transform notebook introduces and reviews data transformation techniques that are commonly used in various recommendation scenarios.
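As a rough illustration of what transforming implicit and explicit feedback can look like, the sketch below uses plain pandas. It aggregates weighted events into a per-pair affinity score and binarizes explicit ratings with a threshold. The event types, weights, threshold, and column names are hypothetical choices for this example and are not taken from the notebook.

```python
import pandas as pd

# Implicit feedback: one row per logged event (illustrative data).
events = pd.DataFrame({
    "userID": [1, 1, 1, 2, 2],
    "itemID": [10, 10, 11, 10, 12],
    "event": ["click", "purchase", "click", "click", "click"],
})

# Hypothetical weights: a purchase counts as a stronger signal than a click.
event_weights = {"click": 1.0, "purchase": 3.0}

events["weight"] = events["event"].map(event_weights)
affinity = (
    events.groupby(["userID", "itemID"], as_index=False)["weight"]
    .sum()
    .rename(columns={"weight": "affinity"})
)

# Explicit feedback: turn star ratings into a binary "liked" label.
explicit = pd.DataFrame({
    "userID": [1, 1, 2],
    "itemID": [10, 11, 10],
    "rating": [5.0, 2.0, 4.0],
})
like_threshold = 4.0  # assumed cutoff for a positive interaction
explicit["liked"] = (explicit["rating"] >= like_threshold).astype(int)
```

Either result has one row per user-item pair with a single numeric signal, which is the input shape most collaborative filtering algorithms expect.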