examples/06_benchmarks/README.md
In this folder we show benchmarks of different recommendation algorithms. To simplify the benchmark computation, we provide a set of wrapper functions in the file benchmark_utils.py.
MovieLens is one of the most commonly used datasets in the recommendation systems literature. The dataset consists of a collection of users, movies and movie ratings, and is available in several sizes.
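To make the data shape concrete, here is a minimal sketch of ratings stored as (user, movie, rating) records, together with a random train/test split. The records are invented for illustration and are not real MovieLens data, and `split_ratings` is a hypothetical helper, not a function from this repository.

```python
import random

# Toy ratings in the (user_id, movie_id, rating) shape described above.
# These values are invented for illustration, not real MovieLens data.
ratings = [
    (1, 10, 4.0), (1, 20, 3.0), (1, 30, 5.0),
    (2, 10, 5.0), (2, 30, 4.0),
    (3, 20, 2.0), (3, 30, 3.0), (3, 40, 4.0),
]


def split_ratings(rows, test_ratio=0.25, seed=42):
    """Shuffle a copy of the rows and split them into train and test lists.

    Hypothetical helper for this sketch only.
    """
    rng = random.Random(seed)
    shuffled = list(rows)
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_ratio)
    return shuffled[n_test:], shuffled[:n_test]


train, test = split_ratings(ratings)
users = {user for user, _, _ in ratings}
movies = {movie for _, movie, _ in ratings}
print(f"{len(users)} users, {len(movies)} movies, {len(ratings)} ratings")
print(f"train={len(train)} test={len(test)}")
```

A plain random split is the simplest choice; a real benchmark may prefer a stratified or chronological split so that every user appears in the training set.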
The MovieLens benchmark is available in movielens.ipynb. This illustrative comparison covers collaborative filtering algorithms available in this repository, such as Spark ALS, SVD, SAR, LightGCN and others, run on the MovieLens dataset in three environments (CPU, GPU and Spark). These algorithms can be used in a variety of recommendation tasks, including product or news recommendations.
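The kind of wrapper-based comparison described above could be sketched as follows. This is only an illustration under stated assumptions: the two baseline "algorithms", the `timed` helper, the `ALGORITHMS` registry and the toy data are all hypothetical and do not reflect the actual contents of benchmark_utils.py or the algorithms in the notebook.

```python
import math
import time

# Toy (user_id, movie_id, rating) data, invented for illustration only.
train = [(1, 10, 4.0), (1, 20, 3.0), (2, 10, 5.0),
         (2, 30, 4.0), (3, 20, 2.0), (3, 30, 3.0)]
test = [(1, 30, 4.0), (3, 10, 5.0)]


def train_global_mean(rows):
    """Baseline 'model': the mean of all training ratings."""
    return sum(rating for _, _, rating in rows) / len(rows)


def predict_global_mean(model, rows):
    """Predict the same global mean for every row."""
    return [model for _ in rows]


def train_movie_mean(rows):
    """Baseline 'model': per-movie mean rating plus a global fallback."""
    totals = {}
    for _, movie, rating in rows:
        total, count = totals.get(movie, (0.0, 0))
        totals[movie] = (total + rating, count + 1)
    means = {movie: total / count for movie, (total, count) in totals.items()}
    return means, train_global_mean(rows)


def predict_movie_mean(model, rows):
    """Predict each movie's mean, falling back to the global mean."""
    means, fallback = model
    return [means.get(movie, fallback) for _, movie, _ in rows]


def rmse(rows, predictions):
    """Root mean squared error between true ratings and predictions."""
    errors = [(rating - pred) ** 2 for (_, _, rating), pred in zip(rows, predictions)]
    return math.sqrt(sum(errors) / len(errors))


def timed(fn, *args):
    """Run fn(*args) and return (result, elapsed_seconds)."""
    start = time.perf_counter()
    result = fn(*args)
    return result, time.perf_counter() - start


# Hypothetical registry mapping an algorithm name to its train/predict wrappers.
ALGORITHMS = {
    "global_mean": (train_global_mean, predict_global_mean),
    "movie_mean": (train_movie_mean, predict_movie_mean),
}

results = {}
for name, (train_fn, predict_fn) in ALGORITHMS.items():
    model, train_time = timed(train_fn, train)
    preds, predict_time = timed(predict_fn, model, test)
    results[name] = {
        "train_s": train_time,
        "predict_s": predict_time,
        "rmse": rmse(test, preds),
    }

for name, row in results.items():
    print(f"{name}: rmse={row['rmse']:.3f} "
          f"train={row['train_s']:.6f}s predict={row['predict_s']:.6f}s")
```

Giving every algorithm the same train/predict signature keeps the benchmark loop identical across algorithms, so adding another one only requires a new registry entry.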