Optimizing Collaborative Filtering for Accurate Rating Predictions in Very Sparse Datasets

TitleOptimizing Collaborative Filtering for Accurate Rating Predictions in Very Sparse Datasets
Publication TypeJournal Article
Year of Publication2026
AuthorsLapadaki S-A, Nanos J, Margaris D, Vassilakis C, Spiliotopoulos D
JournalFuture Internet
Volume18
Pagination114
ISSN1999-5903
Keywordscollaborative filtering, K-nearest neighbors (KNN), optimal prediction settings, prediction accuracy, Rating prediction, Recommender Systems, similarity metrics, sparse collaborative filtering, Sparse Datasets
Abstract

Collaborative filtering is one of the most widely used methods for user rating prediction in recommender systems. To evaluate a collaborative filtering system, rating datasets are typically used, which comprise thousands to millions of records consisting of user–item–rating tuples. Initially, a similarity metric is used to quantify the closeness between each user and every other user in the dataset, typically based on the ratings that each pair of users has given to the same items. Subsequently, the K users having the largest similarity to the target user are used to produce rating predictions, which lead to recommendations. A particularly challenging case arises when the rating dataset is very sparse. In this scenario, it is difficult not only to find users with commonly rated items but also to determine the optimal similarity metric and suitable values for variable K. Setting a small value for K results in extremely low prediction coverage, leading to unsuccessful recommendations, while setting a very large K value increases memory requirements and prediction/recommendation generation time. Through a multiparameter experiment, this work aims to determine the optimal settings for rating predictions when very sparse datasets are used in collaborative filtering recommender systems.

URLhttps://www.mdpi.com/1999-5903/18/2/114
DOI10.3390/fi18020114