Exploring Data Splitting Strategies for the Evaluation of Recommendation Models

Cited by: 48

Authors:

Meng, Zaiqiao [1]

McCreadie, Richard [1]

Macdonald, Craig [1]

Ounis, Iadh [1]

Affiliation:

[1] Univ Glasgow, Glasgow, Lanark, Scotland

Source:

RECSYS 2020: 14TH ACM CONFERENCE ON RECOMMENDER SYSTEMS | 2020

Keywords:

Recommender Systems; Splitting Strategy; Model Evaluation; Leave-one-out; Temporal Split

DOI:

10.1145/3383313.3418479

Chinese Library Classification (CLC):

TP18 [Artificial Intelligence Theory]

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Effective methodologies for evaluating recommender systems are critical, so that different systems can be compared in a sound manner. A commonly overlooked aspect of evaluating recommender systems is the selection of the data splitting strategy. In this paper, we show both that there is no standard splitting strategy and that the choice of splitting strategy can have a strong impact on the ranking of recommender systems during evaluation. In particular, we perform experiments comparing three common data splitting strategies, examining their impact on seven state-of-the-art recommendation models across two datasets. Our results demonstrate that the splitting strategy employed is an important confounding variable that can markedly alter the ranking of recommender systems, making much of the currently published literature non-comparable, even when the same datasets and metrics are used.
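To make the idea of a data splitting strategy concrete, the following is a minimal Python sketch of two strategies named in the keywords, leave-one-out and temporal split. It is an illustration only, not the authors' implementation: the record does not specify how the paper defines its three strategies, so the function names, the (user, item, timestamp) tuple layout, and the use of a fixed cutoff timestamp are all assumptions.

```python
from collections import defaultdict


def leave_one_out_split(interactions):
    """Hold out each user's most recent interaction for testing.

    `interactions` is an assumed layout: a list of
    (user, item, timestamp) tuples.
    """
    by_user = defaultdict(list)
    for user, item, ts in interactions:
        by_user[user].append((ts, item))

    train, test = [], []
    for user, events in by_user.items():
        events.sort()  # oldest first (ties broken by item)
        *history, last = events
        train.extend((user, item, ts) for ts, item in history)
        test.append((user, last[1], last[0]))
    return train, test


def temporal_split(interactions, cutoff):
    """Split on one global cutoff: earlier events train, later ones test.

    `cutoff` is a hypothetical parameter; events with
    timestamp >= cutoff go to the test set.
    """
    train = [r for r in interactions if r[2] < cutoff]
    test = [r for r in interactions if r[2] >= cutoff]
    return train, test


# Toy data, invented purely for illustration.
toy = [
    ("u1", "a", 1), ("u1", "b", 5),
    ("u2", "a", 2), ("u2", "c", 3), ("u2", "d", 4),
]
loo_train, loo_test = leave_one_out_split(toy)
tmp_train, tmp_test = temporal_split(toy, cutoff=5)
```

On this toy data the two functions already produce different test sets: leave-one-out holds out one interaction per user, while the cutoff-based split holds out only events at or after the cutoff, so some users may have no test interactions at all. Differences of this kind are what the abstract identifies as a confounding variable.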

Pages: 681-686

Page count: 6
