Exploring Data Splitting Strategies for the Evaluation of Recommendation Models

被引:48
|
作者
Meng, Zaigiao [1 ]
McCreadie, Richard [1 ]
Macdonald, Craig [1 ]
Ounis, Iadh [1 ]
机构
[1] Univ Glasgow, Glasgow, Lanark, Scotland
关键词
Recommender Systems; Spliting Strategy; Model Evaluation; Leave-one-out; Temporal Split;
D O I
10.1145/3383313.3418479
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Effective methodologies for evaluating recommender systems are critical, so that different systems can be compared in a sound manner. A commonly overlooked aspect of evaluating recommender systems is the selection of the data splitting strategy. In this paper, we both show that there is no standard splitting strategy and that the selection of splitting strategy can have a strong impact on the ranking of recommender systems during evaluation. In particular, we perform experiments comparing three common data splitting strategies, examining their impact over seven state-of-the-art recommendation models on two datasets. Our results demonstrate that the splitting strategy employed is an important confounding variable that can markedly alter the ranking of recommender systems, making much of the currently published literature non-comparable, even when the same datasets and metrics are used.
引用
收藏
页码:681 / 686
页数:6
相关论文
共 50 条
  • [21] Exploring new strategies for comparing deep learning models
    Butler, Samantha J.
    Price, Stanton R.
    Hadia, Xian Mae D.
    Price, Steven R.
    Carley, Samantha C.
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS V, 2023, 12538
  • [22] Data splitting strategies for reducing the effect of model selection on inference
    Faraway, JJ
    DIMENSION REDUCTION, COMPUTATIONAL COMPLEXITY AND INFORMATION, 1998, 30 : 332 - 341
  • [23] Splitting Strategies for Video Images Processing in Medical Data Grid
    Chong, Mien May
    Latip, Rohaya Binti
    SOFTWARE ENGINEERING AND COMPUTER SYSTEMS, PT 1, 2011, 179 : 709 - 722
  • [24] A Fair Data Market System with Data Quality Evaluation and Repairing Recommendation
    Ding, Xiaoou
    Wang, Hongzhi
    Zhang, Dan
    Li, Jianzhong
    Gao, Hong
    WEB TECHNOLOGIES AND APPLICATIONS (APWEB 2015), 2015, 9313 : 855 - 858
  • [25] Virtual cells: Evaluation of different lot sizing splitting strategies
    Chin, Shih Y.
    International Journal of Manufacturing Research, 2013, 8 (01) : 18 - 42
  • [26] Recommendation as Generalization: Using Big Data to Evaluate Cognitive Models
    Bourgin, David D.
    Abbott, Joshua T.
    Griffiths, Thomas L.
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 2021, 150 (07) : 1398 - 1409
  • [27] Data-free Knowledge Distillation for Reusing Recommendation Models
    Wang, Cheng
    Sun, Jiacheng
    Dong, Zhenhua
    Zhu, Jieming
    Li, Zhenguo
    Li, Ruixuan
    Zhang, Rui
    PROCEEDINGS OF THE 17TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2023, 2023, : 386 - 395
  • [28] Rethinking the Evaluation for Conversational Recommendation in the Era of Large Language Models
    Wang, Xiaolei
    Tang, Xinyu
    Xin, Wayne
    Wang, Jingyuan
    Wen, Ji-Rong
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 10052 - 10065
  • [29] Development of a recommendation system with multiple subjective evaluation process models
    Yano, E
    Sueyoshi, E
    Shinohara, I
    Kato, T
    2003 INTERNATIONAL CONFERENCE ON CYBERWORLDS, PROCEEDINGS, 2003, : 344 - 351
  • [30] User Assistance during Process Execution - An Experimental Evaluation of Recommendation Strategies
    Haisjackl, Christian
    Weber, Barbara
    BUSINESS PROCESS MANAGEMENT WORKSHOPS, 2011, 66 : 134 - 145