Exploring Data Splitting Strategies for the Evaluation of Recommendation Models

Cited by: 48
Authors
Meng, Zaiqiao [1]
McCreadie, Richard [1]
Macdonald, Craig [1]
Ounis, Iadh [1]
Affiliations
[1] Univ Glasgow, Glasgow, Lanark, Scotland
Keywords
Recommender Systems; Splitting Strategy; Model Evaluation; Leave-one-out; Temporal Split;
DOI
10.1145/3383313.3418479
Chinese Library Classification
TP18 [Theory of Artificial Intelligence];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Effective methodologies for evaluating recommender systems are critical, so that different systems can be compared in a sound manner. A commonly overlooked aspect of evaluating recommender systems is the selection of the data splitting strategy. In this paper, we show both that there is no standard splitting strategy and that the choice of splitting strategy can have a strong impact on the ranking of recommender systems during evaluation. In particular, we perform experiments comparing three common data splitting strategies, examining their impact on seven state-of-the-art recommendation models across two datasets. Our results demonstrate that the splitting strategy employed is an important confounding variable that can markedly alter the ranking of recommender systems, making much of the currently published literature non-comparable, even when the same datasets and metrics are used.
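To make the notion of a splitting strategy concrete, the following is a minimal Python sketch of two strategies named in the keywords, leave-one-out and temporal split. It is an illustration only, not the authors' implementation: the abstract does not define the three strategies it compares, so the exact definitions here are assumptions. The function names, the `(user, item, timestamp)` tuple layout, the `test_ratio` parameter, and the toy data are all hypothetical.

```python
from collections import defaultdict


def leave_one_out_split(interactions):
    """Assumed definition: hold out each user's most recent interaction."""
    by_user = defaultdict(list)
    for user, item, ts in interactions:
        by_user[user].append((ts, item))

    train, test = [], []
    for user, events in by_user.items():
        events.sort()  # oldest first, by timestamp
        *history, last = events
        train.extend((user, item, ts) for ts, item in history)
        test.append((user, last[1], last[0]))
    return train, test


def temporal_split(interactions, test_ratio=0.2):
    """Assumed definition: hold out the globally most recent interactions."""
    ordered = sorted(interactions, key=lambda row: row[2])
    n_test = max(1, int(len(ordered) * test_ratio))
    cut = len(ordered) - n_test
    return ordered[:cut], ordered[cut:]


# Hypothetical toy log of (user, item, timestamp) records.
interactions = [
    ("u1", "a", 1), ("u1", "b", 5), ("u1", "c", 9),
    ("u2", "a", 2), ("u2", "d", 3), ("u2", "c", 6),
    ("u3", "g", 4), ("u3", "b", 7), ("u3", "e", 8), ("u3", "f", 10),
]

loo_train, loo_test = leave_one_out_split(interactions)
tmp_train, tmp_test = temporal_split(interactions, test_ratio=0.2)

print(sorted(loo_test))  # one held-out interaction per user
print(tmp_test)          # the latest interactions overall; some users may be absent
```

On this toy log the two strategies already produce different test sets: leave-one-out evaluates every user once, while the temporal split evaluates only users active in the final time window. This is the kind of difference the abstract identifies as a confounding variable.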
Pages: 681-686
Page count: 6