Exploring Data Splitting Strategies for the Evaluation of Recommendation Models

被引:48
|
作者
Meng, Zaigiao [1 ]
McCreadie, Richard [1 ]
Macdonald, Craig [1 ]
Ounis, Iadh [1 ]
机构
[1] Univ Glasgow, Glasgow, Lanark, Scotland
关键词
Recommender Systems; Spliting Strategy; Model Evaluation; Leave-one-out; Temporal Split;
D O I
10.1145/3383313.3418479
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Effective methodologies for evaluating recommender systems are critical, so that different systems can be compared in a sound manner. A commonly overlooked aspect of evaluating recommender systems is the selection of the data splitting strategy. In this paper, we both show that there is no standard splitting strategy and that the selection of splitting strategy can have a strong impact on the ranking of recommender systems during evaluation. In particular, we perform experiments comparing three common data splitting strategies, examining their impact over seven state-of-the-art recommendation models on two datasets. Our results demonstrate that the splitting strategy employed is an important confounding variable that can markedly alter the ranking of recommender systems, making much of the currently published literature non-comparable, even when the same datasets and metrics are used.
引用
收藏
页码:681 / 686
页数:6
相关论文
共 50 条
  • [31] Evaluation and selection of group recommendation strategies for collaborative searching of learning objects
    Zapata, Alfredo
    Menendez, Victor H.
    Prieto, Manuel E.
    Romero, Cristobal
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2015, 76 : 22 - 39
  • [32] Exploring Models and Data for Image Question Answering
    Ren, Mengye
    Kiros, Ryan
    Zemel, Richard S.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [33] A Case Study on Sampling Strategies for Evaluating Neural Sequential Item Recommendation Models
    Dallmann, Alexander
    Zoller, Daniel
    Hotho, Andreas
    15TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS 2021), 2021, : 505 - 514
  • [34] Exploring Data Fusion Strategies in Medical Record Search
    Zhou, Xin-Ke
    Wu, Sheng-Li
    2016 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SECURITY (CSIS 2016), 2016, : 553 - 558
  • [35] Exploring Security Strategies for Enterprise Data Protection in Organisations
    Ajigini, Olusegun Ademolu
    PROCEEDINGS OF 9TH EUROPEAN CONFERENCE ON IS MANAGEMENT AND EVALUATION (ECIME 2015), 2015, : 1 - 10
  • [36] Exploring chemical space - Generative models and their evaluation
    Vogt, Martin
    ARTIFICIAL INTELLIGENCE IN THE LIFE SCIENCES, 2023, 3
  • [37] Exploring on role of location in intelligent news recommendation from data analysis perspective
    Lv, Pengtao
    Zhang, Qinghui
    Shi, Lei
    Guan, Zhenhan
    Fan, Yanfeng
    Li, Jie
    Zhong, Kaiyang
    Deveci, Muhammet
    INFORMATION SCIENCES, 2024, 662
  • [38] NOTE ON DATA-SPLITTING FOR EVALUATION OF SIGNIFICANCE LEVELS
    COX, DR
    BIOMETRIKA, 1975, 62 (02) : 441 - 444
  • [39] Strategies and Recommendation for Data Loading of FHIR-Based Data Marts with Focus on GDPR Compliance
    Anywar, Michael
    Schreiweis, Bjorn
    Ulrich, Hannes
    DIGITAL PROFESSIONALISM IN HEALTH AND CARE: DEVELOPING THE WORKFORCE, BUILDING THE FUTURE, VOL. 298, 2022, : 127 - 131
  • [40] Exploring active learning strategies for predictive models in mechanics of materials
    Chen, Yingbin
    Deierling, Phillip
    Xiao, Shaoping
    APPLIED PHYSICS A-MATERIALS SCIENCE & PROCESSING, 2024, 130 (08):