Statistical biases in Information Retrieval metrics for recommender systems

被引:79
|
作者
Bellogin, Alejandro [1 ]
Castells, Pablo [1 ]
Cantador, Ivan [1 ]
机构
[1] Univ Autonoma Madrid, Madrid, Spain
来源
INFORMATION RETRIEVAL JOURNAL | 2017年 / 20卷 / 06期
关键词
Evaluation; Recommender systems; Popularity bias; Sparsity bias; Cranfield;
D O I
10.1007/s10791-017-9312-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There is an increasing consensus in the Recommender Systems community that the dominant error-based evaluation metrics are insufficient, and mostly inadequate, to properly assess the practical effectiveness of recommendations. Seeking to evaluate recommendation rankings-which largely determine the effective accuracy in matching user needs-rather than predicted rating values, Information Retrieval metrics have started to be applied for the evaluation of recommender systems. In this paper we analyse the main issues and potential divergences in the application of Information Retrieval methodologies to recommender system evaluation, and provide a systematic characterisation of experimental design alternatives for this adaptation. We lay out an experimental configuration framework upon which we identify and analyse specific statistical biases arising in the adaptation of Information Retrieval metrics to recommendation tasks, namely sparsity and popularity biases. These biases considerably distort the empirical measurements, hindering the interpretation and comparison of results across experiments. We develop a formal characterisation and analysis of the biases upon which we analyse their causes and main factors, as well as their impact on evaluation metrics under different experimental configurations, illustrating the theoretical findings with empirical evidence. We propose two experimental design approaches that effectively neutralise such biases to a large extent. We report experiments validating our proposed experimental variants, and comparing them to alternative approaches and metrics that have been defined in the literature with similar or related purposes.
引用
收藏
页码:606 / 634
页数:29
相关论文
共 50 条
  • [1] Statistical biases in Information Retrieval metrics for recommender systems
    Alejandro Bellogín
    Pablo Castells
    Iván Cantador
    [J]. Information Retrieval Journal, 2017, 20 : 606 - 634
  • [2] Quantum Computing for Information Retrieval and Recommender Systems
    Dacrema, Maurizio Ferrari
    Pasin, Andrea
    Cremonesi, Paolo
    Ferro, Nicola
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT V, 2024, 14612 : 358 - 362
  • [3] Information Retrieval and Folksonomies together for Recommender Systems
    Chevalier, Max
    Dattolo, Antonina
    Hubert, Gilles
    Pitassi, Emanuela
    [J]. E-COMMERCE AND WEB TECHNOLOGIES, 2011, 85 : 172 - +
  • [4] On Statistical Analysis and Optimization of Information Retrieval Effectiveness Metrics
    Wang, Jun
    Zhu, Jianhan
    [J]. SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 226 - 233
  • [5] Special Issue on Information Retrieval, Recommender Systems and Adaptive Systems
    Polignano, Marco
    Semeraro, Giovanni
    [J]. INFORMATION, 2022, 13 (10)
  • [6] Information Retrieval and Recommender Systems JUCS Special Issue
    Cacheda, Fidel
    Parapar, Javier
    [J]. JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2015, 21 (13) : 1706 - 1707
  • [7] Decision Biases in Recommender Systems
    Teppan, Erich Christian
    Zanker, Markus
    [J]. JOURNAL OF INTERNET COMMERCE, 2015, 14 (02) : 255 - 275
  • [8] Foreword for Workshop on Decision Making for Information Retrieval and Recommender Systems
    Xu, Da
    Schnabel, Tobias
    Cui, Xiquan
    Dean, Sarah
    Deshmukh, Aniket
    Yang, Bo
    Yu, Shipeng
    [J]. COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 920 - 920
  • [9] Using and Evaluating Quantum Computing for Information Retrieval and Recommender Systems
    Dacrema, Maurizio Ferrari
    Pasin, Andrea
    Cremonesi, Paolo
    Ferro, Nicola
    [J]. PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 3017 - 3020
  • [10] Novelty and Diversity Enhancement and Evaluation in Recommender Systems and Information Retrieval
    Vargas, Saul
    [J]. SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 1281 - 1281