What Does Affect the Correlation Among Evaluation Measures?

Cited by: 8
Authors
Ferro, Nicola [1, 2]
Affiliations
[1] Univ Padua, Padua, Italy
[2] Dept Informat Engn, Via G Gradenigo 6-B, I-35131 Padua, Italy
Keywords
Correlation analysis; Kendall's tau correlation; AP correlation; evaluation measures; general linear mixed models (GLMM); analysis of variance (ANOVA); grid of points (GoP); RELEVANCE;
DOI
10.1145/3106371
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Information Retrieval (IR) is well known for the great number of adopted evaluation measures, with new ones appearing more and more frequently. In this context, correlation analysis is the tool used to study evaluation measures: it lets us understand whether two measures rank systems similarly, whether they grasp different aspects of system performance or actually reflect different user models, and whether a new measure is well motivated or not. To this end, the two most commonly used correlation coefficients are Kendall's tau correlation and the AP correlation tau(AP). The goal of the article is to investigate the properties of the tool, that is, correlation analysis, that we use to study evaluation measures. In particular, we investigate three research questions about these two correlation coefficients: (i) what is the effect of the number of systems and topics? (ii) what is the effect of removing low-performing systems? (iii) what is the effect of the experimental collections? To answer these research questions, we propose a methodology based on General Linear Mixed Models (GLMM) and ANalysis Of VAriance (ANOVA) to isolate the effects of the number of topics, number of systems, and experimental collections, and to let us observe expected correlation values, net of these effects, which are stable and reliable. We learned that the effect of the number of topics is more prominent than the effect of the number of systems. Even if it produces different absolute values, removing low-performing systems does not seem to provide information substantially different from not removing them, especially when comparing a whole set of evaluation measures. Finally, we found that both document corpora and topic sets affect the correlation among evaluation measures, the effect of the latter being more prominent. Moreover, there is a substantial interaction between evaluation measures, corpora, and topic sets, meaning that the correlation between different evaluation measures can be substantially increased or decreased depending on the corpora and topics at hand.
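The two coefficients named in the abstract can be illustrated with a minimal Python sketch. It is not taken from the article: the function names and the toy scores are hypothetical, ties are ignored (tau-a form), and tau(AP) follows the commonly cited definition in which, for each system below the top of the evaluated ranking, one counts the fraction of systems ranked above it that the reference ranking also places above it.

```python
from itertools import combinations


def kendall_tau(scores_a, scores_b):
    """Kendall's tau (tau-a, no tie correction) between two score lists.

    scores_a[i] and scores_b[i] are the scores of system i under two
    evaluation measures (hypothetical inputs for illustration).
    """
    n = len(scores_a)
    balance = 0  # concordant pairs minus discordant pairs
    for i, j in combinations(range(n), 2):
        prod = (scores_a[i] - scores_a[j]) * (scores_b[i] - scores_b[j])
        if prod > 0:
            balance += 1
        elif prod < 0:
            balance -= 1
    return balance / (n * (n - 1) / 2)


def tau_ap(scores_ref, scores_other):
    """AP correlation of the ranking induced by scores_other with
    respect to the reference ranking induced by scores_ref.

    Asymmetric: swaps near the top of the evaluated ranking are
    penalised more than swaps near the bottom. Assumes no ties.
    """
    n = len(scores_ref)
    # Systems ordered best-first according to the evaluated measure.
    order = sorted(range(n), key=lambda s: scores_other[s], reverse=True)
    total = 0.0
    for i in range(1, n):  # 0-based position i, i.e. ranks 2..n
        current = order[i]
        # Systems above `current` that the reference also ranks above it.
        correct = sum(
            1 for above in order[:i] if scores_ref[above] > scores_ref[current]
        )
        total += correct / i
    return 2.0 * total / (n - 1) - 1.0


if __name__ == "__main__":
    reference = [0.9, 0.8, 0.7, 0.6]    # made-up scores for four systems
    top_swap = [0.8, 0.9, 0.7, 0.6]     # the two best systems exchanged
    bottom_swap = [0.9, 0.8, 0.6, 0.7]  # the two worst systems exchanged
    for name, other in [("top swap", top_swap), ("bottom swap", bottom_swap)]:
        print(name, kendall_tau(reference, other), tau_ap(reference, other))
```

On these toy scores Kendall's tau is the same (about 0.67) for both swaps, while tau(AP) is lower for the swap at the top (about 0.33) than for the swap at the bottom (about 0.78), which is the top-heaviness that distinguishes the two coefficients.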
Pages: 40