What Does Affect the Correlation Among Evaluation Measures?

被引:8
|
作者
Ferro, Nicola [1 ,2 ]
机构
[1] Univ Padua, Padua, Italy
[2] Dept Informat Engn, Via G Gradenigo 6-B, I-35131 Padua, Italy
关键词
Correlation analysis; Kendall's tau correlation; AP correlation; evaluation measures; general linear mixed models (GLMM); analysis of variance (ANOVA); grid of points (GoP); RELEVANCE;
D O I
10.1145/3106371
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Information Retrieval (IR) is well-known for the great number of adopted evaluation measures, with new ones popping up more and more frequently. In this context, correlation analysis is the tool used to study the evaluation measures and to let us understand if two measures rank systems similarly, if they grasp different aspects of system performances or actually reflect different user models, if a new measure is well motivated or not. To this end, the two most commonly used correlation coefficients are the Kendall's tau correlation and the AP correlation tau(AP) The goal of the article is to investigate the properties of the tool, that is, correlation analysis, we use to study evaluation measures. In particular, we investigate three research questions about these two correlation coefficients: (i) what is the effect of the number of systems and topics? (ii) what is the effect of removing low-performing systems? (iii) what is the effect of the experimental collections? To answer these research questions, we propose a methodology based on General Linear Mixed Model (GLMM) and ANalysis Of VAriance (ANOVA) to isolate the effects of the number of topics, number of systems, and experimental collections and to let us observe expected correlation values, net from these effects, which are stable and reliable. We learned that the effect of the number of topics is more prominent than the effect of the number of systems. Even if it produces different absolute values, the effect of removing low-performing systems does not seem to provide information substantially different from not removing them, especially when comparing a whole set of evaluation measures. Finally, we found out that both document corpora and topic sets affect the correlation among evaluation measures, the effect of the latter being more prominent. Moreover, there is a substantial interaction between evaluation measures, corpora and topic sets, meaning that the correlation between different evaluation measures can be substantially increased or decreased depending on the different corpora and topics at hand.
引用
收藏
页数:40
相关论文
共 50 条
  • [1] Does level of specificity affect measures of motivation to comply? A randomized evaluation
    Branscum, Paul
    Senkowski, Valerie
    TRANSLATIONAL BEHAVIORAL MEDICINE, 2019, 9 (02) : 373 - 379
  • [2] What does (and does not) affect crime in India?
    Hazra, Devika
    INTERNATIONAL JOURNAL OF SOCIAL ECONOMICS, 2020, 47 (04) : 503 - 521
  • [3] What progress does it measures?
    Meda, Dominique
    ESPRIT, 2009, (06) : 86 - 118
  • [4] What the need for closure scale measures and what it does not: Toward differentiating among related epistemic motives
    Neuberg, SL
    Judice, TN
    West, SG
    JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 1997, 72 (06) : 1396 - 1412
  • [5] Does Cognitive Load Affect Measures of Consciousness?
    Nilsen, Andre Sevenius
    Storm, Johan Frederik
    Juel, Bjorn Erik
    BRAIN SCIENCES, 2024, 14 (09)
  • [6] Does Asymmetric Correlation Affect Portfolio Optimization?
    Fryd, Lukas
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2016 (ICNAAM-2016), 2017, 1863
  • [7] Globalisation: what is it and how does it affect health?
    Lee, K
    MEDICAL JOURNAL OF AUSTRALIA, 2004, 180 (04) : 156 - 158
  • [8] To what extent does immigration affect inequality?
    Berman, Yonatan
    Aste, Tomaso
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2016, 462 : 1029 - 1039
  • [9] What physics does affect the MRI threshold
    Ilgisonis, V. I.
    Khalzov, I. V.
    Lakhin, V. P.
    Smolyakov, A. I.
    PLASMAS IN THE LABORATORY AND IN THE UNIVERSE: INTERACTIONS, PATTERNS, AND TURBULENCE, 2010, 1242 : 23 - 30
  • [10] What is stress, and how does it affect reproduction?
    Dobson, H
    Smith, RF
    ANIMAL REPRODUCTION SCIENCE, 2000, 60 : 743 - 752