What Does Affect the Correlation Among Evaluation Measures?

Cited by: 8
Author
Ferro, Nicola [1, 2]
Affiliation
[1] Univ Padua, Padua, Italy
[2] Dept Informat Engn, Via G Gradenigo 6-B, I-35131 Padua, Italy
Keywords
Correlation analysis; Kendall's tau correlation; AP correlation; evaluation measures; general linear mixed models (GLMM); analysis of variance (ANOVA); grid of points (GoP); relevance
DOI
10.1145/3106371
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Information Retrieval (IR) is well known for its large number of evaluation measures, with new ones being proposed more and more frequently. In this context, correlation analysis is the tool used to study evaluation measures: it lets us understand whether two measures rank systems similarly, whether they capture different aspects of system performance or reflect different user models, and whether a new measure is well motivated. To this end, the two most commonly used correlation coefficients are Kendall's tau correlation and the AP correlation tau(AP). The goal of this article is to investigate the properties of the tool itself, that is, of the correlation analysis we use to study evaluation measures. In particular, we investigate three research questions about these two correlation coefficients: (i) what is the effect of the number of systems and topics? (ii) what is the effect of removing low-performing systems? (iii) what is the effect of the experimental collections? To answer these research questions, we propose a methodology based on General Linear Mixed Models (GLMM) and ANalysis Of VAriance (ANOVA) that isolates the effects of the number of topics, the number of systems, and the experimental collections, and lets us observe expected correlation values, net of these effects, which are stable and reliable. We learned that the effect of the number of topics is more prominent than the effect of the number of systems. Although it produces different absolute values, removing low-performing systems does not seem to provide information substantially different from not removing them, especially when comparing a whole set of evaluation measures. Finally, we found that both document corpora and topic sets affect the correlation among evaluation measures, with the effect of the latter being more prominent. Moreover, there is a substantial interaction between evaluation measures, corpora, and topic sets, meaning that the correlation between different evaluation measures can be substantially increased or decreased depending on the corpora and topics at hand.
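To make the two coefficients named in the abstract concrete, the following is a minimal sketch, not the article's own code. It assumes each evaluation measure assigns one score per system and that there are no tied scores. The function names `kendall_tau` and `tau_ap` and the toy scores are hypothetical, chosen only for illustration. `kendall_tau` is the tie-free tau-a variant; `tau_ap` follows the usual definition of AP correlation, tau(AP) = 2/(N-1) * sum over ranks i >= 2 of C(i)/(i-1), minus 1, where C(i) counts the systems placed above rank i in the estimated ranking that the reference ranking also places above the system at rank i.

```python
from itertools import combinations


def kendall_tau(x, y):
    """Kendall's tau-a between two lists of per-system scores (assumes no ties)."""
    n = len(x)
    concordant = discordant = 0
    for i, j in combinations(range(n), 2):
        sign = (x[i] - x[j]) * (y[i] - y[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    return (concordant - discordant) / (n * (n - 1) / 2)


def tau_ap(reference, estimate):
    """AP correlation of `estimate` against `reference` (assumes no ties).

    Both arguments are per-system scores; higher means better. The measure is
    asymmetric: `reference` plays the role of the ground-truth ranking.
    """
    n = len(reference)
    # System indices ordered best-first according to the estimated scores.
    order = sorted(range(n), key=lambda s: estimate[s], reverse=True)
    total = 0.0
    for i in range(1, n):
        current = order[i]
        # Systems ranked above `current` that the reference also ranks above it.
        correct = sum(1 for above in order[:i] if reference[above] > reference[current])
        total += correct / i
    return 2.0 * total / (n - 1) - 1.0


# Hypothetical scores of four systems under a reference measure.
reference = [0.9, 0.7, 0.5, 0.3]
swap_top = [0.7, 0.9, 0.5, 0.3]      # the two best systems exchanged
swap_bottom = [0.9, 0.7, 0.3, 0.5]   # the two worst systems exchanged

print(kendall_tau(reference, swap_top), tau_ap(reference, swap_top))
print(kendall_tau(reference, swap_bottom), tau_ap(reference, swap_bottom))
```

In this toy case both swaps give the same Kendall's tau (about 0.67), while tau(AP) is about 0.33 for the swap at the top and about 0.78 for the swap at the bottom, which illustrates that tau(AP) penalizes disagreements among top-ranked systems more heavily.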
Pages: 40