What Does Affect the Correlation Among Evaluation Measures?

被引:8
|
作者
Ferro, Nicola [1 ,2 ]
机构
[1] Univ Padua, Padua, Italy
[2] Dept Informat Engn, Via G Gradenigo 6-B, I-35131 Padua, Italy
关键词
Correlation analysis; Kendall's tau correlation; AP correlation; evaluation measures; general linear mixed models (GLMM); analysis of variance (ANOVA); grid of points (GoP); RELEVANCE;
D O I
10.1145/3106371
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Information Retrieval (IR) is well-known for the great number of adopted evaluation measures, with new ones popping up more and more frequently. In this context, correlation analysis is the tool used to study the evaluation measures and to let us understand if two measures rank systems similarly, if they grasp different aspects of system performances or actually reflect different user models, if a new measure is well motivated or not. To this end, the two most commonly used correlation coefficients are the Kendall's tau correlation and the AP correlation tau(AP) The goal of the article is to investigate the properties of the tool, that is, correlation analysis, we use to study evaluation measures. In particular, we investigate three research questions about these two correlation coefficients: (i) what is the effect of the number of systems and topics? (ii) what is the effect of removing low-performing systems? (iii) what is the effect of the experimental collections? To answer these research questions, we propose a methodology based on General Linear Mixed Model (GLMM) and ANalysis Of VAriance (ANOVA) to isolate the effects of the number of topics, number of systems, and experimental collections and to let us observe expected correlation values, net from these effects, which are stable and reliable. We learned that the effect of the number of topics is more prominent than the effect of the number of systems. Even if it produces different absolute values, the effect of removing low-performing systems does not seem to provide information substantially different from not removing them, especially when comparing a whole set of evaluation measures. Finally, we found out that both document corpora and topic sets affect the correlation among evaluation measures, the effect of the latter being more prominent. Moreover, there is a substantial interaction between evaluation measures, corpora and topic sets, meaning that the correlation between different evaluation measures can be substantially increased or decreased depending on the different corpora and topics at hand.
引用
收藏
页数:40
相关论文
共 50 条
  • [21] What does really affect the colonization of needleless connectors?
    Guembe, Maria
    Jesus Perez-Granda, Maria
    ENFERMEDADES INFECCIOSAS Y MICROBIOLOGIA CLINICA, 2020, 38 (03): : 97 - 98
  • [22] What Does Affect the Sexual Behaviour in Fibromyalgic Patients?
    Bazzichi, Laura
    Rossi, Alessandra
    Conversano, Ciro
    Giacomelli, Camillo
    Ferrari, Claudia
    De Feo, Francesca
    Sernissi, Francesca
    Doveri, Marica
    Carli, Linda
    Bombardieri, Stefano
    ARTHRITIS AND RHEUMATISM, 2011, 63 (10): : S369 - S369
  • [23] DIVERSITY AMONG CONTRACT LAWS: WHAT ARE THE MEASURES, AND WHAT ARE THE DRIVERS?
    Beale, Hugh
    REVISTA DE DERECHO CIVIL, 2024, 11 (04): : 163 - 191
  • [24] Encephalitis What Role does Age affect Outcome?
    Lichert, Frank
    FORTSCHRITTE DER NEUROLOGIE PSYCHIATRIE, 2014, 82 (05) : 246 - 246
  • [25] To What Extent Does Hyperglycemia Affect Bone Metabolism?
    Hayashi, Seiko
    Deshpande, Gautam
    Noto, Hiroshi
    DIABETES, 2019, 68
  • [26] About HIV What is it and how does it affect the brain?
    Simmons, Daniel B.
    McClean, Jeffrey C.
    NEUROLOGY, 2014, 83 (18) : E173 - E173
  • [27] Correlation of Multiple Measures of Voice Evaluation Among Individuals with Muscle Tension Dysphonia: An Exploratory Study
    Benoy, Jesnu Jose
    Jayakumar, Thirunavukkarasu
    INDIAN JOURNAL OF OTOLARYNGOLOGY AND HEAD & NECK SURGERY, 2024, 76 (06) : 5285 - 5292
  • [28] Does sex of client affect counselors' evaluation?
    Lee, DY
    Park, MJ
    Park, SH
    PSYCHOLOGICAL REPORTS, 2004, 94 (03) : 1205 - 1211
  • [29] What Is Rural Adversity, How Does It Affect Wellbeing and What Are the Implications for Action?
    Lawrence-Bourne, Joanne
    Dalton, Hazel
    Perkins, David
    Farmer, Jane
    Luscombe, Georgina
    Oelke, Nelly
    Bagheri, Nasser
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2020, 17 (19) : 1 - 13
  • [30] FAMILIARITY AND ATTENTION - DOES WHAT WE KNOW AFFECT WHAT WE NOTICE
    CHRISTIE, J
    KLEIN, R
    MEMORY & COGNITION, 1995, 23 (05) : 547 - 550