Pooling-based continuous evaluation of information retrieval systems

被引:20
|
作者
Tonon, Alberto [1 ]
Demartini, Gianluca [2 ]
Cudre-Mauroux, Philippe [1 ]
机构
[1] Univ Fribourg, CH-1700 Fribourg, Switzerland
[2] Univ Sheffield, Sheffield S10 2TN, S Yorkshire, England
来源
INFORMATION RETRIEVAL JOURNAL | 2015年 / 18卷 / 05期
关键词
Information retrieval evaluation; Crowdsourcing; Continuous evaluation; Poolingtechniques; RELEVANCE;
D O I
10.1007/s10791-015-9266-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The dominant approach to evaluate the effectiveness of information retrieval (IR) systems is by means of reusable test collections built following the Cranfield paradigm. In this paper, we propose a new IR evaluation methodology based on pooled test-collections and on the continuous use of either crowdsourcing or professional editors to obtain relevance judgements. Instead of building a static collection for a finite set of systems known a priori, we propose an IR evaluation paradigm where retrieval approaches are evaluated iteratively on the same collection. Each new retrieval technique takes care of obtaining its missing relevance judgements and hence contributes to augmenting the overall set of relevance judgements of the collection. We also propose two metrics: Fairness Score, and opportunistic number of relevant documents, which we then use to define new pooling strategies. The goal of this work is to study the behavior of standard IR metrics, IR system ranking, and of several pooling techniques in a continuous evaluation context by comparing continuous and non-continuous evaluation results on classic test collections. We both use standard and crowdsourced relevance judgements, and we actually run a continuous evaluation campaign over several existing IR systems.
引用
收藏
页码:445 / 472
页数:28
相关论文
共 50 条
  • [1] Pooling-based continuous evaluation of information retrieval systems
    Alberto Tonon
    Gianluca Demartini
    Philippe Cudré-Mauroux
    [J]. Information Retrieval Journal, 2015, 18 : 445 - 472
  • [2] Multi-armed bandits for adjudicating documents in pooling-based evaluation of information retrieval systems
    Losada, David E.
    Parapar, Javier
    Barreiro, Alvaro
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2017, 53 (05) : 1005 - 1025
  • [3] Pooling-based Visual Transformer with low complexity attention hashing for image retrieval
    Ren, Huan
    Guo, Jiangtao
    Cheng, Shuli
    Li, Yongming
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 241
  • [4] Pooling-based data interpolation and backdating
    Marcellino, Massimiliano
    [J]. JOURNAL OF TIME SERIES ANALYSIS, 2007, 28 (01) : 53 - 71
  • [5] Exploratory Visualization Tool for the Continuous Evaluation of Information Retrieval Systems
    Gonzalez-Saez, Gabriela
    Galuscakova, Petra
    Deveaud, Romain
    Goeuriot, Lorraine
    Mulhem, Philippe
    [J]. PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 3220 - 3224
  • [6] Spectrum Pooling-Based Optimal Internetwork Spectrum Sharing for Cognitive Radio Systems
    Si, Pengbo
    Yu, F. Richard
    Yang, Ruizhe
    Zhang, Yanhua
    [J]. 2010 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE GLOBECOM 2010, 2010,
  • [7] Pooling-Based Quantitative Approach to Evaluating Binarization Algorithms
    Liu, Maofu
    Liu, Ya
    Liu, Zhenguang
    Hu, Huijun
    Fang, Wei
    [J]. IEEE MULTIMEDIA, 2017, 24 (01) : 86 - 92
  • [8] A pooling-based feature pyramid network for salient object detection
    Shi, Caijuan
    Zhang, Weiming
    Duan, Changyu
    Chen, Houru
    [J]. IMAGE AND VISION COMPUTING, 2021, 107
  • [9] An Auditory Saliency Pooling-Based LSTM Model for Speech Intelligibility Classification
    Gallardo-Antolin, Ascension
    Montero, Juan M.
    [J]. SYMMETRY-BASEL, 2021, 13 (09):
  • [10] Attention pooling-based convolutional neural network for sentence modelling
    Er, Meng Joo
    Zhang, Yong
    Wang, Ning
    Pratama, Mahardhika
    [J]. INFORMATION SCIENCES, 2016, 373 : 388 - 403