On Statistical Analysis and Optimization of Information Retrieval Effectiveness Metrics

Cited by: 0
Authors
Wang, Jun [1]
Zhu, Jianhan [1]
Affiliations
[1] UCL, Dept Comp Sci, London WC1E 6BT, England
Keywords
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This paper presents a new way of thinking about IR metric optimization. It is argued that the optimal ranking problem should be factorized into two distinct yet interrelated stages: the relevance prediction stage and the ranking decision stage. During retrieval, the relevance of documents is not known a priori, and the joint probability of relevance is used to measure the uncertainty of documents' relevance in the collection as a whole. The resulting optimization objective function in the latter stage is thus the expected value of the IR metric with respect to this probability measure of relevance. By statistically analyzing the expected values of IR metrics under such uncertainty, we discover and explain some interesting properties of IR metrics that have not been known before. Our analysis and optimization framework does not assume a particular (relevance) retrieval model or metric, making it applicable to many existing IR models and metrics. Experiments on one of the resulting applications demonstrate its significance in adapting to various IR metrics.
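As a rough illustration of the ranking decision stage described in the abstract, the sketch below picks the ranking that maximizes the expected value of a metric under uncertain relevance. It is not the authors' method: it assumes binary relevance, uses DCG as the example metric, and takes per-document relevance probabilities (`p_rel`, a hypothetical input standing in for the output of the relevance prediction stage) instead of a full joint distribution. Because DCG is linear in the relevance labels, its expectation depends only on these marginal probabilities; that shortcut would not hold for every metric.

```python
import math
from itertools import permutations


def expected_dcg(ranking, p_rel):
    """Expected DCG of a ranking under binary, uncertain relevance.

    By linearity of expectation, E[DCG] = sum_k P(rel of doc at rank k) / log2(k + 1),
    so only the marginal relevance probabilities are needed here.
    """
    return sum(p_rel[doc] / math.log2(pos + 2) for pos, doc in enumerate(ranking))


def best_ranking(p_rel):
    """Exhaustively search all rankings for the one with the highest expected DCG.

    Brute force is only practical for a handful of documents; it is used
    to keep the decision stage explicit, not as an efficient algorithm.
    """
    return max(permutations(p_rel), key=lambda r: expected_dcg(r, p_rel))


# Hypothetical relevance probabilities from some relevance prediction model.
p_rel = {"a": 0.2, "b": 0.9, "c": 0.5}
ranking = best_ranking(p_rel)
score = expected_dcg(ranking, p_rel)
```

For this particular metric the search simply recovers the ordering by decreasing probability of relevance; the point of the sketch is only to show the prediction and decision stages as separate steps.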
Pages: 226-233
Number of pages: 8