Exploring term dependences in probabilistic information retrieval model

被引:0
|
作者
Cho, BH
Lee, C
Lee, GG
机构
[1] Pohang Univ Sci & Technol, Dept Comp Sci & Engn, Nam Gu, Pohang 790784, South Korea
[2] Voiceware Co Ltd, R&D Ctr, Gangnam Gu, Seoul 135280, South Korea
关键词
information retrieval; term dependence; Bahadur-Lazarsfeld expansion; probabilistic model; 2-Poisson model;
D O I
10.1016/S0306-4573(02)00078-X
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most previous information retrieval (IR) models assume that terms of queries and documents are statistically independent from each another. However, this kind of conditional independence assumption is obviously and openly understood to be wrong, so we present a new method of incorporating term dependence in probabilistic retrieval model by adapting Bahadur-Lazarsfeld expansion (BLE) to compensate the weakness of the assumption. In this paper, we describe a theoretic process to apply BLE to the general probabilistic models and the state-of-the-art 2-Poisson model. Through the experiments on two standard document collections, HANTEC2.0 in Korean and WT10g in English, we demonstrate that incorporation of term dependences using the BLE significantly contribute to the improvement of performance in at least two different language IR systems. (C) 2002 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:505 / 519
页数:15
相关论文
共 50 条
  • [1] A New Term Frequency Normalization Model for Probabilistic Information Retrieval
    Jian, Fanghong
    Huang, Jimmy Xiangji
    Zhao, Jiashu
    He, Tingting
    ACM/SIGIR PROCEEDINGS 2018, 2018, : 1237 - 1240
  • [2] A probabilistic information retrieval model by document ranking using term dependencies
    You, Hyun-Jo
    Lee, Jung-Jin
    KOREAN JOURNAL OF APPLIED STATISTICS, 2019, 32 (05) : 763 - 782
  • [3] A probabilistic model for distributed information retrieval
    Baumgarten, C
    PROCEEDINGS OF THE 20TH ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 1997, : 258 - 266
  • [4] Modeling Term Associations for Probabilistic Information Retrieval
    Zhao, Jiashu
    Huang, Jimmy Xiangji
    Ye, Zheng
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2014, 32 (02)
  • [5] Analysis of Probabilistic model for Document Retrieval in Information Retrieval
    Tamrakar, Astha
    Vishwakarma, Santosh K.
    2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, : 760 - 765
  • [6] Rewarding Term Location Information to Enhance Probabilistic Information Retrieval
    Zhao, Jiashu
    Huang, Jimmy Xiangji
    Wu, Shicheng
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 1137 - 1138
  • [7] Using Term Location Information to Enhance Probabilistic Information Retrieval
    Liu, Baiyan
    An, Xiangdong
    Huang, Jimmy Xiangji
    SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 883 - 886
  • [8] A PROBABILISTIC INFERENCE MODEL FOR INFORMATION-RETRIEVAL
    WONG, SKM
    YAO, YY
    INFORMATION SYSTEMS, 1991, 16 (03) : 301 - 321
  • [9] An information retrieval model based on probabilistic network
    Shi, DY
    Qi, ZW
    Fu, C
    You, JY
    2004 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING, PROCEEDINGS, 2004, : 423 - 426
  • [10] Modeling term proximity for probabilistic information retrieval models
    He, Ben
    Huang, Jimmy Xiangji
    Zhou, Xiaofeng
    INFORMATION SCIENCES, 2011, 181 (14) : 3017 - 3031