CRTER: Using Cross Terms to Enhance Probabilistic Information Retrieval

Cited by: 0
Authors
Zhao, Jiashu [1]
Huang, Jimmy Xiangji [1]
He, Ben [1]
Affiliations
[1] York Univ, Dept Comp Sci & Engn, Informat Retrieval & Knowledge Management Res Lab, Toronto, ON, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
Cross Term; Kernel; BM25; Proximity; Probabilistic IR;
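DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract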
Term proximity retrieval rewards documents in which the matched query terms occur close to each other. Although term proximity is known to be effective in many Information Retrieval (IR) applications, the within-document distribution of each individual query term and the way the query terms associate with each other are not fully considered. In this paper, we introduce a pseudo term, namely the Cross Term, to model term proximity for boosting retrieval performance. An occurrence of a query term is assumed to have an impact on its neighboring text, which gradually weakens as the distance from the place of occurrence increases. We use a shape function to characterize this impact. A Cross Term occurs when two query terms appear close to each other and their impact shape functions intersect. We propose a CRoss TErm Retrieval (CRTER) model that combines the Cross Terms' information with basic probabilistic weighting models to rank the retrieved documents. Extensive experiments on standard TREC collections illustrate the effectiveness of our proposed CRTER model.
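
The Cross Term idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which shape function is used, so a Gaussian is assumed here, and the names `gaussian_impact`, `cross_term_weight`, `cross_term_frequency` and the parameter `sigma` are hypothetical. Two identical symmetric shape functions centered on two term occurrences meet at the midpoint between them, so the height of the intersection is taken as the shape function evaluated at half the distance.

```python
import math


def gaussian_impact(distance, sigma=25.0):
    """Assumed shape function: impact of a term occurrence at a given distance.

    Equals 1.0 at the occurrence itself and decays as the distance grows.
    """
    return math.exp(-(distance ** 2) / (2.0 * sigma ** 2))


def cross_term_weight(pos_a, pos_b, sigma=25.0):
    """Height at which the two shape functions intersect.

    Both curves are identical and symmetric, so they cross at the midpoint,
    which lies half the gap away from each occurrence.
    """
    half_gap = abs(pos_a - pos_b) / 2.0
    return gaussian_impact(half_gap, sigma)


def cross_term_frequency(positions_a, positions_b, sigma=25.0):
    """Accumulate the intersection heights over all occurrence pairs
    of two query terms within one document (a hypothetical aggregation)."""
    return sum(
        cross_term_weight(a, b, sigma)
        for a in positions_a
        for b in positions_b
    )


if __name__ == "__main__":
    # Word positions of two query terms in a toy document.
    near = cross_term_frequency([10, 200], [12], sigma=25.0)
    far = cross_term_frequency([10, 200], [120], sigma=25.0)
    print(near > far)  # closer occurrences yield a larger accumulated weight
```

In a full retrieval model, a value such as this accumulated weight would stand in for the within-document frequency of the pseudo term and be passed to a probabilistic weighting function such as BM25; how CRTER does this exactly is not stated in the abstract.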
Pages: 155-164
Number of pages: 10