CRTER: Using Cross Terms to Enhance Probabilistic Information Retrieval

被引:0
|
作者
Zhao, Jiashu [1 ]
Huang, Jimmy Xiangji [1 ]
He, Ben [1 ]
机构
[1] York Univ, Dept Comp Sci & Engn, Informat Retrieval & Knowledge Management Res Lab, Toronto, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Cross Term; Kernel; BM25; Proximity; Probabilistic IR;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Term proximity retrieval rewards a document where the matched query terms occur close to each other. Although term proximity is known to be effective in many Information Retrieval (IR) applications, the within-document distribution of each individual query term and how the query terms associate with each other, are not fully considered. In this paper, we introduce a pseudo term, namely Cross Term, to model term proximity for boosting retrieval performance. An occurrence of a query term is assumed to have an impact towards its neighboring text, which gradually weakens with the increase of the distance to the place of occurrence. We use a shape function to characterize such an impact. A Cross Term occurs when two query terms appear close to each other and their impact shape functions have an intersection. We propose a CRoss TErm Retrieval (CRTER) model that combines the Cross Terms' information with basic probabilistic weighting models to rank the retrieved documents. Extensive experiments on standard TREC collections illustrate the effectiveness of our proposed CRTER model.
引用
收藏
页码:155 / 164
页数:10
相关论文
共 50 条
  • [41] Probabilistic Embeddings for Cross-Modal Retrieval
    Chun, Sanghyuk
    Oh, Seong Joon
    de Rezende, Rafael Sampaio
    Kalantidis, Yannis
    Larlus, Diane
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8411 - 8420
  • [42] Multilingual information access system using cross-language information retrieval
    Hayashi, Yoshihiko
    Matsuo, Yoshihiro
    Nagata, Masaaki
    Furuse, Osamu
    [J]. 2003, Nippon Telegraph and Telephone Corp. (52):
  • [43] Cross-media retrieval using probabilistic model of automatic image annotation
    Xia, Ying
    Wu, Yun Long
    Feng, Jiang Fan
    [J]. International Journal of Signal Processing, Image Processing and Pattern Recognition, 2015, 8 (04) : 145 - 154
  • [44] Cross Language Information Retrieval
    Poibeau, Thierry
    [J]. TRAITEMENT AUTOMATIQUE DES LANGUES, 2010, 51 (03): : 157 - 162
  • [45] Using Lasso RCCA for cross-language information retrieval
    Polajnar, Emil
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2018, 47 (09) : 2739 - 2748
  • [46] Using restricted CCA for cross-language information retrieval
    Polajnar, Emil
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2017, 46 (06) : 4618 - 4626
  • [47] Cross-language information retrieval using web directories
    Kimura, F
    Maeda, A
    Yoshikawa, M
    Uemura, S
    [J]. 2003 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS, AND SIGNAL PROCESSING, VOLS 1 AND 2, CONFERENCE PROCEEDINGS, 2003, : 911 - 914
  • [48] SemApp: A Semantic Approach to Enhance Information Retrieval
    Neji, Sameh
    Chenaina, Tarek
    Shoeb, Abdullah M.
    Ben Ayed, Leila
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2021, PT III, 2021, 12951 : 62 - 78
  • [49] Transliteration Retrieval Model for Cross Lingual Information Retrieval
    Jan, Ea-Ee
    Lin, Shih-Hsiang
    Chen, Berlin
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, 2010, 6458 : 183 - +
  • [50] INEXPENSIVE INFORMATION RETRIEVAL SYSTEM USING COORDINATION OF TERMS WITH EDGE-NOTCHED CARDS
    BALAY, R
    GARDNER, J
    [J]. COLLEGE & RESEARCH LIBRARIES, 1966, 27 (06): : 464 - 469