Using Term Location Information to Enhance Probabilistic Information Retrieval

被引:22
|
作者
Liu, Baiyan [1 ]
An, Xiangdong [1 ]
Huang, Jimmy Xiangji [1 ]
机构
[1] York Univ, Sch Informat Technol, Informat Retrieval & Knowledge Management Res Lab, Toronto, ON M3J 1P3, Canada
关键词
Term location; probabilistic information retrieval; noun; NOUN PHRASES;
D O I
10.1145/2766462.2767827
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Nouns are more important than other parts of speech in information retrieval and are more often found near the beginning or the end of sentences. In this paper, we investigate the effects of rewarding terms based on their location in sentences on information retrieval. Particularly, we propose a novel Term Location (TEL) retrieval model based on BM25 to enhance probabilistic information retrieval, where a kernel-based method is used to capture term placement patterns. Experiments on f ve TREC datasets of varied size and content indicate the proposed model signifcantly outperforms the optimized BM25 and DirichletLM in MAP over all datasets with all kernel functions, and excels the optimized BM25 and DirichletLM over most of the datasets in P@5 and P@20 with different kernel functions.
引用
收藏
页码:883 / 886
页数:4
相关论文
共 50 条
  • [1] Rewarding Term Location Information to Enhance Probabilistic Information Retrieval
    Zhao, Jiashu
    Huang, Jimmy Xiangji
    Wu, Shicheng
    [J]. SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 1137 - 1138
  • [2] CRTER: Using Cross Terms to Enhance Probabilistic Information Retrieval
    Zhao, Jiashu
    Huang, Jimmy Xiangji
    He, Ben
    [J]. PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 155 - 164
  • [3] A topic-based term frequency normalization framework to enhance probabilistic information retrieval
    Jian, Fanghong
    Huang, Jimmy X.
    Zhao, Jiashu
    Ying, Zhiwei
    Wang, Yuqi
    [J]. COMPUTATIONAL INTELLIGENCE, 2020, 36 (02) : 486 - 521
  • [4] Modeling Term Associations for Probabilistic Information Retrieval
    Zhao, Jiashu
    Huang, Jimmy Xiangji
    Ye, Zheng
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2014, 32 (02)
  • [5] A probabilistic information retrieval model by document ranking using term dependencies
    You, Hyun-Jo
    Lee, Jung-Jin
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2019, 32 (05) : 763 - 782
  • [6] A probabilistic justification for using tf × idf term weighting in information retrieval
    Hiemstra D.
    [J]. International Journal on Digital Libraries, 2000, 3 (2) : 131 - 139
  • [7] An improvement of information retrieval system using word location information
    Kim, KwangYoung
    Jin, DuSeok
    Choi, YoonSoo
    Kim, JinSuk
    Suh, YoungKyoon
    Seo, Jerry
    [J]. International Conference on Computational Intelligence for Modelling, Control & Automation Jointly with International Conference on Intelligent Agents, Web Technologies & Internet Commerce, Vol 1, Proceedings, 2006, : 654 - 657
  • [8] Modeling term proximity for probabilistic information retrieval models
    He, Ben
    Huang, Jimmy Xiangji
    Zhou, Xiaofeng
    [J]. INFORMATION SCIENCES, 2011, 181 (14) : 3017 - 3031
  • [9] Exploring term dependences in probabilistic information retrieval model
    Cho, BH
    Lee, C
    Lee, GG
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2003, 39 (04) : 505 - 519
  • [10] Feature location using probabilistic ranking of methods based on execution scenarios and information retrieval
    Poshyvanyk, Denys
    Gueheneuc, Yann-Gael
    Marcus, Andrian
    Antoniol, Giuliano
    Rajlich, Vaclav
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2007, 33 (06) : 420 - 432