Two-level document ranking using mutual information in natural language information retrieval

被引:13
|
作者
Kang, HK [1 ]
Choi, KS [1 ]
机构
[1] KOREA ADV INST SCI & TECHNOL, DEPT COMP SCI, YUSONG GU, TAEJON 305701, SOUTH KOREA
关键词
D O I
10.1016/S0306-4573(96)00074-X
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Information retrieval is to retrieve relevant information that satisfies user's information needs. There arises a problem of how to select only information that is relevant to the user. Ranking techniques are used to find the documents in a collection of documents that are most likely to be relevant to the user's query. However, we find out that there could be retrieved documents whose contexts may not be consistent to the query. Mutual information is a measure which represents the relation between a word and another word. So, we will re-evaluate the relation between the terms in the retrieved document and the terms in the query. In this paper, we discuss a model of natural language information retrieval system that is based on a two-level document ranking method using mutual information. At the first-level, we retrieve documents based on automatically constructed index terms. At the second-level, we reorder the retrieved documents using mutual information. We will show that our method achieves considerable retrieval effectiveness improvement over a traditional linear searching method. Also, we will analyse seven newly developed formulas that reorder the retrieved documents. Among the seven formulas, we will recommend one formula that dominates the others in terms of the retrieval effectiveness. (C) 1997 Elsevier Science Ltd.
引用
收藏
页码:289 / 306
页数:18
相关论文
共 50 条
  • [41] Privacy-aware document retrieval with two-level inverted indexing
    Yifan Qiao
    Shiyu Ji
    Changhai Wang
    Jinjin Shao
    Tao Yang
    [J]. Information Retrieval Journal, 2023, 26
  • [42] Document re-ranking by generality in bio-medical information retrieval
    Yan, X
    Li, X
    Song, DW
    [J]. WEB INFORMATION SYSTEMS ENGINEERING - WISE 2005, 2005, 3806 : 376 - 389
  • [43] A Neural Autoencoder Approach for Document Ranking and Query Refinement in Pharmacogenomic Information Retrieval
    Pfeiffer, Jonas
    Broscheit, Samuel
    Gemulla, Rainer
    Goeschl, Mathias
    [J]. SIGBIOMED WORKSHOP ON BIOMEDICAL NATURAL LANGUAGE PROCESSING (BIONLP 2018), 2018, : 87 - 97
  • [44] Mean-Variance Analysis: A New Document Ranking Theory in Information Retrieval
    Wang, Jun
    [J]. ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 4 - 16
  • [45] Contextualisation of information retrieval process and document ranking task in web search tools
    Bouramoul, Abdelkrim
    [J]. INTERNATIONAL JOURNAL OF SPACE-BASED AND SITUATED COMPUTING, 2016, 6 (02) : 74 - 89
  • [46] Using KCCA for Japanese–English cross-language information retrieval and document classification
    Yaoyong Li
    John Shawe-Taylor
    [J]. Journal of Intelligent Information Systems, 2006, 27 : 117 - 133
  • [47] Two-level information hiding for quantum images using optimal LSB
    Luo, Gaofeng
    Zhou, Ri-Gui
    Mao, Yingling
    [J]. QUANTUM INFORMATION PROCESSING, 2019, 18 (10)
  • [48] Two-level information hiding for quantum images using optimal LSB
    Gaofeng Luo
    Ri-Gui Zhou
    Yingling Mao
    [J]. Quantum Information Processing, 2019, 18
  • [49] Information Encoding Using Two-Level Generation in a Quantum Dot Laser
    Maximov, M. V.
    Shernyakov, Yu. M.
    Gordeev, N. Yu.
    Nadtochiy, A. M.
    Zhukov, A. E.
    [J]. TECHNICAL PHYSICS LETTERS, 2023, 49 (SUPPL 3) : S219 - S221
  • [50] Information Encoding Using Two-Level Generation in a Quantum Dot Laser
    M. V. Maximov
    Yu. M. Shernyakov
    N. Yu. Gordeev
    A. M. Nadtochiy
    A. E. Zhukov
    [J]. Technical Physics Letters, 2023, 49 : S219 - S221