Two-level document ranking using mutual information in natural language information retrieval

被引:13
|
作者
Kang, HK [1 ]
Choi, KS [1 ]
机构
[1] KOREA ADV INST SCI & TECHNOL, DEPT COMP SCI, YUSONG GU, TAEJON 305701, SOUTH KOREA
关键词
D O I
10.1016/S0306-4573(96)00074-X
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Information retrieval is to retrieve relevant information that satisfies user's information needs. There arises a problem of how to select only information that is relevant to the user. Ranking techniques are used to find the documents in a collection of documents that are most likely to be relevant to the user's query. However, we find out that there could be retrieved documents whose contexts may not be consistent to the query. Mutual information is a measure which represents the relation between a word and another word. So, we will re-evaluate the relation between the terms in the retrieved document and the terms in the query. In this paper, we discuss a model of natural language information retrieval system that is based on a two-level document ranking method using mutual information. At the first-level, we retrieve documents based on automatically constructed index terms. At the second-level, we reorder the retrieved documents using mutual information. We will show that our method achieves considerable retrieval effectiveness improvement over a traditional linear searching method. Also, we will analyse seven newly developed formulas that reorder the retrieved documents. Among the seven formulas, we will recommend one formula that dominates the others in terms of the retrieval effectiveness. (C) 1997 Elsevier Science Ltd.
引用
收藏
页码:289 / 306
页数:18
相关论文
共 50 条
  • [31] Using document dimensions for enhanced information retrieval
    Jayasooriya, T
    Manandhar, S
    [J]. APPLIED COMPUTING, PROCEEDINGS, 2004, 3285 : 145 - 152
  • [32] FAST RETRIEVAL OF THE SYMBOL INFORMATION USING THE HIGH-LEVEL LANGUAGE
    GITER, DM
    GOVOROVSKII, SB
    KARTASHOV, AP
    [J]. NAUCHNO-TEKHNICHESKAYA INFORMATSIYA SERIYA 2-INFORMATSIONNYE PROTSESSY I SISTEMY, 1985, (04): : 10 - 11
  • [33] Evaluating information information retrieval using document popularity: An implementation on MapReduce
    Evangelopoulos, Xenophon
    Giannakouris-Salalidis, Victor
    Iliadis, Lazaros
    Makris, Christos
    Plegas, Yannis
    Plerou, Antonia
    Sioutas, Spyros
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2016, 51 : 16 - 23
  • [34] Fast document translation for cross-language information retrieval
    McCarley, JS
    Roukos, S
    [J]. MACHINE TRANSLATION AND THE INFORMATION SOUP, 1998, 1529 : 150 - 157
  • [35] A Polya Urn Document Language Model for Improved Information Retrieval
    Cummins, Ronan
    Paik, Jiaul H.
    Yuanhua, L., V
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2015, 33 (04) : 1 - 34
  • [36] Von neumann mutual information for anisotropic coupled oscillators interacting with a single two-level atom
    Abdalla, M
    Abdel-Aty, M
    Obada, ASF
    [J]. INTERNATIONAL JOURNAL OF THEORETICAL PHYSICS, 2005, 44 (09) : 1649 - 1662
  • [37] von Neumann Mutual Information for Anisotropic Coupled Oscillators Interacting with a Single Two-Level Atom
    M. Sebawe Abdalla
    M. Abdel-Aty
    A.-S. F. Obada
    [J]. International Journal of Theoretical Physics, 2005, 44 : 1649 - 1662
  • [38] Information Retrieval Ranking Using Machine Learning Techniques
    Pandey, Shweta
    Mathur, Iti
    Joshi, Nisheeth
    [J]. PROCEEDINGS 2019 AMITY INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AICAI), 2019, : 86 - 92
  • [39] Probabilistic Ranking of Documents Using Vectors in Information Retrieval
    Saini, Balwinder
    Singh, Vikram
    [J]. COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 1, 2015, 31 : 613 - 624
  • [40] Privacy-aware document retrieval with two-level inverted indexing
    Qiao, Yifan
    Ji, Shiyu
    Wang, Changhai
    Shao, Jinjin
    Yang, Tao
    [J]. INFORMATION RETRIEVAL JOURNAL, 2023, 26 (1-2):