Two-level document ranking using mutual information in natural language information retrieval

被引:13
|
作者
Kang, HK [1 ]
Choi, KS [1 ]
机构
[1] KOREA ADV INST SCI & TECHNOL, DEPT COMP SCI, YUSONG GU, TAEJON 305701, SOUTH KOREA
关键词
D O I
10.1016/S0306-4573(96)00074-X
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Information retrieval is to retrieve relevant information that satisfies user's information needs. There arises a problem of how to select only information that is relevant to the user. Ranking techniques are used to find the documents in a collection of documents that are most likely to be relevant to the user's query. However, we find out that there could be retrieved documents whose contexts may not be consistent to the query. Mutual information is a measure which represents the relation between a word and another word. So, we will re-evaluate the relation between the terms in the retrieved document and the terms in the query. In this paper, we discuss a model of natural language information retrieval system that is based on a two-level document ranking method using mutual information. At the first-level, we retrieve documents based on automatically constructed index terms. At the second-level, we reorder the retrieved documents using mutual information. We will show that our method achieves considerable retrieval effectiveness improvement over a traditional linear searching method. Also, we will analyse seven newly developed formulas that reorder the retrieved documents. Among the seven formulas, we will recommend one formula that dominates the others in terms of the retrieval effectiveness. (C) 1997 Elsevier Science Ltd.
引用
收藏
页码:289 / 306
页数:18
相关论文
共 50 条
  • [1] A resolving of word sense ambiguity using two-level document ranking method in information retrieval
    Kang, Hyun-Kyu
    Jeon, Heung Seok
    Ko, Myeong-Cheol
    Kim, Jin Soo
    Yang, Kiduk
    [J]. 2007 INTERNATIONAL SYMPOSIUM ON INFORMATION TECHNOLOGY CONVERGENCE, PROCEEDINGS, 2007, : 315 - +
  • [2] Two-Level Private Information Retrieval
    Zhou, Ruida
    Tian, Chao
    Sun, Hua
    Plank, James S.
    [J]. 2021 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2021, : 1919 - 1924
  • [3] Candidate document retrieval for cross-lingual plagiarism detection using two-level proximity information
    Ehsan, Nava
    Shakery, Azadeh
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2016, 52 (06) : 1004 - 1017
  • [4] Using Mutual Information Technique in Cross-Language Information Retrieval
    Sari, Syandra
    Adriani, Mirna
    [J]. DIGITAL LIBRARIES: UNIVERSAL AND UBIQUITOUS ACCESS TO INFORMATION, PROCEEDINGS, 2008, 5362 : 276 - +
  • [5] A probabilistic information retrieval model by document ranking using term dependencies
    You, Hyun-Jo
    Lee, Jung-Jin
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2019, 32 (05) : 763 - 782
  • [6] A Two-Level Cache for Distributed Information Retrieval in Search Engines
    Zhang, Weizhe
    He, Hui
    Ye, Jianwei
    [J]. SCIENTIFIC WORLD JOURNAL, 2013,
  • [7] Natural language in information retrieval
    Dura, E
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PROCEEDINGS, 2003, 2588 : 537 - 540
  • [8] Natural language information retrieval
    Corston-Oliver, S
    [J]. COMPUTATIONAL LINGUISTICS, 2000, 26 (03) : 460 - 462
  • [9] Generalized Ensemble Model for Document Ranking in Information Retrieval
    Wang, Yanshan
    Choi, In-Chan
    Liu, Hongfang
    [J]. COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2017, 14 (01) : 123 - 151
  • [10] Multimodal interaction for information retrieval using natural language
    Revuelta-Martinez, Alejandro
    Rodriguez, Luis
    Garcia-Varea, Ismael
    Montero, Francisco
    [J]. COMPUTER STANDARDS & INTERFACES, 2013, 35 (05) : 428 - 441