A hybrid model to improve relevance in document retrieval

被引:0
|
作者
Department of Electronics and Communication, University of Allahabad, Allahabad, India [1 ]
不详 [2 ]
机构
来源
J. Digit. Inf. Manage. | 2006年 / 1卷 / 73-81期
关键词
Intelligent agents - Semantics;
D O I
暂无
中图分类号
学科分类号
摘要
In information retrieval community a lot of work is focused on increasing efficiency by capturing statistical features. The other dominant approach is to improve the relevance by capturing the semantic and contextual information which is invariably inefficient. Generally the two approaches are assumed to be diametrically opposite. In this paper we have tried to combine the two approaches by proposing a hybrid information retrieval model. The model works in two stages. The first stage is a statistical model and the second stage is based on semantics. We have first downsized the document collection for a given query using vector model and then used a conceptual graph (CG) based representation to rank the documents. Our main objective is to investigate the use of conceptual graphs as a precision tool in the second stage. The use of CGs brings semantic in the ranking process resulting in improved relevance. Three experiments have been conducted to demonstrate the feasibility and usefulness of our model. A test run is made on CACM-3204 collection. We observed 34.8% increase in precision for a subset of CACM queries. The second experiment is performed on a test collection specifically designed to test the strength of our model in situation where the same terms are being used in different context. Improved relevance has been observed in this case also. The application of this approach on results retrieved from LYCOS shown significant improvement. The proposed model is both efficient, scalable and domain independent.
引用
收藏
相关论文
共 50 条
  • [31] The precision improvement in document retrieval using ontology based relevance feedback
    Lim, SY
    Lee, WJ
    ADVANCES IN INTELLIGENT COMPUTING, PT 1, PROCEEDINGS, 2005, 3644 : 438 - 446
  • [32] Document Image Retrieval Based on Keyword Spotting Using Relevance Feedback
    Keyvanpour, M.
    Tavoli, R.
    Mozaffari, S.
    INTERNATIONAL JOURNAL OF ENGINEERING, 2014, 27 (01): : 7 - 13
  • [33] Comparative Analysis of Relevance for SVM-Based Interactive Document Retrieval
    Murata, Hiroshi
    Onoda, Takashi
    Yamada, Seiji
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2013, 17 (02) : 149 - 156
  • [34] Extended structural relevance framework: a framework for evaluating structured document retrieval
    Ali, M. Sadek
    Consens, Mariano
    Lalmas, Mounia
    INFORMATION RETRIEVAL, 2012, 15 (06): : 558 - 590
  • [35] Genetic Algorithm Based to Improve HTML']HTML Document Retrieval
    Al-Dallal, Ammar
    Abdul-Wahab, Rasha S.
    2009 SECOND INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE 2009), 2009, : 343 - +
  • [36] A hybrid relevance-feedback approach to text retrieval
    Xu, Z
    Xu, XW
    Yu, K
    Tresp, V
    ADVANCES IN INFORMATION RETRIEVAL, 2003, 2633 : 281 - 293
  • [37] Hybrid pseudo-relevance feedback for microblog retrieval
    Chen, Lin
    Chun, Lin
    Ziyu, Lin
    Quan, Zou
    JOURNAL OF INFORMATION SCIENCE, 2013, 39 (06) : 773 - 788
  • [38] DOCUMENT RETRIEVAL USING A PROBABILISTIC KNOWLEDGE MODEL
    Wang, Shuguang
    Visweswaran, Shyam
    Hauskrecht, Milos
    KDIR 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2009, : 26 - +
  • [39] Document Retrieval Model Through Semantic Linking
    Ensan, Faezeh
    Bagheri, Ebrahim
    WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2017, : 181 - 190
  • [40] Users and experts in the document retrieval system model
    Danilowicz, C.
    1600, (21):