A hybrid model to improve relevance in document retrieval

Authors
Department of Electronics and Communication, University of Allahabad, Allahabad, India [1]
Unknown [2]
Institution
Source
J. Digit. Inf. Manage. | 2006 / Vol. 1 / 73-81
Keywords
Intelligent agents - Semantics;
DOI
Not available
Abstract
In the information retrieval community, much work focuses on increasing efficiency by capturing statistical features. The other dominant approach improves relevance by capturing semantic and contextual information, which is invariably inefficient. The two approaches are generally assumed to be diametrically opposed. In this paper we combine them by proposing a hybrid information retrieval model that works in two stages: the first stage is a statistical model and the second stage is based on semantics. We first downsize the document collection for a given query using the vector model and then use a conceptual graph (CG) based representation to rank the remaining documents. Our main objective is to investigate the use of conceptual graphs as a precision tool in the second stage. The use of CGs brings semantics into the ranking process, resulting in improved relevance. Three experiments were conducted to demonstrate the feasibility and usefulness of our model. The first is a test run on the CACM-3204 collection, where we observed a 34.8% increase in precision for a subset of CACM queries. The second is performed on a test collection specifically designed to test the strength of our model in situations where the same terms are used in different contexts; improved relevance was observed in this case as well. In the third, applying the approach to results retrieved from LYCOS showed significant improvement. The proposed model is efficient, scalable, and domain independent.
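The two-stage pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the term weighting, the CG matching measure, or how the graphs are built. The sketch assumes TF-IDF with cosine similarity for the vector stage, represents each conceptual graph as a plain set of hypothetical (concept, relation, concept) triples, and scores graph similarity with a Dice coefficient. All function names and the toy data are illustrative assumptions.

```python
import math
from collections import Counter

def tfidf_vectors(docs_tokens):
    """Build sparse TF-IDF vectors (dicts) for tokenized documents.
    The weighting scheme is an assumption; the abstract only says 'vector model'."""
    n = len(docs_tokens)
    df = Counter(term for doc in docs_tokens for term in set(doc))
    idf = {term: math.log(n / count) + 1.0 for term, count in df.items()}
    vectors = [{t: c * idf[t] for t, c in Counter(doc).items()} for doc in docs_tokens]
    return vectors, idf

def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def stage_one(query_tokens, docs_tokens, k):
    """Stage 1 (statistical): keep the indices of the k documents closest to the query."""
    vectors, idf = tfidf_vectors(docs_tokens)
    query_vec = {t: c * idf.get(t, 0.0) for t, c in Counter(query_tokens).items()}
    order = sorted(range(len(docs_tokens)),
                   key=lambda i: cosine(query_vec, vectors[i]), reverse=True)
    return order[:k]

def cg_similarity(query_graph, doc_graph):
    """Stage 2 score: Dice overlap of (concept, relation, concept) triples.
    A stand-in for real conceptual graph matching, chosen for brevity."""
    if not query_graph or not doc_graph:
        return 0.0
    return 2 * len(query_graph & doc_graph) / (len(query_graph) + len(doc_graph))

def hybrid_rank(query_tokens, query_graph, docs_tokens, docs_graphs, k):
    """Downsize the collection with the vector model, then re-rank by graph overlap."""
    candidates = stage_one(query_tokens, docs_tokens, k)
    return sorted(candidates,
                  key=lambda i: cg_similarity(query_graph, docs_graphs[i]), reverse=True)

# Toy collection (hypothetical): documents 0 and 1 share the query terms
# "bank" and "loan" but use them in different roles.
docs_tokens = [
    ["customer", "repays", "loan", "bank"],
    ["bank", "approves", "loan", "customer"],
    ["river", "floods", "village"],
]
docs_graphs = [
    {("repay", "agent", "customer"), ("repay", "object", "loan"), ("repay", "recipient", "bank")},
    {("approve", "agent", "bank"), ("approve", "object", "loan"), ("approve", "beneficiary", "customer")},
    {("flood", "agent", "river"), ("flood", "object", "village")},
]
query_tokens = ["bank", "loan"]
query_graph = {("approve", "agent", "bank"), ("approve", "object", "loan")}

print(hybrid_rank(query_tokens, query_graph, docs_tokens, docs_graphs, k=2))  # → [1, 0]
```

In this toy case the vector stage cannot separate documents 0 and 1, since both contain the query terms with identical weights. The graph stage places document 1 first because only it matches the query's relations, which mirrors the "same terms in different contexts" scenario targeted by the second experiment.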