A retrospective study of a hybrid document-context based retrieval model

Cited by: 26
Authors
Wu, H. C. [1]
Luk, Robert W. P.
Wong, K. F.
Kwok, K. L.
Affiliations
[1] Hong Kong Polytech Univ, Dept Comp, Kowloon, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Dept Syst Engn & Engn Management, Shatin, Hong Kong, Peoples R China
[3] CUNY Queens Coll, Dept Comp Sci, Flushing, NY 11367 USA
Keywords
information retrieval; model; theory; retrospective experiment
DOI
10.1016/j.ipm.2006.10.009
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper describes our novel retrieval model, which is based on the contexts of query terms in documents (i.e., document contexts). Our model is novel because it explicitly takes the document contexts into account instead of implicitly using them to find query expansion terms. Our model is based on simulating a user making relevance decisions, and it is a hybrid of various existing effective models and techniques. It estimates the relevance decision preference of a document context as the log-odds and uses smoothing techniques, as found in language models, to solve the problem of zero probabilities. It combines these estimated preferences of document contexts using different types of aggregation operators that comply with different relevance decision principles (e.g., the aggregate relevance principle). Our model is evaluated using retrospective experiments (i.e., with full relevance information), because such experiments can (a) reveal the potential of our model, (b) isolate the problems of the model from those of the parameter estimation, (c) provide information about the major factors affecting the retrieval effectiveness of the model, and (d) show whether the model obeys the probability ranking principle. Our model is promising, as its mean average precision is 60-80% in our experiments using different TREC ad hoc English collections and the NTCIR-5 ad hoc Chinese collection. Our experiments showed that (a) the operators that are consistent with the aggregate relevance principle were effective in combining the estimated preferences, and (b) estimating probabilities using the contexts in the relevant documents can produce better retrieval effectiveness than using the entire relevant documents. © 2006 Elsevier Ltd. All rights reserved.
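The scoring pipeline outlined in the abstract (per-context log-odds, smoothing against zero probabilities, then aggregation over contexts) can be illustrated with a minimal sketch. This is not the authors' formulation: the fixed window width, the additive smoothing, the term-independence assumption inside a context, and all function and parameter names (`context_windows`, `smoothed_prob`, `context_log_odds`, `score_document`, `half_width`, `mu`) are assumptions made only for illustration.

```python
import math
from collections import Counter


def context_windows(tokens, query_terms, half_width=2):
    """Yield the window of tokens centred on each query-term occurrence.

    The fixed half_width is an assumption of this sketch.
    """
    for i, tok in enumerate(tokens):
        if tok in query_terms:
            yield tokens[max(0, i - half_width): i + half_width + 1]


def smoothed_prob(term, counts, total, vocab_size, mu=1.0):
    """Additive smoothing (an assumed choice) so no probability is zero."""
    return (counts[term] + mu) / (total + mu * vocab_size)


def context_log_odds(context, rel_counts, nonrel_counts, vocab_size):
    """Log-odds of relevance for one context, assuming term independence."""
    rel_total = sum(rel_counts.values())
    nonrel_total = sum(nonrel_counts.values())
    return sum(
        math.log(
            smoothed_prob(t, rel_counts, rel_total, vocab_size)
            / smoothed_prob(t, nonrel_counts, nonrel_total, vocab_size)
        )
        for t in context
    )


def score_document(tokens, query_terms, rel_counts, nonrel_counts,
                   vocab_size, aggregate=sum):
    """Combine per-context log-odds with an aggregation operator.

    `aggregate` can be any function over a list of scores, e.g. sum or max.
    Documents without any query-term occurrence are ranked last.
    """
    scores = [
        context_log_odds(c, rel_counts, nonrel_counts, vocab_size)
        for c in context_windows(tokens, query_terms)
    ]
    return aggregate(scores) if scores else float("-inf")
```

In a retrospective setting, `rel_counts` and `nonrel_counts` would be term counts taken from material already judged relevant and non-relevant; swapping `aggregate=sum` for `aggregate=max` is one simple way to compare how different aggregation operators change the ranking.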
Pages: 1308-1331
Page count: 24