Hybrid XML retrieval: Combining information retrieval and a native XML database

被引:8
|
作者
Pehcevski, J [1 ]
Thom, JA
Vercoustre, AM
机构
[1] RMIT Univ, Melbourne, Vic, Australia
[2] INRIA, Rocquencourt, France
来源
INFORMATION RETRIEVAL | 2005年 / 8卷 / 04期
关键词
XML information retrieval; XML databases; eXist; Zettair; INEX;
D O I
10.1007/s10791-005-0748-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates the impact of three approaches to XML retrieval: using Zettair, a full-text information retrieval system; using eXist, a native XML database: and using a hybrid system that takes full article answers from Zettair and uses eXist to extract elements from those articles. For the content-only topics, we undertake a preliminary analysis of the INEX 2003 relevance assessments in order to identify the types of highly relevant document components. Further analysis identifies two complementary sub-cases of relevance assessments (General and Specific) and two categories of topics (Broad and Narrow). We develop a novel retrieval module that for a content-only topic utilises the information from the resulting answer list of a native XML database and dynamically determines the preferable units of retrieval, which we call Coherent Retrieval Elements. The results of our experiments show that-when each of the three systems is evaluated against different retrieval scenarios (such as different cases of relevance assessments, different topic categories and different choices of evaluation metrics)-the XML retrieval systems exhibit varying behaviour and the best performance can be reached for different values of the retrieval parameters. In the case of INEX 2003 relevance assessments for the content-only topics, our newly developed hybrid XML retrieval system is substantially more effective than either Zettair or eXist, and yields a robust and a very effective XML retrieval.
引用
下载
收藏
页码:571 / 600
页数:30
相关论文
共 50 条
  • [41] Compact representations in XML retrieval
    Huang, Fang
    Watt, Stuart
    Harper, David
    Clark, Malcolm
    COMPARATIVE EVALUATION OF XML INFORMATION RETRIEVAL SYSTEMS, 2007, 4518 : 64 - 72
  • [42] Specificity aboutness in XML retrieval
    Blanke, Tobias
    Lalmas, Mounia
    INFORMATION RETRIEVAL, 2011, 14 (01): : 68 - 88
  • [43] Relevance feedback in XML retrieval
    Pan, HL
    CURRENT TRENDS IN DATABASE TECHNOLOGY - EDBT 2004 WORKSHOPS, PROCEEDINGS, 2004, 3268 : 187 - 196
  • [44] An Approach to XML Path Retrieval
    Song Ling
    Li Shengen
    Cui Wei
    Zhang Dongmei
    Niu Xiaofei
    2009 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING ( GRC 2009), 2009, : 513 - 516
  • [45] The dynamic retrieval of XML elements
    Crouch, Carolyn J.
    Khanna, Sudip
    Potnis, Poorva
    Doddapaneni, Nagendra
    ADVANCES IN XML INFORMATION RETRIEVAL AND EVALUATION, 2006, 3977 : 268 - 281
  • [46] Specificity Aboutness in XML Retrieval
    Blanke, Tobias
    Lalmas, Mounia
    ADVANCES IN INFORMATION RETRIEVAL THEORY, 2009, 5766 : 176 - 187
  • [47] Contextualization models for XML retrieval
    Arvola, Paavo
    Kekalainen, Jaana
    Junkkari, Marko
    INFORMATION PROCESSING & MANAGEMENT, 2011, 47 (05) : 762 - 776
  • [48] A voting method for XML retrieval
    Hubert, G
    ADVANCES IN XML INFORMATION RETRIEVAL, 2005, 3493 : 183 - 195
  • [49] Relevance feedback for XML retrieval
    Mass, Y
    Mandelbrod, M
    ADVANCES IN XML INFORMATION RETRIEVAL, 2005, 3493 : 303 - 310
  • [50] Relevance feedback in XML retrieval
    Pan, Hanglin
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2004, 3268 : 187 - 196