Hybrid XML retrieval: Combining information retrieval and a native XML database

Cited by: 8
Authors
Pehcevski, J [1]
Thom, JA
Vercoustre, AM
Affiliations
[1] RMIT Univ, Melbourne, Vic, Australia
[2] INRIA, Rocquencourt, France
Source
INFORMATION RETRIEVAL | 2005, Vol. 8, No. 4
Keywords
XML information retrieval; XML databases; eXist; Zettair; INEX
DOI
10.1007/s10791-005-0748-1
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper investigates the impact of three approaches to XML retrieval: using Zettair, a full-text information retrieval system; using eXist, a native XML database; and using a hybrid system that takes full article answers from Zettair and uses eXist to extract elements from those articles. For the content-only topics, we undertake a preliminary analysis of the INEX 2003 relevance assessments in order to identify the types of highly relevant document components. Further analysis identifies two complementary sub-cases of relevance assessments (General and Specific) and two categories of topics (Broad and Narrow). We develop a novel retrieval module that, for a content-only topic, utilises the information from the resulting answer list of a native XML database and dynamically determines the preferable units of retrieval, which we call Coherent Retrieval Elements. The results of our experiments show that, when each of the three systems is evaluated against different retrieval scenarios (such as different cases of relevance assessments, different topic categories and different choices of evaluation metrics), the XML retrieval systems exhibit varying behaviour and the best performance can be reached for different values of the retrieval parameters. In the case of INEX 2003 relevance assessments for the content-only topics, our newly developed hybrid XML retrieval system is substantially more effective than either Zettair or eXist, and yields robust and very effective XML retrieval.
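The two-stage idea described in the abstract (rank whole articles first, then extract elements only from the top-ranked articles) can be illustrated with a minimal sketch. This is not the authors' implementation and it uses neither Zettair nor eXist: `rank_articles` is a hypothetical term-count stand-in for the full-text engine, `extract_elements` is a hypothetical stand-in for the XML database built on Python's standard `xml.etree.ElementTree`, and the sample articles are invented. The Coherent Retrieval Elements module is not modelled here.

```python
import xml.etree.ElementTree as ET


def rank_articles(articles, terms, k):
    """Stand-in for a full-text engine: rank whole articles by term counts."""
    def score(xml_text):
        # Flatten the article to plain words and count query-term occurrences.
        words = " ".join(ET.fromstring(xml_text).itertext()).lower().split()
        return sum(words.count(t) for t in terms)

    scored = [(score(x), name) for name, x in articles.items()]
    scored = [(s, n) for s, n in scored if s > 0]   # drop non-matching articles
    scored.sort(key=lambda p: (-p[0], p[1]))        # best score first, then by name
    return [n for _, n in scored[:k]]


def extract_elements(xml_text, terms):
    """Stand-in for an XML database: elements whose own text contains a term."""
    hits = []
    for el in ET.fromstring(xml_text).iter():
        text = (el.text or "").strip()
        if any(t in text.lower().split() for t in terms):
            hits.append((el.tag, text))
    return hits


def hybrid_search(articles, terms, k=1):
    """Stage 1: pick the top-k articles. Stage 2: extract elements from them."""
    terms = [t.lower() for t in terms]
    results = []
    for name in rank_articles(articles, terms, k):
        for tag, text in extract_elements(articles[name], terms):
            results.append((name, tag, text))
    return results


# Invented sample data, for illustration only.
ARTICLES = {
    "a1": "<article><title>XML retrieval</title>"
          "<sec><p>Hybrid XML retrieval works.</p></sec></article>",
    "a2": "<article><title>Databases</title>"
          "<sec><p>Relational systems only.</p></sec></article>",
}

if __name__ == "__main__":
    for hit in hybrid_search(ARTICLES, ["XML"], k=1):
        print(hit)
```

Restricting element extraction to the articles chosen in the first stage is the point of the sketch: the element-level step only ever sees documents that the article-level ranking already judged relevant.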
Pages: 571-600
Page count: 30