Hybrid XML Retrieval: Combining Information Retrieval and a Native XML Database

被引:0
|
作者
Jovan Pehcevski
James A. Thom
Anne-Marie Vercoustre
机构
[1] RMIT University,
[2] INRIA,undefined
来源
Information Retrieval | 2005年 / 8卷
关键词
XML information retrieval; XML databases; eXist; Zettair; INEX;
D O I
暂无
中图分类号
学科分类号
摘要
This paper investigates the impact of three approaches to XML retrieval: using Zettair, a full-text information retrieval system; using eXist, a native XML database; and using a hybrid system that takes full article answers from Zettair and uses eXist to extract elements from those articles. For the content-only topics, we undertake a preliminary analysis of the INEX 2003 relevance assessments in order to identify the types of highly relevant document components. Further analysis identifies two complementary sub-cases of relevance assessments (General and Specific) and two categories of topics (Broad and Narrow). We develop a novel retrieval module that for a content-only topic utilises the information from the resulting answer list of a native XML database and dynamically determines the preferable units of retrieval, which we call Coherent Retrieval Elements. The results of our experiments show that—when each of the three systems is evaluated against different retrieval scenarios (such as different cases of relevance assessments, different topic categories and different choices of evaluation metrics)—the XML retrieval systems exhibit varying behaviour and the best performance can be reached for different values of the retrieval parameters. In the case of INEX 2003 relevance assessments for the content-only topics, our newly developed hybrid XML retrieval system is substantially more effective than either Zettair or eXist, and yields a robust and a very effective XML retrieval.
引用
收藏
页码:571 / 600
页数:29
相关论文
共 50 条
  • [1] Hybrid XML retrieval: Combining information retrieval and a native XML database
    Pehcevski, J
    Thom, JA
    Vercoustre, AM
    [J]. INFORMATION RETRIEVAL, 2005, 8 (04): : 571 - 600
  • [2] Database and information retrieval techniques for XML
    Consens, MP
    Baeza-Yates, R
    [J]. ADVANCES IN COMPUTER SCIENCE - ASIAN 2005, PROCEEDINGS: DATA MANAGEMENT ON THE WEB, 2005, 3818 : 22 - 27
  • [3] Tamino -: A database system combining text retrieval and XML
    Schöning, H
    [J]. INTELLIGENT SEARCH ON XML DATA: APPLICATIONS, LANGUAGES, MODELS IMPLEMENTATIONS AND BENCHMARKS, 2003, 2818 : 77 - 89
  • [4] Combining Strategies for XML Retrieval
    Gao, Ning
    Deng, Zhi-Hong
    Jiang, Jia-Jian
    Lv, Sheng-Long
    Yu, Hang
    [J]. COMPARATIVE EVALUATION OF FOCUSED RETRIEVAL, 2011, 6932 : 319 - 331
  • [5] Hybrid XML retrieval revisited
    Pehcevski, J
    Thom, JA
    Tahaghoghi, SMM
    Vercoustre, AM
    [J]. ADVANCES IN XML INFORMATION RETRIEVAL, 2005, 3493 : 153 - 167
  • [6] XML information retrieval mapped to hybrid storage model
    Shin, D
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2000, : 1137 - 1137
  • [7] XML information retrieval and information extraction
    Fuhr, N
    [J]. TEXT MINING: THEORETICAL ASPECTS AND APPLICATIONS, 2003, : 21 - 32
  • [8] Using a native XML database for encoded archival description search and retrieval
    Cornish, A
    [J]. INFORMATION TECHNOLOGY AND LIBRARIES, 2004, 23 (04) : 181 - 184
  • [9] Efficient Preprocesses for Fast Storage and Query Retrieval in Native XML Database
    Su-Cheng, Haw
    Lee, Chien-Sing
    [J]. IETE TECHNICAL REVIEW, 2009, 26 (01) : 28 - 40
  • [10] System of information retrieval in XML documents
    Smadhi, S
    [J]. ISSUES AND TRENDS OF INFORMATION TECHNOLOGY MANAGEMENT IN CONTEMPORARY ORGANIZATIONS, VOLS 1 AND 2, 2002, : 736 - 739