A survey in indexing and searching XML documents

被引:27
|
作者
Luk, RWP [1 ]
Leong, HV
Dillon, TS
Chan, ATS
Croft, WB
Allan, J
机构
[1] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
[2] Univ Massachusetts, Dept Comp Sci, Ctr Intelligent Informat Retrieval, Amherst, MA 01003 USA
关键词
D O I
10.1002/asi.10056
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
XML holds the promise to yield (1) a more precise search by providing additional information in the elements, (2) a better integrated search of documents from heterogeneous sources, (3) a powerful search paradigm using structural as well as content specifications, and (4) data and information exchange to share resources and to support cooperative search. We survey several indexing techniques for XML documents, grouping them into flat-file, semistructured, and structured indexing paradigms. Searching techniques and supporting techniques for searching are reviewed, including full text search, and multistage search. Because searching XML documents can be very flexible, various search result presentations are discussed, as well as database and information retrieval system integration and XML query languages. We also survey various retrieval models, examining how they would be used or extended for retrieving XML documents. To conclude the article, we discuss various open issues that XML poses with respect to information retrieval and database research.
引用
收藏
页码:415 / 437
页数:23
相关论文
共 50 条
  • [1] Indexing and searching XML documents based on content and structure synopses
    He, Weimin
    Fegaras, Leonidas
    Levine, David
    [J]. DATA MANAGEMENT: DATA, DATA EVERYWHERE, PROCEEDINGS, 2007, 4587 : 58 - +
  • [2] Querying and indexing XML documents
    Hu, Gongzhu
    Hammad, Rafat
    [J]. JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2005, 5 (01) : S219 - S233
  • [3] Indexing techniques for query of XML documents
    Wang, Y
    Sun, JL
    Dong, JX
    [J]. COMPUTER SCIENCE AND TECHNOLOGY IN NEW CENTURY, 2001, : 581 - 584
  • [4] Searching XML documents - Preliminary work
    Hassler, Marcus
    Bouchachia, Abdelhamid
    [J]. ADVANCES IN XML INFORMATION RETRIEVAL AND EVALUATION, 2006, 3977 : 119 - 133
  • [5] Semantic Indexing for XML Documents using RDBMS
    Ihsan, Imran
    Kiyani, Faisal Fayyaz
    Qadir, M. Abdul
    Rehman, Mohib ur
    [J]. 2015 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES (ICICT), 2015,
  • [6] Storing and indexing XML documents upside down
    Mathis, Christian
    Haerder, Theo
    Schmidt, Karsten
    [J]. COMPUTER SCIENCE-RESEARCH AND DEVELOPMENT, 2009, 24 (1-2): : 51 - 68
  • [7] Indexing techniques for processing generalized XML documents
    Qadah, Ghassan Z.
    [J]. COMPUTER STANDARDS & INTERFACES, 2017, 49 : 34 - 43
  • [8] A hybird method for efficient indexing of XML documents
    Sun, W
    Liu, DX
    [J]. DEEC 2005: INTERNATIONAL WORKSHOP ON DATA ENGINEERING ISSUES IN E-COMMERCE, PROCEEDINGS, 2005, : 139 - 143
  • [9] Path bitmap indexing for retrieval of XML documents
    Lee, Jae-Min
    Hwang, Byung-Yeon
    [J]. MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, 2006, 3885 : 329 - 339
  • [10] TMIX: Temporal Model for Indexing XML Documents
    Bin-Thalab, Rasha
    El-Tazi, Neamat
    El-Sharkawi, Mohamed E.
    [J]. 2013 ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2013,