LAF: a new XML encoding and indexing strategy for keyword-based XML search

被引:2
|
作者
Deng, Zhi-Hong [1 ]
Xiang, Yong-Qing [1 ]
Gao, Ning [1 ]
机构
[1] Peking Univ, Sch Elect Engn & Comp Sci, Minist Educ, Key Lab Machine Percept, Beijing 100871, Peoples R China
来源
基金
中国国家自然科学基金; 国家高技术研究发展计划(863计划);
关键词
XML keyword search; LAF; two-layer index; ABS; SLCA;
D O I
10.1002/cpe.2906
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
As a large number of corpuses are represented, stored and published in XML format, how to find useful information from XML databases has become an increasingly important issue. Keyword search enables web users to easily access XML data without the need to learn a structured query language or to study complex data schemas. Most existing indexing strategies for XML keyword search are based upon Dewey encoding. In this paper, we proposed a new encoding method called Level Order and Father (LAF) for XML documents. With LAF encoding, we devised a new index structure, called two-layer LAF inverted index, which can greatly decrease the space complexity compared with Dewey encoding-based inverted index. Furthermore, with two-layer LAF inverted index, we proposed a new keyword query algorithm called Algorithm based on Binary Search (ABS) that can quickly find all Smallest Lowest Common Ancestor. We experimentally evaluate two-layer LAF inverted index and ABS algorithm on four real XML data sets selected from Wikipedia. The experimental results prove the advantages of our index method and querying algorithm. The space consumed by two-layer LAF index is less than half of that consumed by Dewey inverted index. Moreover, ABS is about one to two orders of magnitude faster than the classic Stack algorithm. Concurrency and Computation: Practice and Experience, 2012.(c) 2012 Wiley Periodicals, Inc.
引用
收藏
页码:1604 / 1621
页数:18
相关论文
共 50 条
  • [1] KEMB: A Keyword-Based XML Message Broker
    Li, Guoliang
    Feng, Jianhua
    Wang, Jianyong
    Zhou, Lizhu
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (07) : 1035 - 1049
  • [2] Structural feedback for keyword-based XML retrieval
    Schenkel, Ralf
    Theobald, Martin
    ADVANCES IN INFORMATION RETRIEVAL, 2006, 3936 : 326 - 337
  • [3] An effective and efficient approach for keyword-based XML retrieval
    Li, XG
    Gong, H
    Wang, DL
    Yu, G
    ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2005, 3739 : 56 - 67
  • [4] XML keyword search algorithm based on Level-Traverse encoding
    Yao, Quanzhu
    Tian, Bing
    He, Wangyun
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY, PTS 1-4, 2013, 263-266 : 1553 - +
  • [5] XDMA: A Dual Indexing and Mutual Summation Based Keyword Search Algorithm for XML Databases
    Selvaganesan, S.
    Haw, Su-Cheng
    Soon, Lay-Ki
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2014, 24 (04) : 591 - 615
  • [6] A NLG based XML Keyword Search Technology
    Yan Qiu-yan
    Xia Shi-xiong
    ADVANCING SCIENCE THROUGH COMPUTATION, 2008, : 405 - 408
  • [7] A Survey on XML Keyword Search
    Tian, Zongqi
    Lu, Jiaheng
    Li, Deying
    WEB TECHNOLOGIES AND APPLICATIONS, 2011, 6612 : 460 - 471
  • [8] An Extension of LCA based XML Keyword Search
    Supasitthimethee, Umaporn
    Shimizu, Toshiyuki
    Yoshikawa, Masatoshi
    Porkaew, Kriengkrai
    2008 INTERNATIONAL WORKSHOP ON INFORMATION-EXPLOSION AND NEXT GENERATION SEARCH : INGS 2008, PROCEEDINGS, 2008, : 104 - +
  • [9] An encoding scheme for indexing XML data
    Zhang, WS
    Liu, DX
    Li, J
    2004 IEEE INTERNATIONAL CONFERNECE ON E-TECHNOLOGY, E-COMMERE AND E-SERVICE, PROCEEDINGS, 2004, : 525 - 528
  • [10] Effective keyword search in XML documents based on MIU
    Xu, Jianjun
    Lu, Jiaheng
    Wang, Wei
    Shi, Baile
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2006, 3882 : 702 - 716