LAF: a new XML encoding and indexing strategy for keyword-based XML search

Cited by: 2
Authors
Deng, Zhi-Hong [1 ]
Xiang, Yong-Qing [1 ]
Gao, Ning [1 ]
Affiliations
[1] Peking Univ, Sch Elect Engn & Comp Sci, Minist Educ, Key Lab Machine Percept, Beijing 100871, Peoples R China
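Source
Funding
National Natural Science Foundation of China; National High Technology Research and Development Program of China (863 Program);
Keywords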
XML keyword search; LAF; two-layer index; ABS; SLCA;
DOI
10.1002/cpe.2906
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202 ; 0835 ;
Abstract
As large numbers of corpora are represented, stored, and published in XML format, finding useful information in XML databases has become an increasingly important issue. Keyword search lets web users access XML data easily, without learning a structured query language or studying complex data schemas. Most existing indexing strategies for XML keyword search are based on Dewey encoding. In this paper, we propose a new encoding method for XML documents called Level Order and Father (LAF). With LAF encoding, we devise a new index structure, called the two-layer LAF inverted index, which can greatly reduce space complexity compared with a Dewey encoding-based inverted index. Furthermore, on top of the two-layer LAF inverted index, we propose a new keyword query algorithm, called Algorithm based on Binary Search (ABS), that quickly finds all Smallest Lowest Common Ancestors (SLCAs). We experimentally evaluate the two-layer LAF inverted index and the ABS algorithm on four real XML data sets selected from Wikipedia. The experimental results demonstrate the advantages of our index method and query algorithm: the space consumed by the two-layer LAF index is less than half of that consumed by the Dewey inverted index, and ABS is about one to two orders of magnitude faster than the classic Stack algorithm. Concurrency and Computation: Practice and Experience, 2012. © 2012 Wiley Periodicals, Inc.
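To make the terms in the abstract concrete, here is a minimal Python sketch. It assumes, from the name alone (this record does not define the encoding), that LAF labels each node with its level-order number plus the level-order number of its father. The SLCA computation shown is a brute-force walk up father pointers, not the paper's ABS algorithm or its two-layer index. The names `laf_encode`, `build_index`, `slca`, and the toy tree are hypothetical.

```python
from collections import deque

# Toy XML tree as nested (label, children) tuples; the data is made up.
TREE = ("bib", [
    ("book", [("title", [("XML", [])]), ("author", [("Deng", [])])]),
    ("book", [("title", [("search", [])]), ("author", [("Gao", [])])]),
    ("paper", [("title", [("XML", []), ("search", [])])]),
])

def laf_encode(tree):
    """Assumed LAF-style labels: number nodes in level order (BFS) from 1
    and remember each node's father number (0 for the root)."""
    labels, father = {}, {}
    queue = deque([(tree, 0)])
    next_id = 1
    while queue:
        (label, children), parent_id = queue.popleft()
        node_id = next_id
        next_id += 1
        labels[node_id] = label
        father[node_id] = parent_id
        for child in children:
            queue.append((child, node_id))
    return labels, father

def build_index(labels):
    """Plain inverted index: label -> ascending list of node numbers."""
    index = {}
    for node_id in sorted(labels):
        index.setdefault(labels[node_id], []).append(node_id)
    return index

def slca(keywords, index, father):
    """Brute-force SLCA: nodes whose subtree holds every keyword and
    that have no child with the same property."""
    wanted = set(keywords)
    covered = {}  # node number -> query keywords found in its subtree
    for kw in wanted:
        for node in index.get(kw, []):
            while node != 0:  # walk up to the root via father numbers
                covered.setdefault(node, set()).add(kw)
                node = father[node]
    full = {n for n, kws in covered.items() if kws == wanted}
    has_full_child = {father[n] for n in full}
    return sorted(full - has_full_child)

labels, father = laf_encode(TREE)
index = build_index(labels)
print(slca(["XML", "search"], index, father))  # [9]
```

In this toy tree, node 9 is the `title` element under `paper`, the only smallest subtree containing both keywords. Under the stated assumption, each node label is a fixed pair of integers, whereas a Dewey label lists one component per level and so grows with node depth.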
Pages: 1604-1621
Number of pages: 18