An effective and efficient approach for keyword-based XML retrieval

被引:0
|
作者
Li, XG [1 ]
Gong, H [1 ]
Wang, DL [1 ]
Yu, G [1 ]
机构
[1] Northeastern Univ, Sch Informat Sci & Engn, Shenyang 110004, Peoples R China
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
IR-style keyword-based search on XML document has become the most common tool for XML query, as users need not to know the structural information of the target XML document before constructing a query. For a keyword-based search engine for XML document, the key issue is how to return some sets of meaningfully related nodes to user's query efficiently. An ordinary solution of current approaches is to store the relationship of each pair of nodes in an XML document to an index. Obviously, this will lead to serious storage overhead. In this paper, we propose an enhanced inverted index structure (PN-Inverted Index) that stores path information in addition to node ID, and import and extend the concept of LCA to PLCA. Efficient algorithms with these concepts are designed to check the relationship of arbitrary number of nodes. Compared with existing approaches, our approach need not create additional relationship index but just utilize the existing inverted index that is much common for IR-style keyword search engine. Experimental results show that with the promise of returning meaningful answers, our search engine offers great performance benefits. Although the size of the inverted index is increased, the total size of indices of search engine is smaller than the existing approaches.
引用
收藏
页码:56 / 67
页数:12
相关论文
共 50 条
  • [1] Structural feedback for keyword-based XML retrieval
    Schenkel, Ralf
    Theobald, Martin
    ADVANCES IN INFORMATION RETRIEVAL, 2006, 3936 : 326 - 337
  • [2] Efficient Declustering Techniques for keyword-based Information Retrieval
    Behl, S
    Verma, RM
    PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS, 2002, : 294 - 300
  • [3] Keyword-based Vehicle Retrieval
    Park, Eun-Ju
    Kim, Hoyoung
    Jeong, Seonghwan
    Kang, Byungkon
    Kwon, YoungMin
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 4215 - 4222
  • [4] Efficient Skyline Keyword-Based Tree Retrieval on Attributed Graphs
    Wu, Dingming
    Zhang, Zhaofen
    Jensen, Christian S.
    Lu, Kezhong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (11) : 6056 - 6070
  • [5] An approach for keyword-based image retrieval in unannotated image databases
    Jiao, Jun
    Jiang, Yuan
    Li, Ming
    Zhou, Zhi-Hua
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2009, 22 (03): : 422 - 426
  • [6] KEMB: A Keyword-Based XML Message Broker
    Li, Guoliang
    Feng, Jianhua
    Wang, Jianyong
    Zhou, Lizhu
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (07) : 1035 - 1049
  • [7] Keyword-based information retrieval for the WoT
    Xylomenos, George
    Zafeiratos, Evangelos
    Prokopakis, Marios
    Proceedings of the 4th ACM/IEEE Symposium on Edge Computing, SEC 2019, 2019, : 407 - 412
  • [8] Keyword-based information retrieval for the WoT
    Xylomenos, George
    Zafeiratos, Evangelos
    Prokopakis, Marios
    SEC'19: PROCEEDINGS OF THE 4TH ACM/IEEE SYMPOSIUM ON EDGE COMPUTING, 2019, : 407 - 412
  • [9] An effective approach to candidate retrieval for cross-language plagiarism detection: A fusion of conceptual and keyword-based schemes
    Roostaee, Meysam
    Sadreddini, Mohammad Hadi
    Fakhrahmad, Seyed Mostafa
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (02)
  • [10] LAF: a new XML encoding and indexing strategy for keyword-based XML search
    Deng, Zhi-Hong
    Xiang, Yong-Qing
    Gao, Ning
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2013, 25 (11): : 1604 - 1621