Finding and ranking compact connected trees for effective keyword proximity search in XML documents

Cited by: 13
Authors
Feng, Jianhua [1]
Li, Guoliang [1]
Wang, Jianyong [1]
Zhou, Lizhu [1]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci & Technol, Tsinghua Natl Lab Informat Sci & Technol, Beijing 10084, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Lowest common ancestor (LCA); Compact LCA (CLCA); Maximal CLCA (MCLCA); Compact connected trees (CCTrees); Maximal CCTrees (MCCTrees);
DOI
10.1016/j.is.2009.05.004
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
In this paper, we study the problem of keyword proximity search in XML documents. We take the disjunctive semantics among the keywords into consideration and find top-k relevant compact connected trees (CCTrees) as the answers of keyword proximity queries. We first introduce the notions of compact lowest common ancestor (CLCA) and maximal CLCA (MCLCA), and then propose compact connected trees and maximal CCTrees (MCCTrees) to efficiently and effectively answer keyword proximity queries. We give the theoretical upper bounds of the numbers of CLCAs, MCLCAs, CCTrees and MCCTrees, respectively. We devise an efficient algorithm to generate all MCCTrees, and propose a ranking mechanism to rank MCCTrees. Our extensive experimental study shows that our method achieves both high efficiency and effectiveness, and outperforms existing state-of-the-art approaches significantly. (C) 2009 Elsevier B.V. All rights reserved.
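As background for the lowest common ancestor (LCA) notion that the abstract builds on, here is a minimal sketch of computing the LCA of several XML nodes. It assumes nodes are identified by Dewey labels (tuples of child positions from the root), which this record does not state; the function name `lca` and the sample labels are hypothetical. It illustrates only the plain LCA, not the paper's CLCA, MCLCA, or MCCTree definitions.

```python
from typing import List, Tuple

# Assumption: each XML node is identified by a Dewey label, i.e. the
# sequence of child positions on the path from the root to the node.
Dewey = Tuple[int, ...]


def lca(labels: List[Dewey]) -> Dewey:
    """Return the Dewey label of the lowest common ancestor.

    Under Dewey labelling, the LCA of a set of nodes is the longest
    common prefix of their labels.
    """
    if not labels:
        raise ValueError("need at least one node label")
    prefix = labels[0]
    for label in labels[1:]:
        n = 0
        # Advance while both labels agree on the next path component.
        while n < min(len(prefix), len(label)) and prefix[n] == label[n]:
            n += 1
        prefix = prefix[:n]
    return prefix


# Hypothetical nodes that each contain one query keyword.
title_node = (0, 0, 0)
author_node = (0, 0, 1, 0)
other_node = (0, 1, 0)

print(lca([title_node, author_node]))              # (0, 0)
print(lca([title_node, author_node, other_node]))  # (0,)
```

The second call shows why plain LCAs can be too coarse for keyword search: one distant match pulls the answer up to the root, which is the kind of result that more compact answer notions aim to avoid.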
Pages: 186-203
Page count: 18
Related papers
39 in total
  • [1] Keyword proximity search in XML trees
    Hristidis, V
    Koudas, N
    Papakonstantinou, Y
    Srivastava, D
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (04) : 525 - 539
  • [2] Effective XML Keyword Search with Relevance Oriented Ranking
    Bao, Zhifeng
    Ling, Tok Wang
    Chen, Bo
    Lu, Jiaheng
    [J]. ICDE: 2009 IEEE 25TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2009, : 517 - +
  • [3] Effective keyword search in XML documents based on MIU
    Xu, Jianjun
    Lu, Jiaheng
    Wang, Wei
    Shi, Baile
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2006, 3882 : 702 - 716
  • [4] Effective Keyword Search for Candidate Fragments of XML Documents
    Wen, Yanlong
    Zhang, Haiwei
    Zhang, Ying
    Zhang, Lu
    Xu, Lei
    Yuan, Xiaojie
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2011, 2011, 6637 : 427 - 439
  • [5] Exploiting ID references for effective keyword search in XML documents
    Chen, Bo
    Lu, Jiaheng
    Ling, Tok Wang
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2008, 4947 : 529 - +
  • [6] Keyword proximity search on XML graphs
    Hristidis, V
    Papakonstantinou, Y
    Balmin, A
    [J]. 19TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2003, : 367 - 378
  • [7] Effective Keyword Search for Precise Return Information over XML Documents
    Lou, Ying
    Li, Zhanhuai
    Han, Meng
    Xu, Juan
    [J]. PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL IV, 2009, : 567 - 571
  • [8] Semantic relevance ranking for XML keyword search
    Lou, Ying
    Li, Zhanhuai
    Chen, Qun
    [J]. INFORMATION SCIENCES, 2012, 190 : 127 - 143
  • [9] Survey on Keyword Search over XML Documents
    Thuy Ngoc Le
    Ling, Tok Wang
    [J]. SIGMOD RECORD, 2016, 45 (03) : 17 - 28
  • [10] XDist: an effective XML keyword search system with re-ranking model based on keyword distribution
    Ning Gao
    ZhiHong Deng
    ShengLong Lü
    [J]. Science China Information Sciences, 2014, 57 : 1 - 17