Efficient Mining of Frequent Closed XML Query Pattern

被引:0
|
作者
Jian-Hua Feng
Qian Qian
Jian-Yong Wang
Li-Zhu Zhou
机构
[1] Tsinghua University,Department of Computer Science and Technology
关键词
computer software; frequent closed pattern; data mining; XML; XPath;
D O I
暂无
中图分类号
学科分类号
摘要
Previous research works have presented convincing arguments that a frequent pattern mining algorithm should not mine all frequent but only the closed ones because the latter leads to not only more compact yet complete result set but also better efficiency. Upon discovery of frequent closed XML query patterns, indexing and caching can be effectively adopted for query performance enhancement. Most of the previous algorithms for finding frequent patterns basically introduced a straightforward generate-and-test strategy. In this paper, we present SOLARIA*, an efficient algorithm for mining frequent closed XML query patterns without candidate maintenance and costly tree-containment checking. Efficient algorithm of sequence mining is involved in discovering frequent tree-structured patterns, which aims at replacing expensive containment testing with cheap parent-child checking in sequences. SOLARIA* deeply prunes unrelated search space for frequent pattern enumeration by parent-child relationship constraint. By a thorough experimental study on various real-life data, we demonstrate the efficiency and scalability of SOLARIA* over the previous known alternative. SOLARIA* is also linearly scalable in terms of XML queries’ size.
引用
收藏
页码:725 / 735
页数:10
相关论文
共 50 条
  • [21] Mining frequent closed itemsets using conditional frequent pattern tree
    Singh, SR
    Patra, BK
    Giri, D
    Proceedings of the IEEE INDICON 2004, 2004, : 501 - 504
  • [22] Research of frequent pattern mining from XML data based on heterogeneous XML schema
    College of Computer Science, Chongqing University, Chongqing 400044, China
    不详
    J. Comput. Inf. Syst., 2008, 3 (787-794):
  • [23] Discovery of frequent query patterns in XML pattern graph with DTD cardinality constraints
    Liu, Yunfeng
    Wang, Tengjiao
    CISIS 2008: THE SECOND INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS, PROCEEDINGS, 2008, : 256 - +
  • [24] An Efficient Close Frequent Pattern Mining Algorithm
    Tan, Jun
    Bu, Yingyong
    Yang, Bo
    ICICTA: 2009 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION, VOL I, PROCEEDINGS, 2009, : 528 - 531
  • [25] Efficient frequent pattern mining on web logs
    Sun, LP
    Zhang, XZ
    ADVANCED WEB TECHNOLOGIES AND APPLICATIONS, 2004, 3007 : 533 - 542
  • [26] Caching frequent XML query patterns
    Zhan, X
    Li, JZ
    Wang, HZ
    He, ZY
    ADVANCED WEB AND NETWORK TECHNOLOGIES, AND APPLICATIONS, PROCEEDINGS, 2006, 3842 : 68 - 75
  • [27] Weigted-FP-Tree Based XML Query Pattern Mining
    Gu, Mi Sug
    Hwang, Jeong Hee
    Ryu, Keun Ho
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2010, PT I, 2010, 6440 : 417 - 428
  • [28] Efficient mining of frequent closed sequences with time constraints
    Huang G.
    Li M.
    Ren J.
    Journal of Convergence Information Technology, 2011, 6 (10) : 129 - 136
  • [29] An efficient algorithm for incrementally mining frequent closed itemsets
    Show-Jane Yen
    Yue-Shi Lee
    Chiu-Kuang Wang
    Applied Intelligence, 2014, 40 : 649 - 668
  • [30] Fast and memory efficient mining of frequent closed itemsets
    Lucchese, C
    Orlando, S
    Perego, R
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (01) : 21 - 36