Efficient data mining for maximal frequent subtrees

被引:0
|
作者
Xiao, YQ [1 ]
Yao, JF [1 ]
Li, ZG [1 ]
Dunham, MH [1 ]
机构
[1] Georgia Coll & State Univ, Dept Math & Comp Sci, Milledgeville, GA 31061 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A new type of tree mining is defined in this paper, which uncovers maximal frequent induced subtrees from a database of unordered labeled trees. A novel algorithm, PathJoin, is proposed. The algorithm uses a compact data structure, FST-Forest, which compresses the trees and still keeps the original tree structure. PathJoin generates candidate subtrees by joining the frequent paths in FST-Forest. Such candidate subtree generation is localized and thus substantially reduces the number of candidate subtrees. Experiments with synthetic data sets show that the algorithm is effective and efficient.
引用
收藏
页码:379 / +
页数:2
相关论文
共 50 条
  • [1] Mining maximal and closed frequent free subtrees
    Guo, Ping
    Hou, Yang Z.
    Zhuang, Jun
    [J]. DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2007, 14 : 198 - 205
  • [2] CMTreeMiner: Mining both closed and maximal frequent subtrees
    Chi, Y
    Yang, YR
    Xia, Y
    Muntz, RR
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2004, 3056 : 63 - 73
  • [3] Mining closed and maximal frequent induced free subtrees
    Shiozaki, Hitohiro
    Ozaki, Tomonobu
    Ohkawa, Takenao
    [J]. ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, 2006, : 14 - +
  • [4] An efficient algorithm for mining both closed and maximal frequent free subtrees using canonical forms
    Guo, P
    Zhou, Y
    Zhuang, J
    Chen, T
    Kang, YR
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 96 - 107
  • [5] Mining subtrees with frequent occurrence of similar subtrees
    Tosaka, Hisashi
    Nakamura, Atsuyoshi
    Kudo, Mineichi
    [J]. DISCOVERY SCIENCE, PROCEEDINGS, 2007, 4755 : 286 - +
  • [6] BEST: An Efficient Algorithm for Mining Frequent Unordered Embedded Subtrees
    Chowdhury, Israt Jahan
    Nayak, Richi
    [J]. PRICAI 2014: TRENDS IN ARTIFICIAL INTELLIGENCE, 2014, 8862 : 459 - 471
  • [7] BOSTER: An Efficient Algorithm for Mining Frequent Unordered Induced Subtrees
    Chowdhury, Israt J.
    Nayak, Richi
    [J]. WEB INFORMATION SYSTEMS ENGINEERING - WISE 2014, PT I, 2014, 8786 : 146 - 155
  • [8] Mining closed and maximal frequent subtrees from databases of labeled rooted trees
    Chi, Y
    Xia, Y
    Yang, YR
    Muntz, RR
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (02) : 190 - 202
  • [9] A novel method for mining frequent subtrees from XML data
    Zhang, WS
    Liu, DX
    Zhang, JP
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING IDEAL 2004, PROCEEDINGS, 2004, 3177 : 300 - 305
  • [10] An efficient algorithm for mining maximal frequent patterns over data streams
    Yang, Junrui
    Wei, Yanjun
    Zhou, Fenfen
    [J]. 2015 7TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS IHMSC 2015, VOL II, 2015,