Compressed hierarchical mining of frequent closed patterns from dense data sets

被引:5
|
作者
Ji, Liping [1 ]
Tan, Kian-Lee [1 ]
Tung, Anthony K. H. [1 ]
机构
[1] Natl Univ Singapore, Dept Comp Sci, Singapore 117543, Singapore
关键词
frequent closed patterns; progressive; dense data sets; data mining; parallel mining;
D O I
10.1109/TKDE.2007.1047
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the problem of finding frequent closed patterns ( FCPs) from very dense data sets. We introduce two compressed hierarchical FCP mining algorithms: C-Miner and B-Miner. The two algorithms compress the original mining space, hierarchically partition the whole mining task into independent subtasks, and mine each subtask progressively. The two algorithms adopt different task partitioning strategies: C-Miner partitions the mining task based on Compact Matrix Division, whereas B-Miner partitions the task based on Base Rows Projection. The compressed hierarchical mining algorithms enhance the mining efficiency and facilitate a progressive refinement of results. Moreover, because the subtasks can be mined independently, C-Miner and B-Miner can be readily paralleled without incurring significant communication overhead. We have implemented C-Miner and B-Miner, and our performance study on synthetic data sets and real dense microarray data sets shows their effectiveness over existing schemes. We also report experimental results on parallel versions of these two methods.
引用
收藏
页码:1175 / 1187
页数:13
相关论文
共 50 条
  • [21] Approximating the number of frequent sets in dense data
    Boley, Mario
    Grosskreutz, Henrik
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2009, 21 (01) : 65 - 89
  • [22] Data Stream Frequent Closed Item Sets Mining Based on Fast Sliding Window
    Chen Zhihua
    Luo Jun
    [J]. MECHANICAL AND ELECTRONICS ENGINEERING III, PTS 1-5, 2012, 130-134 : 3702 - 3707
  • [23] A New Closed Frequent Items Mining from Tree Data
    Du, Qinsheng
    Li, Xiongfei
    Li, Wei
    [J]. 2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 400 - 403
  • [24] Distributed Mining of Constrained Frequent Sets from Uncertain Data
    Cuzzocrea, Alfredo
    Leung, Carson K.
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, PT I: ICA3PP 2011, 2011, 7916 : 40 - +
  • [25] Mining frequent patterns from univariate uncertain data
    Liu, Ying-Ho
    [J]. DATA & KNOWLEDGE ENGINEERING, 2012, 71 (01) : 47 - 68
  • [26] Mining Frequent Patterns from Network Data Flow
    Lin, Xin
    Deng, Zhi-Hong
    Ma, Hao
    Tang, Shi-Wei
    Zhang, Bei
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2009, 5678 : 528 - +
  • [27] Complete Mining of Frequent Patterns from Graphs: Mining Graph Data
    Akihiro Inokuchi
    Takashi Washio
    Hiroshi Motoda
    [J]. Machine Learning, 2003, 50 : 321 - 354
  • [28] Complete mining of frequent patterns from graphs: Mining graph data
    Inokuchi, A
    Washio, T
    Motoda, H
    [J]. MACHINE LEARNING, 2003, 50 (03) : 321 - 354
  • [29] Mining frequent closed itemsets for large data
    Fu, HG
    Nguifo, EM
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA'04), 2004, : 328 - 335
  • [30] CFGM: An algorithm for closed frequent graph patterns mining
    Peng, He
    Zhang, Defu
    [J]. INFORMATION SCIENCES, 2023, 625 : 327 - 341