Frequent pattern mining in attributed trees: algorithms and applications

Cited by: 11
Authors
Pasquier, Claude [1, 2, 3]
Sanhes, Jeremy [3 ]
Flouvat, Frederic [3 ]
Selmaoui-Folcher, Nazha [3 ]
Affiliations
[1] Univ Nice Sophia Antipolis, I3S, UMR 7271, F-06900 Sophia Antipolis, France
[2] CNRS, I3S, UMR 7271, F-06900 Sophia Antipolis, France
[3] Univ New Caledonia, Pole Pluridisciplinaire Mat & Environm, Noumea 98851, New Caledonia
Keywords
Tree mining; Frequent pattern mining; Attributed tree; Condensed representation
DOI
10.1007/s10115-015-0831-x
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Frequent pattern mining is an important data mining task with a broad range of applications. Initially focused on the discovery of frequent itemsets, studies were extended to mine structural forms like sequences, trees or graphs. In this paper, we introduce a new domain of patterns, attributed trees (atrees), and a method to extract these patterns in a forest of atrees. Attributed trees are trees in which vertices are associated with itemsets. Mining patterns of this type (called asubtrees), which combines tree mining and itemset mining, requires the exploration of a huge search space. To make our approach scalable, we investigate the mining of condensed representations. For attributed trees, the classical concept of closure involves both itemset closure and structural closure. We present three algorithms for mining all patterns, closed patterns w.r.t. itemsets (content) and/or structure in attributed trees. We show that, for low support values, mining content-closed attributed trees is a good compromise between non-redundancy of solutions and execution time.
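To make the notions of attributed tree and content closure concrete, here is a minimal Python sketch. It is not one of the paper's three algorithms: it handles only single-vertex patterns, and the names `ATree`, `support`, and `content_closure`, as well as the choice to count support per tree, are assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import FrozenSet, Iterator, List


@dataclass
class ATree:
    """One vertex of an attributed tree: an itemset plus child subtrees.
    (Hypothetical structure, not taken from the paper.)"""
    items: FrozenSet[str]
    children: List["ATree"] = field(default_factory=list)

    def vertices(self) -> Iterator["ATree"]:
        """Yield this vertex and all of its descendants."""
        yield self
        for child in self.children:
            yield from child.vertices()


def support(forest: List[ATree], itemset: FrozenSet[str]) -> int:
    """Count the trees with at least one vertex whose itemset contains `itemset`.
    Per-tree counting is an assumption of this sketch."""
    return sum(
        any(itemset <= v.items for v in tree.vertices())
        for tree in forest
    )


def content_closure(forest: List[ATree], itemset: FrozenSet[str]) -> FrozenSet[str]:
    """Largest itemset shared by every vertex that contains `itemset`.
    Only single-vertex patterns are considered; structure is ignored."""
    matches = [
        v.items
        for tree in forest
        for v in tree.vertices()
        if itemset <= v.items
    ]
    if not matches:
        return itemset
    return frozenset.intersection(*matches)


# A toy forest of two attributed trees.
forest = [
    ATree(frozenset("ab"), [ATree(frozenset("abc")), ATree(frozenset("d"))]),
    ATree(frozenset("abd"), [ATree(frozenset("c"))]),
]

print(support(forest, frozenset("a")))                  # number of trees containing item a
print(sorted(content_closure(forest, frozenset("a"))))  # items that always accompany a
```

In this toy forest, every vertex that carries item `a` also carries `b`, so the pattern `{a}` is not content-closed while `{a, b}` is. Reporting only the closed form is the kind of redundancy reduction the abstract refers to; handling multi-vertex asubtrees and structural closure would require considerably more machinery than this sketch shows.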
Pages: 491-514
Number of pages: 24