Frequent pattern mining in attributed trees: algorithms and applications

被引:11
|
作者
Pasquier, Claude [1 ,2 ,3 ]
Sanhes, Jeremy [3 ]
Flouvat, Frederic [3 ]
Selmaoui-Folcher, Nazha [3 ]
机构
[1] Univ Nice Sophia Antipolis, I3S, UMR 7271, F-06900 Sophia Antipolis, France
[2] CNRS, I3S, UMR 7271, F-06900 Sophia Antipolis, France
[3] Univ New Caledonia, Pole Pluridisciplinaire Mat & Environm, Noumea 98851, New Caledonia
关键词
Tree mining; Frequent pattern mining; Attributed tree; Condensed representation;
D O I
10.1007/s10115-015-0831-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Frequent pattern mining is an important data mining task with a broad range of applications. Initially focused on the discovery of frequent itemsets, studies were extended to mine structural forms like sequences, trees or graphs. In this paper, we introduce a new domain of patterns, attributed trees (atrees), and a method to extract these patterns in a forest of atrees. Attributed trees are trees in which vertices are associated with itemsets. Mining this type of patterns (called asubtrees), which combines tree mining and itemset mining, requires the exploration of a huge search space. To make our approach scalable, we investigate the mining of condensed representations. For attributed trees, the classical concept of closure involves both itemset closure and structural closure. We present three algorithms for mining all patterns, closed patterns w.r.t. itemsets (content) and/or structure in attributed trees. We show that, for low support values, mining content-closed attributed trees is a good compromise between non-redundancy of solutions and execution time.
引用
收藏
页码:491 / 514
页数:24
相关论文
共 50 条
  • [41] Mining Frequent Subgraph Pattern over a Collection of Attributed-Graphs and Construction of a Relation Hierarchy for Result Reporting
    Perner, Petra
    ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS, ICDM 2017, 2017, 10357 : 323 - 344
  • [42] Bi-pattern mining of attributed networks
    Henry Soldano
    Guillaume Santini
    Dominique Bouthinon
    Sophie Bary
    Emmanuel Lazega
    Applied Network Science, 4
  • [43] Mining Frequent Sequential Subgraph Evolutions in Dynamic Attributed Graphs
    Cheng, Zhi
    Andriamampianina, Landy
    Ravat, Franck
    Song, Jiefu
    Valles-Parlangeau, Nathalie
    Fournier-Viger, Philippe
    Selmaoui-Folcher, Nazha
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT II, 2023, 13936 : 66 - 78
  • [44] Bi-pattern mining of attributed networks
    Soldano, Henry
    Santini, Guillaume
    Bouthinon, Dominique
    Bary, Sophie
    Lazega, Emmanuel
    APPLIED NETWORK SCIENCE, 2019, 4 (01)
  • [45] Efficiently mining frequent embedded unordered trees
    Zaki, MJ
    FUNDAMENTA INFORMATICAE, 2005, 66 (1-2) : 33 - 52
  • [46] Mining frequent trees based on topology projection
    Ma, HB
    Wang, C
    Li, RL
    Yong, L
    Hu, YF
    WEB TECHNOLOGIES RESEARCH AND DEVELOPMENT - APWEB 2005, 2005, 3399 : 394 - 404
  • [47] Mining frequent patterns with the pattern tree
    Huang, H
    Wu, XD
    Relue, R
    NEW GENERATION COMPUTING, 2005, 23 (04) : 315 - 337
  • [48] Frequent tree pattern mining: A survey
    Jimenez, Aida
    Berzal, Fernando
    Cubero, Juan-Carlos
    INTELLIGENT DATA ANALYSIS, 2010, 14 (06) : 603 - 622
  • [49] Pattern mining in frequent dynamic subgraphs
    Borgwardt, Karsten M.
    Kriegel, Hans-Peter
    Wackersreuther, Peter
    ICDM 2006: SIXTH INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2006, : 818 - +
  • [50] Improvisation in Frequent Pattern Mining Technique
    Gajera, Sagar
    Badheka, Manmay
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT 2016, VOL 2, 2017, 469 : 295 - 303