Incremental frequent itemsets mining based on frequent pattern tree and multi-scale

被引:20
|
作者
Xun, Yaling [1 ]
Cui, Xiaohui [1 ]
Zhang, Jifu [1 ]
Yin, Qingxia [1 ]
机构
[1] Taiyuan Univ Sci & Technol, Sch Comp Sci & Technol, Taiyuan 030024, Peoples R China
关键词
Frequent itemsets mining; Multi-scale; Incremental mining; Frequent pattern tree; Association rules; ASSOCIATION RULES; ALGORITHM; MAINTENANCE;
D O I
10.1016/j.eswa.2020.113805
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-scale can reveal the structure and hierarchical characteristics of the data objects to reflect their essence from different perspectives and levels. An incremental frequent itemsets mining algorithm based on frequent pattern tree is proposed by incorporating multi-scale theory(simplified to FP-tree and Multi-Scale based Incremental Mining, FPMSIM). FPMSIM uses the classic FP-Growth to construct a pattern tree and generate frequent itemsets for more fine-grained dataset which is called benchmark scale dataset. The newly added dataset is also independently mined as a benchmark scale dataset. The ultimate frequent itemsets for the target scales are derived by means of the scale-up process. In which, some unknown itemsets counts need to be estimated by comparing the similarity among benchmark scale datasets. In this way, severe dataset rescanning and tree structure adjustment overhead are avoided during the maintenance process. The experimental results show that although the support estimation error will lead to incomplete frequent itemsets mining, it can be offset by the performance gains in the mining efficiency and I/O cost, especially in the field of big data.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] A Novel Incremental Algorithm for Frequent Itemsets Mining in Dynamic Datasets
    Hernandez-Leon, Raudel
    Hernandez-Palancar, Jose
    Carrasco-Ochoa, J. A.
    Martinez-Trinidad, J. Fco
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2008, 5197 : 145 - +
  • [32] TDUP: an approach to incremental mining of frequent itemsets with three-way-decision pattern updating
    Li, Yao
    Zhang, Zhi-Heng
    Chen, Wen-Bin
    Min, Fan
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2017, 8 (02) : 441 - 453
  • [33] TDUP: an approach to incremental mining of frequent itemsets with three-way-decision pattern updating
    Yao Li
    Zhi-Heng Zhang
    Wen-Bin Chen
    Fan Min
    [J]. International Journal of Machine Learning and Cybernetics, 2017, 8 : 441 - 453
  • [34] Frequent tree pattern mining: A survey
    Jimenez, Aida
    Berzal, Fernando
    Cubero, Juan-Carlos
    [J]. INTELLIGENT DATA ANALYSIS, 2010, 14 (06) : 603 - 622
  • [35] Mining frequent patterns with the pattern tree
    Huang, H
    Wu, XD
    Relue, R
    [J]. NEW GENERATION COMPUTING, 2005, 23 (04) : 315 - 337
  • [36] Discovery of Frequent Itemsets: Frequent Item Tree-Based Approach
    Kumar, A. V. Senthil
    Wahidabanu, R. S. D.
    [J]. JOURNAL OF ICT RESEARCH AND APPLICATIONS, 2007, 1 (01) : 42 - 55
  • [37] Mining frequent patterns with the pattern tree
    Hao Huang
    Xindong Wu
    Richard Relue
    [J]. New Generation Computing, 2005, 23 : 315 - 337
  • [38] Mining updated frequent itemsets based on directed itemsets graph
    Wen Lei
    Li Min-qiang
    [J]. Proceedings of 2004 Chinese Control and Decision Conference, 2004, : 690 - 693
  • [39] The Algorithm of Mining Frequent Itemsets Based on MapReduce
    He, Bo
    [J]. PROCEEDINGS OF INTERNATIONAL CONFERENCE ON SOFT COMPUTING TECHNIQUES AND ENGINEERING APPLICATION, ICSCTEA 2013, 2014, 250 : 529 - 534
  • [40] Mining maximum frequent itemsets based on directed itemsets graph
    Wen Lei
    [J]. PROCEEDINGS OF 2004 CHINESE CONTROL AND DECISION CONFERENCE, 2004, : 681 - 683