SDFP-Growth Algorithm as a Novelty of Association Rule Mining Optimization

Authors
Siswanto, Boby [1]
Soeparno, Haryono [1]
Sianipar, Nesti Fronika [2, 3]
Budiharto, Widodo [4]
Affiliations
[1] Bina Nusantara Univ, Comp Sci Dept, BINUS Grad Program Doctor Comp Sci, South Jakarta 11480, Indonesia
[2] Bina Nusantara Univ, Fac Engn, Biotechnol Dept, South Jakarta 11480, Indonesia
[3] Bina Nusantara Univ, Food Biotechnol Res Ctr, South Jakarta 11480, Indonesia
[4] Bina Nusantara Univ, Sch Comp Sci, Comp Sci Dept, South Jakarta 11480, Indonesia
Source
IEEE Access | 2024 / Vol. 12
Keywords
Optimization; Itemsets; Data mining; Memory management; Computer science; Set theory; Program processors; Association rule mining; SDFP-growth algorithm; dimensionality reduction; optimization; FP-tree pruning;
DOI
10.1109/ACCESS.2024.3361667
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
An essential element of association rules is the strong confidence values that depend on the support value threshold, which determines the optimum number of datasets. The existing method for determining the support value threshold is manual trial and error: the user picks a support value such as 10%, 30%, or 60% by instinct. If the support value threshold is inappropriate, it produces useless frequent patterns, overburdens computer resources, and wastes time. The formula for predicting the maximum count of frequent patterns is $2^{n} - 1$, where $n$ is the number of distinct items in the dataset. This paper proposes a new SDFP-growth algorithm that does not require manual determination of the support threshold value. The SDFP-growth algorithm performs dimensionality reduction on the original dataset, generating smaller Level 1 and Level 2 datasets and thus automatically producing a dataset with an optimum amount of data with a minimum support value threshold. The proposed formula for predicting the maximum number of frequent patterns becomes $2^{|A|} - 1$, where $|A|$ is always smaller than $n$. Experiments were performed on five datasets; they reduced the number of data dimensions by more than 3% on the Level 1 dataset and more than 69% on the Level 2 dataset while maintaining the confidence value of the strong rules. In the execution time evaluation, we found an optimization of more than 2% on the Level 1 dataset and more than 94% on the Level 2 dataset.
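The upper bound quoted in the abstract can be checked with a few lines of Python. This is only a sketch of the counting argument, not the SDFP-growth algorithm itself: the item names are hypothetical, and treating |A| as the number of distinct items left after reduction is an assumption, since the abstract does not define A.

```python
from itertools import combinations


def max_frequent_patterns(num_items: int) -> int:
    """Upper bound on frequent patterns: one per non-empty subset of the items."""
    return 2 ** num_items - 1


def enumerate_itemsets(items):
    """List every non-empty itemset that can be formed from the distinct items."""
    ordered = sorted(items)
    return [
        combo
        for size in range(1, len(ordered) + 1)
        for combo in combinations(ordered, size)
    ]


# Hypothetical toy data: n = 4 distinct items before reduction,
# and an assumed |A| = 3 items remaining after reduction.
original_items = {"bread", "milk", "eggs", "tea"}
reduced_items = {"bread", "milk", "eggs"}

# Brute-force enumeration agrees with the closed-form bound.
assert len(enumerate_itemsets(original_items)) == max_frequent_patterns(4)

print(max_frequent_patterns(len(original_items)))  # 15
print(max_frequent_patterns(len(reduced_items)))   # 7
```

Because the bound is exponential, removing even one item roughly halves the number of candidate patterns, which is why a reduction in distinct items matters more than the raw percentage suggests.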
Pages: 21491-21502
Page count: 12
Related papers
50 in total
  • [21] Association Rule Mining for the Infrared Countermeasure by the PF-Growth Algorithm
    Xu Yang
    Fang Yang-Wang
    Wu You-Li
    Zhang Dan-Xu
    Huang Chen
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 8043 - 8048
  • [22] Optimization of association rule mining queries
    Jeudy, Baptiste
    Boulicaut, Jean-François
    Intelligent Data Analysis, 2002, 6 (04) : 341 - 357
  • [23] A Hybrid Bees Swarm Optimization and Tabu Search Algorithm for Association Rule Mining
    Djenouri, Youcef
    Drias, Habiba
    Chemchem, Amine
    2013 WORLD CONGRESS ON NATURE AND BIOLOGICALLY INSPIRED COMPUTING (NABIC), 2013, : 120 - 125
  • [24] Optimization of Association Rule Mining A Two Step Breakdown Variation of Apriori Algorithm
    Fatah, Polla A.
    Hamarash, Ibrahim
    2015 Internet Technologies and Applications (ITA) Proceedings of the Sixth International Conference (ITA 15), 2015, : 275 - 280
  • [25] Securing association rule mining with FP growth algorithm in horizontally partitioned database
    Patil, Vaishali
    Vasappanavara, Ramesh
    Ghorpade, Tushar
    2016 2ND IEEE INTERNATIONAL CONFERENCE ON CONTROL, COMPUTING, COMMUNICATION AND MATERIALS (ICCCCM), 2016,
  • [26] Algorithm for mining fuzzy association rule based on TD-FP-growth
    Huo, Wei-Gang
    Shao, Xiu-Li
    Kongzhi yu Juece/Control and Decision, 2009, 24 (10) : 1504 - 1508
  • [27] Association Rule Mining Based on Bat Algorithm
    Heraguemi, Kamel Eddine
    Kamel, Nadjet
    Drias, Habiba
    BIO-INSPIRED COMPUTING - THEORIES AND APPLICATIONS, BIC-TA 2014, 2014, 472 : 182 - 186
  • [28] A Bacterial Colony Algorithm for Association Rule Mining
    da Cunha, Danilo Souza
    Xavier, Rafael Silveira
    de Castro, Leandro Nunes
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2015, 2015, 9375 : 96 - 103
  • [29] Algorithm of Mining Association Rule Based on Matrix
    Lin, Zi-zhi
    Shu, Si-Hui
    Ding, Yun
    APPLIED SCIENCE, MATERIALS SCIENCE AND INFORMATION TECHNOLOGIES IN INDUSTRY, 2014, 513-517 : 786 - 791
  • [30] Based On The Possibility Of An Association Rule Mining Algorithm
    Xu, Zhi-Wei
    Zhang, Xue-Feng
    Zhang, Hai-Wang
    WKDD: 2009 SECOND INTERNATIONAL WORKSHOP ON KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, : 187 - +