EIFDD: An efficient approach for erasable itemset mining of very dense datasets

被引:19
|
作者
Giang Nguyen [1 ]
Tuong Le [2 ,3 ]
Bay Vo [2 ,3 ]
Bac Le [4 ]
机构
[1] Ho Chi Minh City Univ Technol, Fac Informat Technol, Ho Chi Minh City, Vietnam
[2] Ton Duc Thang Univ, Div Data Sci, Ho Chi Minh City, Vietnam
[3] Ton Duc Thang Univ, Fac Informat Technol, Ho Chi Minh City, Vietnam
[4] VNU, Univ Sci, Fac Informat Technol, Ho Chi Minh City, Vietnam
关键词
Pattern mining; Erasable itemset; Subsume concept; Dense datasets; K FREQUENT PATTERNS; ASSOCIATION RULES; NC-SETS; ALGORITHM; STREAMS;
D O I
10.1007/s10489-014-0644-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Erasable itemset mining, first proposed in 2009, is an interesting problem in supply chain optimization. The dPidset structure, a very effective structure for mining erasable itemsets, was introduced in 2014. The dPidset structure outperforms previous structures such as PID_List and NC_Set. Algorithms based on dPidset can effectively mine erasable itemsets. However, for very dense datasets, the mining time and memory usage are large. Therefore, this paper proposes an effective approach that uses the subsume concept for mining erasable itemsets for very dense datasets. The subsume concept is used to help early determine the information of a large number of erasable itemsets without the usual computational cost. Then, the erasable itemsets for very dense datasets (EIFDD) algorithm, which uses the subsume concept and the dPidset structure for the erasable itemset mining of very dense datasets, is proposed. An illustrative example is given to demonstrate the proposed algorithm. Finally, an experiment is conducted to show the effectiveness of EIFDD.
引用
收藏
页码:85 / 94
页数:10
相关论文
共 50 条
  • [41] Efficient mining of class association rules with the itemset constraint
    Dang Nguyen
    Nguyen, Loan T. T.
    Vo, Bay
    Pedrycz, Witold
    KNOWLEDGE-BASED SYSTEMS, 2016, 103 : 73 - 88
  • [42] Probabilistic Support Prediction: Fast Frequent Itemset Mining in Dense Data
    Sadeequllah, Muhammad
    Rauf, Azhar
    Rehman, Saif Ur
    Alnazzawi, Noha
    IEEE ACCESS, 2024, 12 : 39330 - 39350
  • [43] Privacy-Preserving Frequent Itemset Mining for Sparse and Dense Data
    Laud, Peeter
    Pankova, Alisa
    SECURE IT SYSTEMS, NORDSEC 2017, 2017, 10674 : 139 - 155
  • [44] Fuzzy Association Rule Mining Algorithm for Fast and Efficient Performance on Very Large Datasets
    Mangalampalli, Ashish
    Pudi, Vikram
    2009 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 2009, : 1163 - 1168
  • [45] DOLPHIN: An Efficient Algorithm for Mining Distance-Based Outliers in Very Large Datasets
    Angiulli, Fabrizio
    Fassetti, Fabio
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2009, 3 (01)
  • [46] An efficient approach for incremental erasable utility pattern mining from non-binary data
    Baek, Yoonji
    Kim, Hanju
    Cho, Myungha
    Kim, Hyeonmo
    Lee, Chanhee
    Ryu, Taewoong
    Kim, Heonho
    Vo, Bay
    Gan, Vincent W.
    Fournier-Viger, Philippe
    Lin, Jerry Chun-Wei
    Pedrycz, Witold
    Yun, Unil
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (10) : 5919 - 5958
  • [47] CC-IFIM: an efficient approach for incremental frequent itemset mining based on closed candidates
    Maged Magdy
    Fayed F. M. Ghaleb
    Dawlat A. El A. Mohamed
    Wael Zakaria
    The Journal of Supercomputing, 2023, 79 : 7877 - 7899
  • [48] An Efficient Outlier Detection Approach Over Uncertain Data Stream Based on Frequent Itemset Mining
    Hao, Shangbo
    Cai, Saihua
    Sun, Ruizhi
    Li, Sicong
    INFORMATION TECHNOLOGY AND CONTROL, 2019, 48 (01): : 34 - 46
  • [49] CC-IFIM: an efficient approach for incremental frequent itemset mining based on closed candidates
    Magdy, Maged
    Ghaleb, Fayed F. M.
    Mohamed, Dawlat A. El A.
    Zakaria, Wael
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (07): : 7877 - 7899
  • [50] New parallel algorithms for frequent itemset mining in very large databases
    Veloso, A
    Meira, W
    Parthasarathy, S
    15TH SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING, PROCEEDINGS, 2003, : 158 - 166