Efficient Algorithms for Mining Frequent Patterns from Sparse and Dense Databases

Cited by: 2
Authors
Vu, Lan [1]
Alaghband, Gita [1]
Affiliation
[1] Univ Colorado Denver, Dept Comp Sci & Engn, 1380 Lawrence St, Denver, CO 80204 USA
Keywords
Data mining; frequent pattern mining; association rule mining; frequent itemset; transactional database
DOI
10.1515/jisys-2014-0040
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this article, we present a new approach for frequent pattern mining (FPM) that runs fast on both sparse and dense databases. We also introduce two algorithms based on this approach, FEM and DFEM. FEM applies a fixed threshold as the condition for switching between the two mining strategies, whereas DFEM adapts this threshold dynamically at runtime to best fit the characteristics of the database during the mining process, especially when the minimum support threshold is low. Additionally, we present optimization techniques for the proposed algorithms that speed up the mining process, reduce memory usage, and optimize I/O cost. We also analyze the performance of FEM and DFEM in depth and compare them with several existing algorithms. The experimental results show that FEM and DFEM achieve a significant improvement in execution time and consume less memory than many popular FPM algorithms, including the well-known Apriori, FP-growth, and Eclat.
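The threshold-based switching described in the abstract can be illustrated with a small sketch. The abstract does not name the two mining strategies, so this example substitutes two generic ones as assumptions: projection-based counting on horizontal transactions and Eclat-style tidset intersection. The function names and the `switch_threshold` parameter are hypothetical, and the fixed threshold mirrors only the FEM idea; the runtime adjustment attributed to DFEM is not shown.

```python
def mine_vertical(prefix, tidsets, min_support, out):
    """Eclat-style stand-in: extend `prefix` by intersecting transaction-id sets."""
    items = sorted(tidsets)
    for i, item in enumerate(items):
        tids = tidsets[item]
        itemset = prefix + (item,)
        out[itemset] = len(tids)
        # Conditional tidsets for items that sort after `item`.
        cond = {}
        for other in items[i + 1:]:
            common = tids & tidsets[other]
            if len(common) >= min_support:
                cond[other] = common
        if cond:
            mine_vertical(itemset, cond, min_support, out)


def mine_horizontal(prefix, transactions, min_support, switch_threshold, out):
    """Projection-based stand-in that hands small databases to mine_vertical."""
    counts = {}
    for t in transactions:
        for item in t:
            counts[item] = counts.get(item, 0) + 1
    frequent = sorted(i for i, c in counts.items() if c >= min_support)

    # Assumed switching rule: a fixed cutoff on the projected database size.
    if len(transactions) <= switch_threshold:
        tidsets = {i: set() for i in frequent}
        for tid, t in enumerate(transactions):
            for item in t:
                if item in tidsets:
                    tidsets[item].add(tid)
        mine_vertical(prefix, tidsets, min_support, out)
        return

    for item in frequent:
        itemset = prefix + (item,)
        out[itemset] = counts[item]
        # Project onto transactions containing `item`, keeping later frequent items.
        projected = [
            frozenset(x for x in t if x > item and counts[x] >= min_support)
            for t in transactions
            if item in t
        ]
        projected = [t for t in projected if t]
        if projected:
            mine_horizontal(itemset, projected, min_support, switch_threshold, out)


def mine(transactions, min_support, switch_threshold):
    """Return a dict mapping each frequent itemset (sorted tuple) to its support."""
    out = {}
    db = [frozenset(t) for t in transactions]
    mine_horizontal((), db, min_support, switch_threshold, out)
    return out


db = [["a", "b", "c"], ["a", "b"], ["a", "c"], ["b", "c"], ["a", "b", "c"]]
result = mine(db, min_support=2, switch_threshold=3)
```

Both strategies enumerate the same itemsets, so the threshold changes only which code path does the work, not the result. In this toy version the cutoff is compared against the number of transactions in the projected database; what quantity the published algorithms actually compare is not stated in the abstract.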
Pages: 181-197
Number of pages: 17
Related papers
50 in total
  • [31] A new framework for mining frequent interaction patterns from meeting databases
    Fariha, Anna
    Ahmed, Chowdhury Farhan
    Leung, Carson K.
    Samiullah, Md.
    Pervin, Suraiya
    Cao, Longbing
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 45 : 103 - 118
  • [32] Efficient mining of constrained frequent patterns from streams
    Leung, Carson Kai-Sang
    Khan, Quamrul I.
    [J]. 10TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2006, : 61 - 68
  • [33] Efficient mining of maximal frequent itemsets from databases on a cluster of workstations
    Soon M. Chung
    Congnan Luo
    [J]. Knowledge and Information Systems, 2008, 16 : 359 - 391
  • [34] Efficient mining of maximal frequent itemsets from databases on a cluster of workstations
    Chung, Soon M.
    Luo, Congnan
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 16 (03) : 359 - 391
  • [35] Fast and Efficient Mining of Frequent and Maximal Periodic Patterns in Spatiotemporal Databases for shifted instances
    Obulesu, O.
    Reddy, A. Rama Mohan
    [J]. 2016 IEEE 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (IACC), 2016, : 35 - 40
  • [36] Interactive Mining of Probabilistic Frequent Patterns in Uncertain Databases
    Lin, Ming-Yen
    Fu, Cheng-Tai
    Hsueh, Sue-Chen
    [J]. INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2022, 30 (02) : 263 - 283
  • [37] Towards Efficient Discovery of Periodic-Frequent Patterns in Dense Temporal Databases Using Complements
    Veena, P.
    Tarun, Sreepada
    Kiran, R. Uday
    Dao, Minh-Son
    Zettsu, Koji
    Watanobe, Yutaka
    Zhang, Ji
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2022, PT II, 2022, 13427 : 204 - 215
  • [38] Efficient Mining of Weighted Frequent Itemsets in Uncertain Databases
    Lin, Jerry Chun-Wei
    Gan, Wensheng
    Fournier-Viger, Philippe
    Hong, Tzung-Pei
    [J]. MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION (MLDM 2016), 2016, 9729 : 236 - 250
  • [39] Parallel and Distributed Algorithms for Frequent Pattern Mining in Large Databases
    Tanbeer, Syed Khairuzzaman
    Ahmed, Chowdhury Farhan
    Jeong, Byeong-Soo
    [J]. IETE TECHNICAL REVIEW, 2009, 26 (01) : 55 - 66
  • [40] Mining image frequent patterns based on a frequent pattern list in image databases
    Ye-In Chang
    Jun-Hong Shen
    Chia-En Li
    Zih-Siang Chen
    Ming-Hsuan Tu
    [J]. The Journal of Supercomputing, 2020, 76 : 2597 - 2621