An Efficient Approach for Mining Association Rules from Sparse and Dense Databases

被引:0
|
作者
Vu, Lan [1 ]
Alaghband, Gita [1 ]
机构
[1] Univ Colorado, Dept Comp Sci & Engn, Denver, CO 80202 USA
关键词
data mining; frequent pattern mining; association rule mining; frequent itemset; transactional database;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Association rule mining (ARM) is an important task in data mining. This task is computationally intensive and requires large memory usage. Many existing methods for ARM perform efficiently on either sparse or dense data but not both. We address this issue by presenting a new approach for ARM that runs fast for both sparse and dense databases by detecting the characteristic of data subsets in database and applying a combination of two mining strategies: one is for the sparse data subsets and the other is for the dense ones. Two algorithms, FEM and DFEM, based on our approach are introduced in this paper. FEM applies a fixed threshold as the condition for switching between the two mining strategies while DFEM adopts this threshold dynamically at runtime to best fit the characteristics of the database during the mining process, especially when minimum support threshold is low. Additionally, we present optimization techniques for the proposed algorithms to speed up the mining process, reduce the memory usage and optimize the UO cost. We also analyze in-depth the performance of FEM and DFEM and compare them with several existing algorithms. The experimental results show that FEM and DFEM achieve a significant improvement in execution time and consume less memory than many popular ARM algorithms including the well-known Apriori, FP-growth and Eclat on both sparse and dense databases.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Efficient Algorithms for Mining Frequent Patterns from Sparse and Dense Databases
    Vu, Lan
    Alaghband, Gita
    [J]. JOURNAL OF INTELLIGENT SYSTEMS, 2015, 24 (02) : 181 - 197
  • [2] An Efficient Approach for Mining Positive and Negative Association Rules from Large Transactional Databases
    Kishor, Peddi
    Porika, Sammulal
    [J]. 2016 INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT), VOL 1, 2016, : 85 - 89
  • [3] An efficient sampling approach for mining all association rules in large databases
    Department of Computer Science and Engineering, Shiraz University, Shiraz, Iran
    [J]. Iran. J. Electr. Comput. Eng., 2008, 1 (73-78):
  • [4] Efficient mining of association rules in text databases
    Holt, JD
    Chung, SM
    [J]. PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON INFORMATION KNOWLEDGE MANAGEMENT, CIKM'99, 1999, : 234 - 242
  • [5] Efficient mining of association rules in distributed databases
    Cheung, DW
    Ng, VT
    Fu, AW
    Fu, YJ
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1996, 8 (06) : 911 - 922
  • [6] An efficient graph-based approach to mining association rules for large databases
    Department of Computer Science and Engineering, National Sun Yat-Sen University, No. 70 Lienhai Rd., Kaohsiung 80424, Taiwan
    [J]. Int. J. Intell. Inf. Database Syst., 2009, 3 (259-274): : 259 - 274
  • [7] An Efficient Framework for Mining Association Rules in the Distributed Databases
    Goyal, Lalit Mohan
    Beg, M. M. Sufyan
    Ahmad, Tanvir
    [J]. COMPUTER JOURNAL, 2018, 61 (05): : 645 - 657
  • [8] Efficient mining of categorized association rules in large databases
    Tseng, SM
    [J]. SMC 2000 CONFERENCE PROCEEDINGS: 2000 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOL 1-5, 2000, : 3606 - 3610
  • [9] Evolutionary approach for mining association rules on dynamic databases
    Shenoy, PD
    Srinivasa, KG
    Venugopal, KR
    Patnaik, LM
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2003, 2637 : 325 - 336
  • [10] Mining association rules from biological databases
    Rodríguez, A
    Carazo, JM
    Trelles, O
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2005, 56 (05): : 493 - 504