An Efficient Approach for Mining Association Rules from Sparse and Dense Databases

Cited by: 0
Authors
Vu, Lan [1 ]
Alaghband, Gita [1 ]
Affiliations
[1] Univ Colorado, Dept Comp Sci & Engn, Denver, CO 80202 USA
Keywords
data mining; frequent pattern mining; association rule mining; frequent itemset; transactional database;
DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Association rule mining (ARM) is an important task in data mining. It is computationally intensive and requires a large amount of memory. Many existing ARM methods perform efficiently on either sparse or dense data, but not both. We address this issue by presenting a new approach to ARM that runs fast on both sparse and dense databases by detecting the characteristics of data subsets in the database and applying a combination of two mining strategies: one for sparse data subsets and the other for dense ones. Two algorithms based on our approach, FEM and DFEM, are introduced in this paper. FEM applies a fixed threshold as the condition for switching between the two mining strategies, while DFEM adjusts this threshold dynamically at runtime to best fit the characteristics of the database during the mining process, especially when the minimum support threshold is low. Additionally, we present optimization techniques for the proposed algorithms to speed up the mining process, reduce memory usage, and optimize the I/O cost. We also analyze the performance of FEM and DFEM in depth and compare them with several existing algorithms. The experimental results show that FEM and DFEM achieve a significant improvement in execution time and consume less memory than many popular ARM algorithms, including the well-known Apriori, FP-growth, and Eclat, on both sparse and dense databases.
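The idea of switching between two mining strategies at a fixed threshold can be illustrated with a minimal sketch. This is not the authors' FEM or DFEM implementation: the abstract does not say which two strategies are combined or how a data subset is classified, so the sketch assumes a horizontal projection-and-count strategy for larger subsets and a vertical, Eclat-style transaction-id intersection for smaller ones, and it assumes the switch is decided by the number of transactions in the projected subset. All names (`mine_hybrid`, `switch_threshold`, `_horizontal`, `_vertical`, `_eclat`) are hypothetical.

```python
from collections import defaultdict


def mine_hybrid(transactions, min_support, switch_threshold):
    """Return {frozenset(itemset): support} for all frequent itemsets.

    Assumption (not from the paper): a projected subset with at most
    `switch_threshold` transactions is mined vertically, larger ones
    are mined horizontally.
    """
    out = {}
    db = [frozenset(t) for t in transactions]
    _horizontal(db, frozenset(), min_support, switch_threshold, out)
    return out


def _horizontal(db, prefix, min_support, switch_threshold, out):
    # Count item occurrences by scanning the (projected) transactions.
    counts = defaultdict(int)
    for t in db:
        for item in t:
            counts[item] += 1
    frequent = sorted(i for i, c in counts.items() if c >= min_support)
    for idx, item in enumerate(frequent):
        new_prefix = prefix | {item}
        out[new_prefix] = counts[item]
        # Project onto transactions containing `item`, keeping only items
        # that come later in the order so each itemset is found once.
        later = set(frequent[idx + 1:])
        projected = [t & later for t in db if item in t]
        projected = [t for t in projected if t]
        if not projected:
            continue
        if len(projected) <= switch_threshold:
            _vertical(projected, new_prefix, min_support, out)
        else:
            _horizontal(projected, new_prefix, min_support,
                        switch_threshold, out)


def _vertical(db, prefix, min_support, out):
    # Build one transaction-id set per item, then mine by intersection.
    tidsets = defaultdict(set)
    for tid, t in enumerate(db):
        for item in t:
            tidsets[item].add(tid)
    items = sorted(
        ((i, tids) for i, tids in tidsets.items() if len(tids) >= min_support),
        key=lambda pair: pair[0],
    )
    _eclat(items, prefix, min_support, out)


def _eclat(items, prefix, min_support, out):
    for idx, (item, tids) in enumerate(items):
        new_prefix = prefix | {item}
        out[new_prefix] = len(tids)
        extensions = []
        for other, other_tids in items[idx + 1:]:
            common = tids & other_tids
            if len(common) >= min_support:
                extensions.append((other, common))
        if extensions:
            _eclat(extensions, new_prefix, min_support, out)


# Example call on a toy database (illustrative data, not from the paper).
toy_db = [["a", "b", "c"], ["a", "b"], ["a", "c"], ["b", "c"], ["a", "b", "c"]]
frequent_itemsets = mine_hybrid(toy_db, min_support=3, switch_threshold=3)
```

Both strategies produce the same frequent itemsets, so the threshold affects only cost, not results. A dynamic variant in the spirit of DFEM would update `switch_threshold` during mining; how the paper does that is not stated in the abstract and is not modeled here.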
Pages: 8
Related papers
50 in total
  • [21] Mining similar association rules from transaction databases
    Wang, SL
    Kuo, CY
    Hong, TP
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION ENGINEERING SYSTEMS & ALLIED TECHNOLOGIES, PTS 1 AND 2, 2001, 69 : 486 - 489
  • [22] Parallel mining of association rules from text databases
    John D. Holt
    Soon M. Chung
    [J]. The Journal of Supercomputing, 2007, 39 : 273 - 299
  • [23] An efficient interestingness based algorithm for mining association rules in medical databases
    Wasan, Siri Krishan
    Bhatnagar, Vasudha
    Kaur, Harleen
    [J]. ADVANCES AND INNOVATIONS IN SYSTEMS, COMPUTING SCIENCES AND SOFTWARE ENGINEERING, 2007, : 167 - +
  • [24] An Efficient Incremental Maintenance for Association Rules Mining Based on Distributed Databases
    Darwish, Mahmoud
    Elgohery, Rania
    Badr, Nagwa
    Faheem, Hossam
    [J]. 2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 404 - 408
  • [25] Mining Fuzzy Association Rules in Databases
    Kuok, Chan Man
    Fu, Ada
    Wong, Man Hon
    [J]. SIGMOD Record (ACM Special Interest Group on Management of Data), 1998, 27 (01): : 41 - 46
  • [26] Mining association rules in temporal databases
    Ye, XF
    Keane, JA
    [J]. 1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, : 2803 - 2808
  • [27] Mining dynamic association rules in databases
    Liu, JF
    Rong, G
    [J]. COMPUTATIONAL INTELLIGENCE AND SECURITY, PT 1, PROCEEDINGS, 2005, 3801 : 688 - 695
  • [28] An efficient approach for mining association rules from high utility itemsets
    Sahoo, Jayakrushna
    Das, Ashok Kumar
    Goswami, A.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (13) : 5754 - 5778
  • [29] An efficient algorithm for finding dense regions for mining quantitative association rules
    Lian, W
    Cheung, DW
    Yiu, SM
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2005, 50 (3-4) : 471 - 490
  • [30] An efficient algorithm for mining association rules for large itemsets in large databases: from sequential to parallel
    Wong, AKY
    Wu, SL
    Feng, L
    [J]. ENGINEERING INTELLIGENT SYSTEMS FOR ELECTRICAL ENGINEERING AND COMMUNICATIONS, 2000, 8 (02): : 109 - 117