Tree-based partitioning of data for association rule mining

被引:12
|
作者
Ahmed, Shakil [1 ]
Coenen, Frans [1 ]
Leng, Paul [1 ]
机构
[1] Univ Liverpool, Dept Comp Sci, Liverpool L69 3BX, Merseyside, England
关键词
association rules; partial support; data structures; set-enumeration tree;
D O I
10.1007/s10115-006-0010-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The most computationally demanding aspect of Association Rule Mining is the identification and counting of support of the frequent sets of items that occur together sufficiently often to be the basis of potentially interesting rules. The task increases in difficulty with the scale of the data and also with its density. The greatest challenge is posed by data that is too large to be contained in primary memory, especially when high data density and/or low support thresholds give rise to very large numbers of candidates that must be counted. In this paper, we consider strategies for partitioning the data to deal effectively with such cases. We describe a partitioning approach which organises the data into tree structures that can be processed independently. We present experimental results that show the method scales well for increasing dimensions of data and performs significantly better than alternatives, especially when dealing with dense data and low support thresholds.
引用
收藏
页码:315 / 331
页数:17
相关论文
共 50 条
  • [1] Tree-based partitioning of date for association rule mining
    Shakil Ahmed
    Frans Coenen
    Paul Leng
    [J]. Knowledge and Information Systems, 2006, 10 : 315 - 331
  • [2] A new tree-based approach for evaluating rule antecedent constraint in confabulation based association rule mining
    Soltani, Azadeh
    Akbarzadeh-T, Mohammad-R.
    [J]. INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2015, 19 (01) : 1 - 14
  • [3] Strategies for partitioning data in association rule mining
    Ahmed, S
    Coenen, R
    Leng, P
    [J]. RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XX, 2004, : 127 - 139
  • [4] A tree partitioning method for memory management in association rule mining
    Ahmed, S
    Coenen, F
    Leng, P
    [J]. DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2004, 3181 : 331 - 340
  • [5] Data Mining with a Tree-Based Scan Statistic
    Brown, Jeffrey S.
    Dashevsky, Inna
    Fireman, Bruce
    Herrinton, Lisa
    McClure, David
    Murphy, Michael
    Raebel, Marsha
    Sturtevant, Jessica
    Kulldorff, Martin
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2011, 20 : S331 - S331
  • [6] A Novel Data Partitioning Approach for Association Rule Mining on Grids
    Tlili, Raja
    Slimani, Yahya
    [J]. INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2012, 5 (04): : 1 - 20
  • [7] A novel data partitioning approach for association rule mining on grids
    [J]. Tlili, R. (raja_tlili@yahoo.fr), 1600, Science and Engineering Research Support Society, 20 Virginia Court, Sandy Bay, Tasmania, Prof B.H.Kang's Office,, Australia (05):
  • [8] A prefix tree-based model for mining association rules from quantitative temporal data
    Huang, YP
    Kao, LJ
    Sandnes, FE
    [J]. INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOL 1-4, PROCEEDINGS, 2005, : 158 - 163
  • [9] Tree-Based Contrast Subspace Mining for Categorical Data
    Sia, Florence
    Alfred, Rayner
    Lim, Yuto
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2020, 13 (01) : 1714 - 1722
  • [10] Tree-Based Contrast Subspace Mining for Categorical Data
    Florence Sia
    Rayner Alfred
    Yuto Lim
    [J]. International Journal of Computational Intelligence Systems, 2020, 13 : 1714 - 1722