Parallel mining of maximal frequent itemsets from databases

Cited by: 4
Authors
Chung, SM [1]
Luo, C [1]
Affiliations
[1] Wright State Univ, Dept Comp Sci & Engn, Dayton, OH 45435 USA
Keywords
parallel data mining; maximal frequent itemsets; association rules; scalability
DOI
10.1109/TAI.2003.1250181
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we propose a parallel algorithm for mining maximal frequent itemsets from databases. A frequent itemset is maximal if none of its supersets is frequent. The new parallel algorithm is named Parallel Max-Miner (PMM), and it is a parallel version of the sequential Max-Miner algorithm [3]. Most existing mining algorithms discover the frequent k-itemsets on the kth pass over the databases, and then generate the candidate (k + 1)-itemsets for the next pass. Compared to those level-wise algorithms, PMM looks ahead at each pass and prunes more candidate itemsets by checking the frequencies of their supersets. We implemented PMM on a cluster of workstations, and evaluated its performance for various cases. PMM demonstrated better performance than other sequential and parallel algorithms, and its performance is quite scalable, even when there are large maximal frequent itemsets (i.e., long patterns) in databases.
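The definition of a maximal frequent itemset given in the abstract can be illustrated with a small brute-force sketch. This is not PMM or Max-Miner: it enumerates candidates level by level, as the level-wise algorithms mentioned above do, and then keeps only the frequent itemsets that have no frequent proper superset. The function names, the toy transactions, and the absolute `min_support` threshold are all assumptions made for illustration.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Return every itemset contained in at least min_support transactions.

    Brute-force, level-wise enumeration for illustration only.
    """
    items = sorted(set().union(*transactions))
    frequent = set()
    for k in range(1, len(items) + 1):
        found_any = False
        for candidate in combinations(items, k):
            itemset = frozenset(candidate)
            # Support = number of transactions that contain the itemset.
            support = sum(1 for t in transactions if itemset <= t)
            if support >= min_support:
                frequent.add(itemset)
                found_any = True
        if not found_any:
            # No frequent k-itemset implies no frequent (k + 1)-itemset.
            break
    return frequent


def maximal_frequent_itemsets(transactions, min_support):
    """Keep only frequent itemsets with no frequent proper superset."""
    frequent = frequent_itemsets(transactions, min_support)
    return {s for s in frequent if not any(s < other for other in frequent)}


# Hypothetical toy database of five transactions.
transactions = [
    frozenset({"a", "b", "c"}),
    frozenset({"a", "b", "c"}),
    frozenset({"a", "b"}),
    frozenset({"c", "d"}),
    frozenset({"c", "d"}),
]

result = maximal_frequent_itemsets(transactions, min_support=2)
```

With these transactions and a minimum support of 2, nine itemsets are frequent but only {a, b, c} and {c, d} are maximal, which shows why reporting maximal itemsets gives a much more compact summary when long patterns are present.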
Pages: 134-139
Page count: 6
Related papers
50 in total
  • [1] Mining Maximal Frequent Itemsets over Sampling Databases
    Li, Haifeng
    PROCEEDINGS OF THE 2015 2ND INTERNATIONAL FORUM ON ELECTRICAL ENGINEERING AND AUTOMATION (IFEEA 2015), 2016, 54 : 28 - 31
  • [3] Efficient mining of maximal frequent itemsets from databases on a cluster of workstations
    Chung, Soon M.
    Luo, Congnan
    KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 16 (03) : 359 - 391
  • [4] Incremental mining of weighted maximal frequent itemsets from dynamic databases
    Yun, Unil
    Lee, Gangin
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 54 : 304 - 327
  • [5] Distributed mining of maximal frequent itemsets from Databases on a cluster of workstations
    Chung, SM
    Luo, CN
    2004 IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID - CCGRID 2004, 2004, : 499 - 507
  • [6] Parallel mining of maximal frequent itemsets in PC clusters
    Veng, Vong Chan
    IMECS 2008: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2008, : 480 - 485
  • [7] Mining maximal frequent itemsets for large scale transaction databases
    Xia, R
    Yuan, W
    Ding, SC
    Liu, J
    Zhou, HB
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1480 - 1485
  • [8] Mining Frequent Itemsets from Multidimensional Databases
    Bay Vo
    Bac Le
    Nguyen, Thang N.
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2011, PT I, 2011, 6591 : 177 - 186
  • [9] A Novel Parallel Algorithm for Frequent Itemsets Mining in Large Transactional Databases
    Huan Phan
    Bac Le
    ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS (ICDM 2018), 2018, 10933 : 272 - 287
  • [10] Mining Frequent Gradual Itemsets from Large Databases
    Di-Jorio, Lisa
    Laurent, Anne
    Teisseire, Maguelonne
    ADVANCES IN INTELLIGENT DATA ANALYSIS VIII, PROCEEDINGS, 2009, 5772 : 297 - +