Scalable parallel algorithm for mining frequent patterns on message passing multiprocessor systems

Cited by: 0
Authors
Javed, A [1]
Khokhar, A [1]
Affiliation
[1] Univ Illinois, Dept CS, Chicago, IL 60612 USA
Keywords
frequent pattern mining; parallel processing; association rule; data mining
DOI
Not available
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
This paper presents an efficient, scalable parallel algorithm for mining frequent patterns on parallel shared-nothing platforms. The proposed algorithm is based on one of the best-known sequential techniques, the Frequent Pattern (FP) Growth algorithm. Unlike most earlier parallel approaches, which are based on variants of the Apriori algorithm, the algorithm presented in this paper does not explicitly duplicate the entire counting data structure on each processor. Furthermore, the proposed algorithm keeps communication (and hence synchronization) overhead to a minimum by efficiently partitioning the list of frequent elements over processors. The experimental results show scalable performance over different machine and problem sizes. A comparison of the implementation results with existing parallel approaches shows significant gains in speedup. On an 8-processor machine, we report an average speedup of 6 for different problem sizes.
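The idea of partitioning the list of frequent elements over processors can be illustrated with a minimal sketch. The abstract does not say how the list is actually split, so the round-robin assignment below is an assumption made purely for illustration, and the names `frequent_items` and `partition_round_robin` are hypothetical. The sketch runs in a single process and only shows which items each processor rank would own; it does not build FP-trees or exchange messages.

```python
from collections import Counter


def frequent_items(transactions, min_support):
    """Return items whose support count is at least min_support,
    ordered by descending support (ties broken alphabetically)."""
    # Count each item once per transaction.
    counts = Counter(item for t in transactions for item in set(t))
    frequent = [(item, c) for item, c in counts.items() if c >= min_support]
    frequent.sort(key=lambda ic: (-ic[1], ic[0]))
    return [item for item, _ in frequent]


def partition_round_robin(items, num_procs):
    """Assign items to processor ranks in round-robin order.
    ASSUMPTION: the paper's actual partitioning scheme is not given
    in the abstract; round-robin is used here only as an example."""
    return [items[rank::num_procs] for rank in range(num_procs)]


transactions = [
    ["a", "b", "c"],
    ["a", "b"],
    ["a", "c", "d"],
    ["b", "c"],
    ["a", "b", "c", "e"],
]
items = frequent_items(transactions, min_support=2)
parts = partition_round_robin(items, num_procs=2)
for rank, owned in enumerate(parts):
    print(f"rank {rank} owns items {owned}")
```

In a shared-nothing setting, each rank would then mine only the patterns associated with the items it owns, which is one plausible way to avoid replicating the whole counting structure on every processor.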
Pages: 157-162
Page count: 6