Frequent Pattern Mining on Message Passing Multiprocessor Systems

被引:0
|
作者
Asif Javed
Ashfaq Khokhar
机构
[1] University of Illinois at Chicago,
来源
关键词
frequent pattern mining; parallel processing; association rule; data mining;
D O I
暂无
中图分类号
学科分类号
摘要
Extraction of frequent patterns in transaction-oriented database is crucial to several data mining tasks such as association rule generation, time series analysis, classification, etc. Most of these mining tasks require multiple passes over the database and if the database size is large, which is usually the case, scalable high performance solutions involving multiple processors are required. This paper presents an efficient scalable parallel algorithm for mining frequent patterns on parallel shared nothing platforms. The proposed algorithm is based on one of the best known sequential techniques referred to as Frequent Pattern (FP) Growth algorithm. Unlike most of the earlier parallel approaches based on different variants of the Apriori Algorithm, the algorithm presented in this paper does not explicitly result in having entire counting data structure duplicated on each processor. Furthermore, the proposed algorithm introduces minimum communication (and hence synchronization) overheads by efficiently partitioning the list of frequent elements list over processors. The experimental results show scalable performance over different machine and problem sizes. The comparison of implementation results with existing parallel approaches show significant gains in the speedup. On an 8-processor machine, we report an average speedup of 6 for different problem sizes.
引用
收藏
页码:321 / 334
页数:13
相关论文
共 50 条
  • [31] Effective Algorithm for Frequent Pattern Mining
    Aditya, S. P.
    Hemanth, M.
    Lakshmikanth, C. K.
    Suneetha, K. R.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON IOT AND ITS APPLICATIONS (IEEE ICIOT), 2017,
  • [32] Fault Tolerant Frequent Pattern Mining
    Shohdy, Sameh
    Vishnu, Abhinav
    Agrawal, Gagan
    [J]. PROCEEDINGS OF 2016 IEEE 23RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2016, : 12 - 21
  • [33] On closed constrained frequent pattern mining
    Bonchi, F
    Lucchese, C
    [J]. FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, : 35 - 42
  • [34] A Workflow Frequent Pattern Mining Algorithm
    Zhao, Weidong
    Ye, Mao
    [J]. 2007 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-15, 2007, : 3685 - 3688
  • [35] Mining frequent patterns with the pattern tree
    Hao Huang
    Xindong Wu
    Richard Relue
    [J]. New Generation Computing, 2005, 23 : 315 - 337
  • [36] A New Algorithm for Mining Frequent Pattern
    李力
    靳蕃
    [J]. Railway Engineering Science, 2002, (01) : 10 - 20
  • [37] FSP: Frequent Substructure Pattern mining
    Han, Shuguo
    Ng, Wee Keong
    Yu, Yang
    [J]. 2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 1347 - +
  • [38] SIMPLE LOW-COST MULTIPROCESSOR BASED ON MESSAGE PASSING FIFO LINKS
    GHOSHAL, SK
    GUHA, S
    ARIFF, SM
    RAJARAMAN, V
    [J]. MICROPROCESSORS AND MICROSYSTEMS, 1990, 14 (05) : 297 - 300
  • [39] INTEGRATION OF MESSAGE-PASSING AND SHARED-MEMORY IN THE STANFORD FLASH MULTIPROCESSOR
    HEINLEIN, J
    GHARACHORLOO, K
    DRESSER, S
    GUPTA, A
    [J]. SIGPLAN NOTICES, 1994, 29 (11): : 38 - 50
  • [40] Correcting errors in message passing systems
    Pedersen, JB
    Wagner, A
    [J]. HIGH-LEVEL PARALLEL PROGRAMMING MODELS AND SUPPORTIVE ENVIRONMENTS, PROCEEDINGS, 2001, 2026 : 122 - 137