Scalable parallel algorithm for mining frequent patterns on message passing multiprocessor systems

被引：0

作者：

Javed, A ^{[1
]}

Khokhar, A ^{[1
]}

机构：

[1] Univ Illinois, Dept CS, Chicago, IL 60612 USA

来源：

PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS | 2003年

关键词：

frequent pattern mining; parallel processing; association rule; data mining;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents an efficient scalable parallel algorithm for mining frequent patterns on parallel shared nothing platforms. The proposed algorithm is based on one of the best known sequential techniques referred to as Frequent Pattern (FP) Growth algorithm. Unlike most of the earlier parallel approaches based on different variants of the Apriori Algorithm, the algorithm presented in this paper does not explicitly result in having entire counting data structure duplicated on each processor. Furthermore, the proposed algorithm introduces minimum communication (and hence synchronization) overheads by efficiently partitioning the list of frequent elements list over processors. The experimental results show scalable performance over different machine and problem sizes. The comparison of implementation results with existing parallel approaches show significant gains in the speedup. On an 8-processor machine, we report an average speedup of 6 for different problem sizes.

引用

页码：157 / 162

页数：6

共 50 条

[1] Frequent Pattern Mining on Message Passing Multiprocessor Systems
Asif Javed
Ashfaq Khokhar
Distributed and Parallel Databases, 2004, 16 : 321 - 334
[2] Frequent pattern mining on message passing multiprocessor systems
Javed, A
Khokhar, A
DISTRIBUTED AND PARALLEL DATABASES, 2004, 16 (03) : 321 - 334
[3] A PARALLEL GRAPH PARTITIONING ALGORITHM FOR A MESSAGE-PASSING MULTIPROCESSOR
GILBERT, JR
ZMIJEWSKI, E
LECTURE NOTES IN COMPUTER SCIENCE, 1988, 297 : 498 - 513
[4] A PARALLEL GRAPH PARTITIONING ALGORITHM FOR A MESSAGE-PASSING MULTIPROCESSOR
GILBERT, JR
ZMIJEWSKI, E
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 1987, 16 (06) : 427 - 449
[5] Parallel Heuristic Search Algorithms for Message Passing Multiprocessor Systems
Rajpal, S. P.
Kumar, S.
Cosmetics and Toiletries, 110 (01):
[6] A cost effective scheduling algorithm for message passing multiprocessor systems
Bansal, S
Kumar, P
Singh, K
PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS, 2002, : 47 - 52
[7] Parallel algorithm for mining maximal frequent patterns
Wang, H
Xiao, ZT
Zhang, HJ
Jiang, SY
ADVANCED PARALLEL PROCESSING TECHNOLOGIES, PROCEEDINGS, 2003, 2834 : 241 - 248
[8] Parallel Frequent Patterns Mining Algorithm on GPU
Zhou, Jiayi
Yu, Kun-Ming
Wu, Bin-Chang
2010 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010,
[9] MODELING AND EVALUATION OF A NEW MESSAGE-PASSING SYSTEM FOR PARALLEL MULTIPROCESSOR SYSTEMS
AZARIA, H
ELOVICI, Y
PARALLEL COMPUTING, 1993, 19 (06) : 633 - 649
[10] A SPACE-EFFICIENT PARALLEL SEQUENCE COMPARISON ALGORITHM FOR A MESSAGE-PASSING MULTIPROCESSOR
HUANG, XQ
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 1989, 18 (03) : 223 - 239

← 1 2 3 4 5 →