A high-performance distributed algorithm for mining association rules

被引:0
|
作者
Assaf Schuster
Ran Wolff
Dan Trock
机构
[1] Technion—Israel Institute of Technology,Department of Computer Science
[2] Technion—Israel Institute of Technology,Department of Electrical Engineering
来源
关键词
Association rule; Data mining; Distributed data mining; High-performance computing;
D O I
暂无
中图分类号
学科分类号
摘要
We present a new distributed association rule mining (D-ARM) algorithm that demonstrates superlinear speed-up with the number of computing nodes. The algorithm is the first D-ARM algorithm to perform a single scan over the database. As such, its performance is unmatched by any previous algorithm. Scale-up experiments over standard synthetic benchmarks demonstrate stable run time regardless of the number of computers. Theoretical analysis reveals a tighter bound on error probability than the one shown in the corresponding sequential algorithm. As a result of this tighter bound and by utilizing the combined memory of several computers, the algorithm generates far fewer candidates than comparable sequential algorithms—the same order of magnitude as the optimum.
引用
收藏
页码:458 / 475
页数:17
相关论文
共 50 条
  • [21] A privacy-preserving mining algorithm of association rules in distributed databases
    Liu, Jie
    Piao, Xiufeng
    Huang, Shaobin
    FIRST INTERNATIONAL MULTI-SYMPOSIUMS ON COMPUTER AND COMPUTATIONAL SCIENCES (IMSCCS 2006), PROCEEDINGS, VOL 2, 2006, : 746 - +
  • [22] Distributed algorithm for mining association rules based on FP-tree
    He, Bo
    Kongzhi yu Juece/Control and Decision, 2012, 27 (04): : 618 - 622
  • [23] An algorithm research for distributed association rules mining with constraints based on sampling
    Li, Hong
    Chen, Song-qiao
    Du, Jian-feng
    Yi, Li-jun
    Xiao, Wei
    PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS, VOLS 1 AND 2, 2006, : 478 - 483
  • [24] Distributed Data Access Control Algorithm Using Mining Association Rules
    Rajkumar, N.
    Sivanandam, S. N.
    Thomas, J. Stanly
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2008, 8 (08): : 306 - 311
  • [25] Association rules mining algorithm
    Bhowmik, R
    Proceedings of the ISCA 20th International Conference on Computers and Their Applications, 2005, : 86 - 90
  • [26] Mining of association rules in distributed database
    Li, Shijun
    Zheng, Peng
    Zhou, Dongru
    Wuhan Shuili Dianli Daxue Xuebao/Journal of Wuhan University of Hydraulic and Electric Engineering, 1999, 32 (06): : 91 - 93
  • [27] Mining Association Rules in Distributed System
    Li, Zou
    Xu, Liang
    PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND COMPUTER SCIENCE, VOL II, 2009, : 1051 - 1054
  • [28] Performance evaluation of distributed algorithms for mining association rules on workstation cluster
    Shimomura, T
    Shibusawa, S
    2000 INTERNATIONAL WORKSHOPS ON PARALLEL PROCESSING, PROCEEDINGS, 2000, : 361 - 368
  • [29] A High-Performance Algorithm for Mining Repeating Patterns
    Su, Ja-Hwung
    Hong, Tzung-Pei
    Chin, Chu-Yu
    Liao, Zhi-Feng
    Cheng, Shyr-Yuan
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2017, PT I, 2017, 10191 : 631 - 640
  • [30] High-Performance Biomedical Association Mining with MapReduce
    Ji, Yanqing
    Tian, Yun
    Shen, Fangyang
    Tran, John
    2015 12TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY - NEW GENERATIONS, 2015, : 465 - 470