A high-performance distributed algorithm for mining association rules

被引:0
|
作者
Assaf Schuster
Ran Wolff
Dan Trock
机构
[1] Technion—Israel Institute of Technology,Department of Computer Science
[2] Technion—Israel Institute of Technology,Department of Electrical Engineering
来源
关键词
Association rule; Data mining; Distributed data mining; High-performance computing;
D O I
暂无
中图分类号
学科分类号
摘要
We present a new distributed association rule mining (D-ARM) algorithm that demonstrates superlinear speed-up with the number of computing nodes. The algorithm is the first D-ARM algorithm to perform a single scan over the database. As such, its performance is unmatched by any previous algorithm. Scale-up experiments over standard synthetic benchmarks demonstrate stable run time regardless of the number of computers. Theoretical analysis reveals a tighter bound on error probability than the one shown in the corresponding sequential algorithm. As a result of this tighter bound and by utilizing the combined memory of several computers, the algorithm generates far fewer candidates than comparable sequential algorithms—the same order of magnitude as the optimum.
引用
收藏
页码:458 / 475
页数:17
相关论文
共 50 条
  • [1] A high-performance distributed algorithm for mining association rules
    Schuster, A
    Wolff, R
    Trock, D
    KNOWLEDGE AND INFORMATION SYSTEMS, 2005, 7 (04) : 458 - 475
  • [2] A high-performance distributed algorithm for mining association rules
    Schuster, A
    Wolff, R
    Trock, D
    THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2003, : 291 - 298
  • [3] An efficient distributed algorithm for mining association rules
    Zhao, Yan
    Yao, Yong
    Liu, Zhijng
    FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 3, PROCEEDINGS, 2007, : 41 - 44
  • [4] A fast distributed algorithm for mining association rules
    Cheung, DW
    Han, JW
    Ng, VT
    Fu, AW
    Fu, YJ
    PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED INFORMATION SYSTEMS, 1996, : 31 - 42
  • [5] An efficient algorithm for mining distributed association rules
    Li, YJ
    Lin, XM
    Tsang, CP
    INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-IV, PROCEEDINGS, 1998, : 1169 - 1175
  • [6] An efficient distributed algorithm for mining association rules
    Farzanyar, Zahra
    Kangavari, Mohammadreza
    Hashemi, Sattar
    PARALLEL AND DISTRIBUTED PROCESSING AND APPLICATIONS, 2006, 4330 : 383 - +
  • [7] ENHANCING THE PERFORMANCE OF DISTRIBUTED MINING OF ASSOCIATION RULES
    Tlili, Raja
    Slimani, Yahya
    2011 INTERNATIONAL CONFERENCE ON INSTRUMENTATION, MEASUREMENT, CIRCUITS AND SYSTEMS (ICIMCS 2011), VOL 3: COMPUTER-AIDED DESIGN, MANUFACTURING AND MANAGEMENT, 2011, : 391 - 396
  • [8] A Sampling Algorithm for Mining Association Rules in Distributed Database
    Shi Yue-mei
    Hu Guo-hua
    FIRST INTERNATIONAL WORKSHOP ON DATABASE TECHNOLOGY AND APPLICATIONS, PROCEEDINGS, 2009, : 431 - 434
  • [9] A new efficient distributed algorithm for mining association rules
    Zhao, Yan
    Zhou, Hong
    Liu, Zhijing
    PROGRESS IN INTELLIGENCE COMPUTATION AND APPLICATIONS, PROCEEDINGS, 2007, : 493 - 495
  • [10] Privacy preserving distributed mining algorithm of association rules
    Department of Computer Science, Xi'an Jiaotong University, Xi'an 710049, China
    不详
    Jisuanji Gongcheng, 2006, 21 (35-37):