Probabilistic Frequent Itemset Mining on a GPU Cluster

被引:3
|
作者
Kozawa, Yusuke [1 ]
Amagasa, Toshiyuki [2 ]
Kitagawa, Hiroyuki [2 ]
机构
[1] Univ Tsukuba, Grad Sch Syst & Informat Engn, Tsukuba, Ibaraki 3058573, Japan
[2] Univ Tsukuba, Fac Engn Informat & Syst, Tsukuba, Ibaraki 3058573, Japan
来源
关键词
GPU; uncertain databases; probabilistic frequent itemsets;
D O I
10.1587/transinf.E97.D.779
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Probabilistic frequent itemset mining, which discovers frequent itemsets from uncertain data, has attracted much attention due to inherent uncertainty in the real world. Many algorithms have been proposed to tackle this problem, but their performance is not satisfactory because handling uncertainty incurs high processing cost. To accelerate such computation, we utilize GPUs (Graphics Processing Units). Our previous work accelerated an existing algorithm with a single GPU. In this paper, we extend the work to employ multiple GPUs. Proposed methods minimize the amount of data that need to be communicated among GPUs, and achieve load balancing as well. Based on the methods, we also present algorithms on a GPU cluster. Experiments show that the single-node methods realize near-linear speedups, and the methods on a GPU cluster of eight nodes achieve up to a 7.1 times speedup.
引用
收藏
页码:779 / 789
页数:11
相关论文
共 50 条
  • [1] Exploiting GPU and cluster parallelism in single scan frequent itemset mining
    Djenouri, Youcef
    Djenouri, Djamel
    Belhadi, Asma
    Cano, Alberto
    [J]. INFORMATION SCIENCES, 2019, 496 : 363 - 377
  • [2] Probabilistic Frequent Itemset Mining in Uncertain Databases
    Bernecker, Thomas
    Kriegel, Hans-Peter
    Renz, Matthias
    Verhein, Florian
    Zuefle, Andreas
    [J]. KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, : 119 - 127
  • [3] Frequent Itemset Mining on Correlated Probabilistic Databases
    Kalaz, Yasemin Asan
    Raman, Rajeev
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA 2018), PT II, 2018, 11030 : 84 - 98
  • [4] GPApriori: GPU-Accelerated Frequent Itemset Mining
    Zhang, Fan
    Zhang, Yan
    Bakos, Jason
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2011, : 590 - 594
  • [5] Model-based probabilistic frequent itemset mining
    Thomas Bernecker
    Reynold Cheng
    David W. Cheung
    Hans-Peter Kriegel
    Sau Dan Lee
    Matthias Renz
    Florian Verhein
    Liang Wang
    Andreas Zuefle
    [J]. Knowledge and Information Systems, 2013, 37 : 181 - 217
  • [6] Model-based probabilistic frequent itemset mining
    Bernecker, Thomas
    Cheng, Reynold
    Cheung, David W.
    Kriegel, Hans-Peter
    Lee, Sau Dan
    Renz, Matthias
    Verhein, Florian
    Wang, Liang
    Zuefle, Andreas
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 37 (01) : 181 - 217
  • [7] Probabilistic frequent itemset mining over uncertain data streams
    Li, Haifeng
    Zhang, Ning
    Zhu, Jianming
    Wang, Yue
    Cao, Huaihu
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 112 : 274 - 287
  • [8] Efficient weighted probabilistic frequent itemset mining in uncertain databases
    Li, Zhiyang
    Chen, Fengjuan
    Wu, Junfeng
    Liu, Zhaobin
    Liu, Weijiang
    [J]. EXPERT SYSTEMS, 2021, 38 (05)
  • [9] Probabilistic Frequent Pattern Growth for Itemset Mining in Uncertain Databases
    Bernecker, Thomas
    Kriegel, Hans-Peter
    Renz, Matthias
    Verhein, Florian
    Zuefle, Andreas
    [J]. SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, SSDBM 2012, 2012, 7338 : 38 - 55
  • [10] Probabilistic Frequent Itemset Mining Algorithm over Uncertain Databases with Sampling
    Li, Hai-Feng
    Zhang, Ning
    Zhang, Yue-Jin
    Wang, Yue
    [J]. FUZZY SYSTEMS AND DATA MINING II, 2016, 293 : 159 - 166