GPU-Accelerated Parallel Hierarchical Extreme Learning Machine on Flink for Big Data

Cited by: 81
Authors
Chen, Cen [1, 2]
Li, Kenli [1, 2]
Ouyang, Aijia [1, 2, 3]
Tang, Zhuo [1, 2]
Li, Keqin [1, 2, 4]
Affiliations
[1] Hunan Univ, Coll Informat Sci & Engn, Changsha 410082, Hunan, Peoples R China
[2] Natl Supercomp Ctr, Changsha 410082, Hunan, Peoples R China
[3] Zunyi Normal Coll, Dept Informat Engn, Zunyi 563006, Peoples R China
[4] SUNY Coll New Paltz, Dept Comp Sci, New Paltz, NY 12561 USA
Funding
National Natural Science Foundation of China;
Keywords
Big data; deep learning (DL); Flink; GPGPU; hierarchical extreme learning machine (H-ELM); parallel; FEEDFORWARD NETWORKS; HIDDEN NODES; MAPREDUCE; APPROXIMATION; CLASSIFICATION; OPTIMIZATION; REGRESSION; ALGORITHM; SPMV;
DOI
10.1109/TSMC.2017.2690673
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
The extreme learning machine (ELM) has become one of the most important and popular machine learning algorithms because of its extremely fast training speed, good generalization, and universal approximation/classification capability. Hierarchical ELM (H-ELM) extends ELM from single-hidden-layer feedforward networks to multilayer perceptrons, greatly broadening the applicability of ELM. Training H-ELM generally requires large-scale datasets (DSTs), so how to apply the H-ELM framework to big data is worth further exploration. This paper proposes a parallel H-ELM algorithm based on Flink, an in-memory cluster computing platform, and graphics processing units (GPUs). Several optimizations are adopted to improve performance, including a cache-based scheme, a reasonable partitioning strategy, and a memory mapping scheme that maps specific Java virtual machine objects to buffers. Most importantly, the proposed framework for using GPUs to accelerate Flink for big data is general and can be used to accelerate many other ELM variants and other machine learning algorithms. To the best of our knowledge, it is the first library of its kind to combine in-memory cluster computing with GPUs to parallelize H-ELM. The experimental results demonstrate that the proposed GPU-accelerated parallel H-ELM, named GPH-ELM, can efficiently process large-scale DSTs with good speedup and scalability, leveraging the computing power of both CPUs and GPUs in the cluster.
Pages: 2740-2753
Page count: 14
Related papers
50 in total
  • [21] GPU-Accelerated Parallel Monte Carlo Analysis of Analog Circuits by Hierarchical Graph-based Solver
    Zhu, Yan
    Tan, Sheldon X. -D.
    [J]. 2015 20TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2015: 719-724
  • [22] GPU-accelerated Parallel 3D Image Thinning
    Hu, Bingfeng
    Yang, Xuan
    [J]. 2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC), 2013: 149-152
  • [23] A GPU-accelerated adaptive FSAI preconditioner for massively parallel simulations
    Isotton, Giovanni
    Janna, Carlo
    Bernaschi, Massimo
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2022, 36(2): 153-166
  • [24] GPU-Accelerated Graph Clustering via Parallel Label Propagation
    Kozawa, Yusuke
    Amagasa, Toshiyuki
    Kitagawa, Hiroyuki
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017: 567-576
  • [25] GUNREAL: GPU-accelerated UNsupervised REinforcement and Auxiliary Learning
    Coppens, Youri
    Shirahata, Koichi
    Fukagai, Takuya
    Tomita, Yasumoto
    Ike, Atsushi
    [J]. 2017 FIFTH INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING (CANDAR), 2017: 330-336
  • [26] ELM-MapReduce: MapReduce Accelerated Extreme Learning Machine for Big Spatial Data Analysis
    Chen, Jiaoyan
    Zheng, Guozhou
    Chen, Huajun
    [J]. 2013 10TH IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2013: 400-405
  • [27] GPU-accelerated Outlier Detection for Continuous Data Streams
    HewaNadungodage, Chandima
    Xia, Yuni
    Lee, John Jaehwan
    [J]. 2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016), 2016: 1133-1142
  • [28] GPU-Accelerated Adaptive Compression Framework for Genomics Data
    Guo, GuiXin
    Qiu, Shuang
    Ye, ZhiQiang
    Wang, BingQiang
    Fang, Lin
    Lu, Mian
    See, Simon
    Mao, Rui
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [29] GStreamMiner: A GPU-accelerated Data Stream Mining Framework
    HewaNadungodage, Chandima
    Xia, Yuni
    Lee, John Jaehwan
    [J]. CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016: 2489-2492
  • [30] FAST CELL DETECTION IN HIGH-THROUGHPUT IMAGERY USING GPU-ACCELERATED MACHINE LEARNING
    Mayerich, David
    Kwon, Jaerock
    Panchal, Aaron
    Keyser, John
    Choe, Yoonsuck
    [J]. 2011 8TH IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING: FROM NANO TO MACRO, 2011: 719-723