Large-Scale Deep Belief Nets With MapReduce

被引:35
|
作者
Zhang, Kunlei [1 ]
Chen, Xue-Wen [1 ]
机构
[1] Wayne State Univ, Dept Comp Sci, Detroit, MI 48202 USA
来源
IEEE ACCESS | 2014年 / 2卷
关键词
Big data; deep learning; MapReduce; Hadoop; deep belief net (DBN); restricted Boltzmann machine (RBM);
D O I
10.1109/ACCESS.2014.2319813
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep belief nets (DBNs) with restricted Boltzmann machines (RBMs) as the building block have recently attracted wide attention due to their great performance in various applications. The learning of a DBN starts with pretraining a series of the RBMs followed by fine-tuning the whole net using backpropagation. Generally, the sequential implementation of both RBMs and backpropagation algorithm takes significant amount of computational time to process massive data sets. The emerging big data learning requires distributed computing for the DBNs. In this paper, we present a distributed learning paradigm for the RBMs and the backpropagation algorithm using MapReduce, a popular parallel programming model. Thus, the DBNs can be trained in a distributed way by stacking a series of distributed RBMs for pretraining and a distributed backpropagation for fine-tuning. Through validation on the benchmark data sets of various practical problems, the experimental results demonstrate that the distributed RBMs and DBNs are amenable to large-scale data with a good performance in terms of accuracy and efficiency.
引用
收藏
页码:395 / 403
页数:9
相关论文
共 50 条
  • [1] Large-scale incremental processing with MapReduce
    Lee, Daewoo
    Kim, Jin-Soo
    Maeng, Seungryoul
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2014, 36 : 66 - 79
  • [2] MapReduce for Large-scale Monitor Data Analyses
    Ding, Jianwei
    Liu, Yingbo
    Zhang, Li
    Wang, Jianmin
    2014 IEEE 13TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM), 2014, : 747 - 754
  • [3] MapReduce in MPI for Large-scale graph algorithms
    Plimpton, Steven J.
    Devine, Karen D.
    PARALLEL COMPUTING, 2011, 37 (09) : 610 - 632
  • [4] Large-Scale Frequent Subgraph Mining in MapReduce
    Lin, Wenqing
    Xiao, Xiaokui
    Ghinita, Gabriel
    2014 IEEE 30TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2014, : 844 - 855
  • [5] Large-scale Neural Modeling in MapReduce and Giraph
    Yang, Shuo
    Spielman, Nicholas D.
    Jackson, Jadin C.
    Rubin, Brad S.
    2014 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY (EIT), 2014, : 556 - 561
  • [6] Mining large-scale repetitive sequences in a MapReduce setting
    Cao, Hongfei
    Phinney, Michael
    Petersohn, Devin
    Merideth, Benjamin
    Shyu, Chi-Ren
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2016, 14 (03) : 210 - 228
  • [7] Efficient Large-scale Trace Checking Using MapReduce
    Bersani, Marcello M.
    Bianculli, Domenico
    Ghezzi, Carlo
    Krstic, Srdan
    San Pietro, Pierluigi
    2016 IEEE/ACM 38TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2016, : 888 - 898
  • [8] Efficient large-scale data analysis using mapreduce
    Kubo, R., 1600, Nippon Telegraph and Telephone Corp. (10):
  • [9] Review of large-scale RDF data processing in mapreduce
    Hou, Ke
    Zhang, Ming
    Fang, Xing
    Journal of Software Engineering, 2015, 9 (01): : 195 - 202
  • [10] A survey of large-scale analytical query processing in MapReduce
    Doulkeridis, Christos
    Norvag, Kjetil
    VLDB JOURNAL, 2014, 23 (03): : 355 - 380