Large-Scale Deep Belief Nets With MapReduce

被引:35
|
作者
Zhang, Kunlei [1 ]
Chen, Xue-Wen [1 ]
机构
[1] Wayne State Univ, Dept Comp Sci, Detroit, MI 48202 USA
来源
IEEE ACCESS | 2014年 / 2卷
关键词
Big data; deep learning; MapReduce; Hadoop; deep belief net (DBN); restricted Boltzmann machine (RBM);
D O I
10.1109/ACCESS.2014.2319813
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep belief nets (DBNs) with restricted Boltzmann machines (RBMs) as the building block have recently attracted wide attention due to their great performance in various applications. The learning of a DBN starts with pretraining a series of the RBMs followed by fine-tuning the whole net using backpropagation. Generally, the sequential implementation of both RBMs and backpropagation algorithm takes significant amount of computational time to process massive data sets. The emerging big data learning requires distributed computing for the DBNs. In this paper, we present a distributed learning paradigm for the RBMs and the backpropagation algorithm using MapReduce, a popular parallel programming model. Thus, the DBNs can be trained in a distributed way by stacking a series of distributed RBMs for pretraining and a distributed backpropagation for fine-tuning. Through validation on the benchmark data sets of various practical problems, the experimental results demonstrate that the distributed RBMs and DBNs are amenable to large-scale data with a good performance in terms of accuracy and efficiency.
引用
收藏
页码:395 / 403
页数:9
相关论文
共 50 条
  • [41] Large-Scale Mammography CAD with Deformable Conv-Nets
    Morrell, Stephen
    Wojna, Zbigniew
    Khoo, Can Son
    Ourselin, Sebastien
    Iglesias, Juan Eugenio
    IMAGE ANALYSIS FOR MOVING ORGAN, BREAST, AND THORACIC IMAGES, 2018, 11040 : 64 - 72
  • [42] On reliability of large-scale nets constructed from identical elements
    V. A. Rodin
    S. V. Sinegubov
    Russian Mathematics, 2019, 63 : 51 - 56
  • [43] Decomposing large-scale POMDP via belief state analysis
    Li, X
    Cheung, WK
    Liu, JM
    2005 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, Proceedings, 2005, : 428 - 434
  • [44] MapReduce-based parallel learning for large-scale remote sensing images
    Huang, Fenghua
    Open Automation and Control Systems Journal, 2014, 6 (01): : 1962 - 1974
  • [45] Optimizing and Tuning MapReduce Jobs to Improve the Large-Scale Data Analysis Process
    Premchaiswadi, Wichian
    Romsaiyud, Walisa
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2013, 28 (02) : 185 - 200
  • [46] Phoenix Rebirth: Scalable MapReduce on a Large-Scale Shared-Memory System
    Yoo, Richard M.
    Romano, Anthony
    Kozyrakis, Christos
    PROCEEDINGS OF THE 2009 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION, 2009, : 198 - 207
  • [47] Implementing Materialized View of Large-Scale Power Consumption Log Using MapReduce
    Ise, Yuki
    Yamamoto, Shintaro
    Matsumoto, Shinsuke
    Saiki, Sachio
    Nakamura, Masahide
    2013 14TH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD 2013), 2013, : 523 - 528
  • [48] An Incremental and Distributed Inference Method for Large-Scale Ontologies Based on MapReduce Paradigm
    Liu, Bo
    Huang, Keman
    Li, Jianqiang
    Zhou, MengChu
    IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (01) : 53 - 64
  • [49] CloudNMF:A MapReduce Implementation of Nonnegative Matrix Factorization for Large-scale Biological Datasets
    Ruiqi Liao
    Yifan Zhang
    Jihong Guan
    Shuigeng Zhou
    Genomics,Proteomics & Bioinformatics, 2014, (01) : 48 - 51
  • [50] RESEARCH BASED ON LARGE-SCALE DATA QUERY WITH MAPREDUCE TECHNOLOGY IN CLOUD COMPUTING
    Wang, Feiping
    Gu, Xiaofeng
    2012 INTERNATIONAL CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (LCWAMTIP), 2012, : 243 - 245