Large-scale Neural Modeling in MapReduce and Giraph

被引:0
|
作者
Yang, Shuo [1 ]
Spielman, Nicholas D. [2 ]
Jackson, Jadin C. [3 ]
Rubin, Brad S. [1 ]
机构
[1] St Thomas Univ, Grad Programs Software, St Paul, MN 55455 USA
[2] Neurosci Program Univ St Thomas, Minneapolis, MN USA
[3] Univ St Thomas, Dept Biol, Minneapolis, MN USA
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
One of the most crucial challenges in scientific computing is scalability. Hadoop, an open-source implementation of the MapReduce parallel programming model developed by Google, has emerged as a powerful platform for performing large-scale scientific computing at very low costs. In this paper, we explore the use of Hadoop to model large-scale neural networks. A neural network is most naturally modeled by a graph structure with iterative processing. In this paper, we first present an improved graph algorithm design pattern in MapReduce called Mapper-side Schimmy. Experiments show that the application of our design pattern, combined with the current best practices, can reduce the running time of the neural network simulation on a neural network with 100,000 neurons and 2.3 billion edges by 64%. MapReduce, however, is inherently not efficient for iterative graph processing. To address the limitation of the MapReduce model, we then explore the use of Giraph, an open source large-scale graph processing framework that sits on top of Hadoop to implement graph algorithms with a vertex-centric approach. We show that our Giraph implementation boosted performance by 91% compared to a basic MapReduce implementation and by 60% compared to our improved Mapper-side Schimmy algorithm.
引用
收藏
页码:556 / 561
页数:6
相关论文
共 50 条
  • [41] Recent Developments on Security and Reliability in Large-Scale Data Processing with MapReduce
    Esposito, Christian
    Ficco, Massimo
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2016, 12 (01) : 49 - 68
  • [42] Social Relation Extraction of Large-Scale Logistics Network Based on MapReduce
    Gui, Feng
    Zhang, Feng
    Ma, Yunlong
    Liu, Min
    Shen, Weiming
    2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 2273 - 2277
  • [43] Application of convolutional neural networks to large-scale naphtha pyrolysis kinetic modeling
    Hua, Feng
    Fang, Zhou
    Qiu, Tong
    CHINESE JOURNAL OF CHEMICAL ENGINEERING, 2018, 26 (12) : 2562 - 2572
  • [44] Application of convolutional neural networks to large-scale naphtha pyrolysis kinetic modeling
    Feng Hua
    Zhou Fang
    Tong Qiu
    Chinese Journal of Chemical Engineering, 2018, 26 (12) : 2562 - 2572
  • [45] LARGE-SCALE URBAN MODELING - CLOSURE
    HELWEG, OJ
    JOURNAL OF THE URBAN PLANNING & DEVELOPMENT DIVISION-ASCE, 1981, 107 (01): : 53 - 53
  • [46] Modeling of Large-Scale Point Model
    Guo Ming
    Wang Yanmin
    Zhao Youshan
    Zhou Junzhao
    2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 4, 2009, : 448 - +
  • [47] Approaches to large-scale urban modeling
    Hu, JH
    You, SY
    Neumann, U
    IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2003, 23 (06) : 62 - 69
  • [48] Parallel genesis for large-scale modeling
    Goddard, NH
    Hood, G
    COMPUTATIONAL NEUROSCIENCE: TRENDS IN RESEARCH, 1997, 1997, : 911 - 917
  • [49] SIMULATION MODELING OF LARGE-SCALE SYSTEMS
    FOSTER, JW
    HOGG, GL
    GONZALEZVEGA, O
    PROCEEDINGS ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM, 1986, (SYM): : 419 - 426
  • [50] Modeling large-scale dispersal distances
    Paradis, E
    Baillie, SR
    Sutherland, WJ
    ECOLOGICAL MODELLING, 2002, 151 (2-3) : 279 - 292