Large-scale Neural Modeling in MapReduce and Giraph

Cited by: 0
Authors
Yang, Shuo [1]
Spielman, Nicholas D. [2]
Jackson, Jadin C. [3]
Rubin, Brad S. [1]
Affiliations
[1] St Thomas Univ, Grad Programs Software, St Paul, MN 55455 USA
[2] Neurosci Program Univ St Thomas, Minneapolis, MN USA
[3] Univ St Thomas, Dept Biol, Minneapolis, MN USA
Source
2014 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY (EIT) | 2014
Keywords
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
One of the most crucial challenges in scientific computing is scalability. Hadoop, an open-source implementation of the MapReduce parallel programming model developed by Google, has emerged as a powerful platform for performing large-scale scientific computing at very low cost. In this paper, we explore the use of Hadoop to model large-scale neural networks. A neural network is most naturally modeled as a graph structure with iterative processing. We first present an improved graph algorithm design pattern in MapReduce called Mapper-side Schimmy. Experiments show that applying our design pattern, combined with current best practices, can reduce the running time of a simulation of a neural network with 100,000 neurons and 2.3 billion edges by 64%. MapReduce, however, is inherently inefficient for iterative graph processing. To address this limitation of the MapReduce model, we then explore the use of Giraph, an open-source large-scale graph processing framework that sits on top of Hadoop, to implement graph algorithms with a vertex-centric approach. We show that our Giraph implementation boosted performance by 91% compared to a basic MapReduce implementation and by 60% compared to our improved Mapper-side Schimmy algorithm.
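The vertex-centric, iterative style of graph processing that the abstract attributes to Giraph can be illustrated with a small toy sketch. This is not the authors' implementation and does not use the Giraph API: the function name `run_supersteps`, its parameters, and the simple threshold-and-reset neuron model are all hypothetical, chosen only to show how each vertex updates its own state from incoming messages and sends messages along its outgoing edges between synchronization barriers.

```python
from collections import defaultdict


def run_supersteps(edges, potentials, threshold, num_supersteps):
    """Toy bulk-synchronous, vertex-centric simulation (illustrative only).

    edges: dict mapping neuron id -> list of (target id, weight)
    potentials: dict mapping neuron id -> initial potential
    Returns a list holding, per superstep, the sorted ids of neurons that fired.
    The threshold-and-reset neuron model is an assumption, not from the paper.
    """
    potentials = dict(potentials)  # copy so the caller's state is not mutated
    inbox = defaultdict(float)     # summed messages visible in this superstep
    history = []
    for _ in range(num_supersteps):
        outbox = defaultdict(float)
        fired = []
        for vertex in potentials:
            # Per-vertex compute step: only local state and inbox are used.
            potentials[vertex] += inbox.get(vertex, 0.0)
            if potentials[vertex] >= threshold:
                fired.append(vertex)
                potentials[vertex] = 0.0  # reset after firing
                for target, weight in edges.get(vertex, []):
                    outbox[target] += weight  # delivered in the next superstep
        history.append(sorted(fired))
        inbox = outbox  # barrier: messages become visible only next superstep
    return history


# A three-neuron chain: activity propagates one hop per superstep.
chain = {0: [(1, 1.0)], 1: [(2, 1.0)], 2: []}
start = {0: 1.0, 1: 0.0, 2: 0.0}
print(run_supersteps(chain, start, threshold=1.0, num_supersteps=3))
```

In this sketch the graph structure stays in place across iterations and only messages move between supersteps, which is the property that makes a vertex-centric model a natural fit for iterative graph algorithms.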
Pages: 556-561
Page count: 6
Related papers
50 in total
  • [21] Analyzing Patterns in Large-Scale Graphs Using MapReduce in Hadoop
    Schultz, Joshua
    Vierya, Jonathan
    Lu, Enyue
    2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 1457 - +
  • [22] A Large-Scale Graph Learning Framework of Technological Gatekeepers by MapReduce
    Liu Tong
    Guo Wensheng
    2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW), 2012, : 1997 - 2003
  • [23] Large-Scale Multimedia Data Mining Using MapReduce Framework
    Wang, Hanli
    Shen, Yun
    Wang, Lei
    Zhufeng, Kuangtian
    Wang, Wei
    Cheng, Cheng
    2012 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), 2012,
  • [24] Large-Scale Graph Classification Based on Evolutionary Computation with MapReduce
    Wang, Zhanghui
    Zhao, Yuhai
    Wang, Guoren
    Cheng, Yurong
    WEB TECHNOLOGIES AND APPLICATIONS (APWEB 2015), 2015, 9313 : 227 - 243
  • [25] Biomarker Discovery Based on Large-Scale Feature Selection and MapReduce
    Kourid, Ahlam
    Batouche, Mohamed
    COMPUTER SCIENCE AND ITS APPLICATIONS, CIIA 2015, 2015, 456 : 81 - 92
  • [26] Convolutional neural networks for large-scale dynamical modeling of itinerant magnets
    Cheng X.
    Zhang S.
    Nguyen P.C.H.
    Azarfar S.
    Chern G.-W.
    Baek S.S.
    Physical Review Research, 2023, 5 (03):
  • [27] Neural network-based modeling for a large-scale power plant
    Lee, Kwang Y.
    Heo, Jin S.
    Hoffman, Jason A.
    Kim, Sung-Ho
    Jung, Won-Hee
    2007 IEEE POWER ENGINEERING SOCIETY GENERAL MEETING, VOLS 1-10, 2007, : 1028 - 1035
  • [28] Modeling of complex large-scale system using fuzzy neural networks
    Liu, J.
    San, Y.
    Wang, Z.
    Xitong Fangzhen Xuebao / Journal of System Simulation, 2001, 13 (03): : 304 - 307
  • [29] LARGE-SCALE URBAN MODELING
    HELWEG, OJ
    JOURNAL OF THE URBAN PLANNING & DEVELOPMENT DIVISION-ASCE, 1979, 105 (02): : 89 - 101
  • [30] LARGE-SCALE URBAN MODELING
    GRIGG, NS
    JOURNAL OF THE URBAN PLANNING & DEVELOPMENT DIVISION-ASCE, 1980, 106 (01): : 106 - 107