Poster: A Novel Shared Memory Framework for Distributed Deep Learning in High-Performance Computing Architecture

被引:4
|
作者
Ahn, Shinyoung [1 ,2 ]
Kim, Joongheon [3 ]
Kang, Sungwon [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea
[2] ETRI, Daejeon, South Korea
[3] Chung Ang Univ, Seoul, South Korea
关键词
Distributed deep learning; remote shared memory; parameter sharing; HPC;
D O I
10.1145/3183440.3195091
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper proposes a novel virtual shared memory framework, Soft Memory Box (SMB), which directly shares the memory of remote nodes among distributed processes to improve communication performance/speed via deep learning parameter sharing.
引用
收藏
页码:191 / 192
页数:2
相关论文
共 50 条
  • [1] Soft Memory Box: A Virtual Shared Memory Framework for Fast Deep Neural Network Training in Distributed High Performance Computing
    Ahn, Shinyoung
    Kim, Joongheon
    Lim, Eunji
    Kang, Sungwon
    [J]. IEEE ACCESS, 2018, 6 : 26493 - 26504
  • [2] The BORG distributed architecture for high-performance computing
    Mou, ZG
    Duong, L
    Donuhue, D
    Ku, HC
    [J]. APPLICATIONS OF HIGH-PERFORMANCE COMPUTING IN ENGINEERING VI, 2000, 6 : 399 - 408
  • [3] HPDL: Towards a General Framework for High-performance Distributed Deep Learning
    Li, Dongsheng
    Lai, Zhiquan
    Ge, Keshi
    Zhang, Yiming
    Zhang, Zhaoning
    Sun, Tao
    Wang, Qinglin
    Wang, Huaimin
    [J]. 2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 1742 - 1753
  • [4] A floating-paint validation suite for high-performance shared and distributed memory computing systems
    Ghoshal, SK
    [J]. FOURTH INTERNATIONAL CONFERENCE ON HIGH-PERFORMANCE COMPUTING, PROCEEDINGS, 1997, : 88 - 93
  • [5] ShmCaffe: A Distributed Deep Learning Platform with Shared Memory Buffer for HPC Architecture
    Ahn, Shinyoung
    Kim, Joongheon
    Lim, Eunji
    Choi, Wan
    Mohaisen, Aziz
    Kang, Sungwon
    [J]. 2018 IEEE 38TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2018, : 1118 - 1128
  • [6] Distributed Deep Learning Framework based on Shared Memory for Fast Deep Neural Network Training
    Lim, Eun-Ji
    Ahn, Shin-Young
    Park, Yoo-Mi
    Choi, Wan
    [J]. 2018 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2018, : 1239 - 1242
  • [7] Breast histopathology with high-performance computing and deep learning
    Graziani M.
    Eggel I.
    Deligand F.
    Bobák M.
    Andrearczyk V.
    Müller H.
    [J]. Computing and Informatics, 2021, 39 (04) : 780 - 807
  • [8] BREAST HISTOPATHOLOGY WITH HIGH-PERFORMANCE COMPUTING AND DEEP LEARNING
    Graziani, Mara
    Eggel, Ivan
    Deligand, Francois
    Bobak, Martin
    Andrearczyk, Vincent
    Mueller, Henning
    [J]. COMPUTING AND INFORMATICS, 2020, 39 (04) : 780 - 807
  • [9] HIGH-PERFORMANCE DISTRIBUTED COMPUTING
    RAGHAVENDRA, CS
    [J]. CONCURRENCY-PRACTICE AND EXPERIENCE, 1994, 6 (04): : 231 - 233
  • [10] High-Performance Genomic Analysis Framework with In-Memory Computing
    Li, Xueqi
    Tan, Guangming
    Wang, Bingchen
    Sun, Ninghui
    [J]. ACM SIGPLAN NOTICES, 2018, 53 (01) : 317 - +