Parallelism in Deep Learning Accelerators

被引:0
|
作者
Song, Linghao [1 ]
Chen, Fan [1 ]
Chen, Yiran [1 ]
Li, Hai Helen [1 ]
机构
[1] Duke Univ, Durham, NC 27706 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning is the core of artificial intelligence and it achieves state-of-the-art in a wide range of applications. The intensity of computation and data in deep learning processing poses significant challenges to the conventional computing platforms. Thus, specialized accelerator architectures are proposed for the acceleration of deep learning In this paper, we classify the design space of current deep learning accelerators into three levels, (1) processing engine, (2) memory and (3) accelerator, and present a constructive view from a perspective of parallelism in the three levels.
引用
收藏
页码:645 / 650
页数:6
相关论文
共 50 条
  • [31] ScaMP: Scalable Meta-Parallelism for Deep Learning Search
    Anthony, Quentin
    Xu, Lang
    Shafi, Aamir
    Subramoni, Hari
    Panda, Dhabaleswar K.
    2023 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING WORKSHOPS, CCGRIDW, 2023, : 346 - 348
  • [32] HSP: Hybrid Synchronous Parallelism for Fast Distributed Deep Learning
    Li, Yijun
    Huang, Jiawei
    Li, Zhaoyi
    Zhou, Shengwen
    Jiang, Wanchun
    Wang, Jianxin
    51ST INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2022, 2022,
  • [33] Towards accelerating model parallelism in distributed deep learning systems
    Choi, Hyeonseong
    Lee, Byung Hyun
    Chun, Se Young
    Lee, Jaehwan
    PLOS ONE, 2023, 18 (11):
  • [34] ScaMP: Scalable Meta-Parallelism for Deep Learning Search
    Anthony, Quentin
    Xu, Lang
    Shafi, Aamir
    Subramoni, Hari
    Panda, Dhabaleswar K.
    2023 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING, CCGRID, 2023, : 391 - 402
  • [35] AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving
    Li, Zhuohan
    Zheng, Lianmin
    Zhong, Yinmin
    Liu, Vincent
    Sheng, Ying
    Jin, Xin
    Huang, Yanping
    Chen, Zhifeng
    Zhang, Hao
    Gonzalez, Joseph E.
    Stoica, Ion
    PROCEEDINGS OF THE 17TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, OSDI 2023, 2023, : 663 - 679
  • [36] FPGA-Based Accelerators of Deep Learning Networks for Learning and Classification: A Review
    Shawahna, Ahmad
    Sait, Sadiq M.
    El-Maleh, Aiman
    IEEE ACCESS, 2019, 7 : 7823 - 7859
  • [37] Exploiting Memory-Level Parallelism in Reconfigurable Accelerators
    Cheng, Shaoyi
    Lin, Mingjie
    Liu, Hao Jun
    Scott, Simon
    Wawrzynek, John
    2012 IEEE 20TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2012, : 157 - 160
  • [38] Automatic parallelism exploitation for FPL-based accelerators
    Becker, J
    Schmidt, K
    PROCEEDINGS OF THE THIRTY-FIRST HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, VOL VII: SOFTWARE TECHNOLOGY TRACK, 1998, : 169 - 178
  • [39] Deep Learning Inferencing with High-performance Hardware Accelerators
    Kljucaric, Luke
    George, Alan D.
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2023, 14 (04)
  • [40] Decomposable Architecture and Fault Mitigation Methodology for Deep Learning Accelerators
    Huang, Ning-Chi
    Yang, Min-Syue
    Chang, Ya-Chu
    Wu, Kai-Chiang
    2023 24TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN, ISQED, 2023, : 298 - 305