Analysis of on-chip communication properties in accelerator architectures for Deep Neural Networks

被引:7
|
作者
Krichene, Hana [1 ]
Philippe, Jean-Marc [1 ]
机构
[1] Univ Paris Saclay, CEA, List, F-91120 Palaiseau, France
关键词
Network-on-Chip; Deep Neural Networks; Artificial Intelligence; CNN accelerators;
D O I
10.1145/3479876.3481588
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deep neural networks (DNNs) algorithms are expected to be core components of next-generation applications. These high performance sensing and recognition algorithms are key enabling technologies of smarter systems that make appropriate decisions about their environment. The integration of these compute-intensive and memory-hungry algorithms into embedded systems will require the use of specific energy-efficient hardware accelerators. The intrinsic parallelism of DNNs algorithms allows for the use of a large number of small processing elements, and the tight exploitation of data reuse can significantly reduce power consumption. To meet these features, many dataflow models and on-chip communication proposals have been studied in recent years. This paper proposes a comprehensive study of on-chip communication properties based on the analysis of application-specific features, such as data reuse and communication models, as well as the results of mapping these applications to architectures of different sizes. In addition, the influence of mechanisms such as broadcast and multicast on performance and energy efficiency is analyzed. This study leads to the definition of overarching features to be integrated into next-generation on-chip communication infrastructures for CNN accelerators.
引用
收藏
页码:9 / 14
页数:6
相关论文
共 50 条
  • [21] Memristive Quantized Neural Networks: A Novel Approach to Accelerate Deep Learning On-Chip
    Zhang, Yang
    Cui, Menglin
    Shen, Linlin
    Zeng, Zhigang
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (04) : 1875 - 1887
  • [22] Minimizing virtual channel buffer for routers in on-chip communication architectures
    Al Faruque, Mohammad Abdullah
    Henkel, Joerg
    [J]. 2008 DESIGN, AUTOMATION AND TEST IN EUROPE, VOLS 1-3, 2008, : 1080 - 1085
  • [23] A hierarchical modeling framework for on-chip communication architectures of multiprocessing SoCs
    Zhu, Xinping
    Malik, Sharad
    [J]. ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2007, 12 (01)
  • [24] Fast exploration of bus-based on-chip communication architectures
    Pasricha, S
    Dutt, N
    Ben-Romdhane, M
    [J]. INTERNATIONAL CONFERENCE ON HARDWARE/SOFTWARE CODESIGN AND SYSTEM SYNTHESIS, 2004, : 242 - 247
  • [25] Cooperative communication based barrier synchronization in on-chip mesh architectures
    Chen, Xiaowen
    Lu, Zhonghai
    Jantsch, Axel
    Chen, Shuming
    Liu, Hai
    [J]. IEICE ELECTRONICS EXPRESS, 2011, 8 (22): : 1856 - 1862
  • [26] Real-time communication analysis for on-chip networks with wormhole switching
    Shi, Zheng
    Burns, Alan
    [J]. NOCS 2008: SECOND IEEE INTERNATIONAL SYMPOSIUM ON NETWORKS-ON-CHIP, PROCEEDINGS, 2007, : 161 - 170
  • [27] On-chip communication for neuro-glia networks
    Martin, George
    Harkin, Jim
    McDaid, Liam J.
    Wade, John J.
    Liu, Junxiu
    [J]. IET COMPUTERS AND DIGITAL TECHNIQUES, 2018, 12 (04): : 130 - 138
  • [28] Stable architectures for deep neural networks
    Haber, Eldad
    Ruthotto, Lars
    [J]. INVERSE PROBLEMS, 2018, 34 (01)
  • [29] On-Chip Communication Network for Efficient Training of Deep Convolutional Networks on Heterogeneous Manycore Systems
    Choi, Wonje
    Duraisamy, Karthi
    Kim, Ryan Gary
    Doppa, Janardhan Rao
    Pande, Partha Pratim
    Marculescu, Diana
    Marculescu, Radu
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2018, 67 (05) : 672 - 686
  • [30] Scalar operand networks: On-chip interconnect for ILP in partitioned architectures
    Taylor, MB
    Lee, W
    Amarasinghe, S
    Agarwal, A
    [J]. NINTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 2003, : 341 - 353