Analysis of on-chip communication properties in accelerator architectures for Deep Neural Networks

被引：7

作者：

Krichene, Hana ^{[1
]}

Philippe, Jean-Marc ^{[1
]}

机构：

[1] Univ Paris Saclay, CEA, List, F-91120 Palaiseau, France

来源：

2021 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON NETWORKS-ON-CHIP (NOCS 2021) | 2021年

关键词：

Network-on-Chip; Deep Neural Networks; Artificial Intelligence; CNN accelerators;

D O I：

10.1145/3479876.3481588

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Deep neural networks (DNNs) algorithms are expected to be core components of next-generation applications. These high performance sensing and recognition algorithms are key enabling technologies of smarter systems that make appropriate decisions about their environment. The integration of these compute-intensive and memory-hungry algorithms into embedded systems will require the use of specific energy-efficient hardware accelerators. The intrinsic parallelism of DNNs algorithms allows for the use of a large number of small processing elements, and the tight exploitation of data reuse can significantly reduce power consumption. To meet these features, many dataflow models and on-chip communication proposals have been studied in recent years. This paper proposes a comprehensive study of on-chip communication properties based on the analysis of application-specific features, such as data reuse and communication models, as well as the results of mapping these applications to architectures of different sizes. In addition, the influence of mechanisms such as broadcast and multicast on performance and energy efficiency is analyzed. This study leads to the definition of overarching features to be integrated into next-generation on-chip communication infrastructures for CNN accelerators.

引用

页码：9 / 14

页数：6

共 50 条

[21] Memristive Quantized Neural Networks: A Novel Approach to Accelerate Deep Learning On-Chip
Zhang, Yang
Cui, Menglin
Shen, Linlin
Zeng, Zhigang
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (04) : 1875 - 1887
[22] Minimizing virtual channel buffer for routers in on-chip communication architectures
Al Faruque, Mohammad Abdullah
Henkel, Joerg
[J]. 2008 DESIGN, AUTOMATION AND TEST IN EUROPE, VOLS 1-3, 2008, : 1080 - 1085
[23] A hierarchical modeling framework for on-chip communication architectures of multiprocessing SoCs
Zhu, Xinping
Malik, Sharad
[J]. ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2007, 12 (01)
[24] Fast exploration of bus-based on-chip communication architectures
Pasricha, S
Dutt, N
Ben-Romdhane, M
[J]. INTERNATIONAL CONFERENCE ON HARDWARE/SOFTWARE CODESIGN AND SYSTEM SYNTHESIS, 2004, : 242 - 247
[25] Cooperative communication based barrier synchronization in on-chip mesh architectures
Chen, Xiaowen
Lu, Zhonghai
Jantsch, Axel
Chen, Shuming
Liu, Hai
[J]. IEICE ELECTRONICS EXPRESS, 2011, 8 (22): : 1856 - 1862
[26] Real-time communication analysis for on-chip networks with wormhole switching
Shi, Zheng
Burns, Alan
[J]. NOCS 2008: SECOND IEEE INTERNATIONAL SYMPOSIUM ON NETWORKS-ON-CHIP, PROCEEDINGS, 2007, : 161 - 170
[27] On-chip communication for neuro-glia networks
Martin, George
Harkin, Jim
McDaid, Liam J.
Wade, John J.
Liu, Junxiu
[J]. IET COMPUTERS AND DIGITAL TECHNIQUES, 2018, 12 (04): : 130 - 138
[28] Stable architectures for deep neural networks
Haber, Eldad
Ruthotto, Lars
[J]. INVERSE PROBLEMS, 2018, 34 (01)
[29] On-Chip Communication Network for Efficient Training of Deep Convolutional Networks on Heterogeneous Manycore Systems
Choi, Wonje
Duraisamy, Karthi
Kim, Ryan Gary
Doppa, Janardhan Rao
Pande, Partha Pratim
Marculescu, Diana
Marculescu, Radu
[J]. IEEE TRANSACTIONS ON COMPUTERS, 2018, 67 (05) : 672 - 686
[30] Scalar operand networks: On-chip interconnect for ILP in partitioned architectures
Taylor, MB
Lee, W
Amarasinghe, S
Agarwal, A
[J]. NINTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 2003, : 341 - 353

← 1 2 3 4 5 →