DNN pruning and mapping on NoC-Based communication infrastructure

被引:15
|
作者
Mirmahaleh, Seyedeh Yasaman Hosseini [1 ]
Rahmani, Amir Masoud [1 ]
机构
[1] Islamic Azad Univ, Sci & Res Branch, Dept Comp Engn, Tehran, Iran
关键词
Deep neural network (DNN); Network on chip (NoC); DNN mapping; Dataflow mapping; Weight and neuron pruning (WNP);
D O I
10.1016/j.mejo.2019.104655
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Machine learning algorithm-based applications have been deployed for supporting the intemet of things (IoT) and web search engines without losing accuracy in order to satisfy human requests. Developments in deep learning-based applications and complexity of machine learning algorithms increase the depth of artificial neural networks (ANN). Increasing depth of neural network (NN) is challenging regarding the delay, energy consumption, learning, and inference speed up. We train a deep neural network (DNN) gradient descent-based method based on two Booth and Matyas standard generating functions. We also propose a method for pruning weights, neurons, and layers of DNNs based on minimal distance error before and after pruning in a range of safety margin error. This paper employs a new elastic dataflow and DNN mapping on the mesh topology for decreasing delay and energy consumption. Simulation results show reducing the delay and energy consumption of training and inference phases by approximately 22.56%-77% and 65.94%-88.54% compared with not employing a DNN pruning.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Travel Time-Based Task Mapping for NoC-Based DNN Accelerator
    Chen, Yizhi
    Zhu, Wenyao
    Lu, Zhonghai
    EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION, SAMOS 2024, PT I, 2025, 15226 : 76 - 92
  • [2] Communication Synchronization-Aware Arbitration Policy in NoC-Based DNN Accelerators
    Fan, Wenjie
    Li, Siyue
    Zhu, Lingxiao
    Lu, Zhonghai
    Li, Li
    Fu, Yuxiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (10) : 4521 - 4525
  • [3] NoC-based DNN Accelerator: A Future Design Paradigm
    Chen, Kun-Chih
    Ebrahimi, Masoumeh
    Wang, Ting-Yi
    Yang, Yuch-Chi
    PROCEEDINGS OF THE 13TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON NETWORKS-ON-CHIP (NOCS'19), 2019,
  • [4] Communication-Aware Application Mapping and Scheduling for NoC-Based MPSoCs
    Yu, Heng
    Ha, Yajun
    Veeravalli, Bharadwaj
    2010 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, 2010, : 3232 - 3235
  • [5] A Configurable Monitoring Infrastructure for NoC-Based Architectures
    Fiorin, Leandro
    Palermo, Gianluca
    Silvano, Cristina
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2014, 22 (11) : 2436 - 2440
  • [6] Neuron grouping and mapping methods for 2D-mesh NoC-based DNN accelerators
    Nacar, Furkan
    Cakin, Alperen
    Dilek, Selma
    Tosun, Suleyman
    Chakrabarty, Krishnendu
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2024, 193
  • [7] Run-time mapping and communication strategies for homogeneous NoC-Based MPSoCs
    Sassatelli, G.
    Saint-Jean, N.
    Benoit, P.
    Torres, L.
    Robert, M.
    Woszezenki, Cristiane
    Grehs, Ismael Augusto
    Moraes, Fernando
    FCCM 2007: 15TH ANNUAL IEEE SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, PROCEEDINGS, 2007, : 295 - +
  • [8] Differentiated Communication Services for NoC-Based MPSoCs
    Carara, Everton Alceu
    Calazans, Ney Laert Vilar
    Moraes, Fernando Gehm
    IEEE TRANSACTIONS ON COMPUTERS, 2014, 63 (03) : 595 - 608
  • [9] Computation and Communication Aware Run-Time Mapping for NoC-based MPSoC Platforms
    Kaushik, Samarth
    Singh, Amit Kumar
    Srikanthan, Thambipillai
    2011 IEEE INTERNATIONAL SOC CONFERENCE (SOCC), 2011, : 185 - 190
  • [10] Mapping Algorithms for NoC-based Heterogeneous MPSoC Platforms
    Singh, Amit Kumar
    Wu Jigang
    Prakash, Alok
    Srikanthan, Thambipillai
    PROCEEDINGS OF THE 2009 12TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN, ARCHITECTURES, METHODS AND TOOLS, 2009, : 133 - 140