ComPreEND: Computation Pruning through Predictive Early Negative Detection for ReLU in a Deep Neural Network Accelerator

Authors
Kim, Namhyung [1]
Park, Hanmin [1]
Lee, Dongwoo [1]
Kang, Sungbum [1]
Lee, Jinho [2]
Choi, Kiyoung [1]
Affiliations
[1] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul 08826, South Korea
[2] Yonsei Univ, Dept Comp Sci, Seoul 03722, South Korea
Funding
National Research Foundation, Singapore;
Keywords
Neural networks; Encoding; Hardware; Market research; Adders; Three-dimensional displays; Energy consumption; Early negative detection; prediction; computation pruning; deep neural network; accelerator;
DOI
10.1109/TC.2021.3092205
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
A vast amount of activation values of DNNs are zeros due to ReLU (Rectified Linear Unit), which is one of the most common activation functions used in modern neural networks. Since ReLU outputs zero for all negative inputs, the inputs to ReLU do not need to be determined exactly as long as they are negative. However, many accelerators usually do not consider such aspects of DNNs, losing a huge amount of opportunities for speedups and energy savings. To exploit such opportunities, we propose early negative detection (END), a computation pruning technique that detects the negative results at an early stage. The key to the early negative detection is the adoption of inverted two's complement representation for filter parameters. This ensures that as soon as the intermediate results become negative, the final results are guaranteed to be negative. Upon detection, the remaining computation can be skipped and the following ReLU output can be simply set to zero. We also propose a DNN accelerator architecture (ComPreEND) that takes advantage of such skipping. ComPreEND with END significantly improves both the energy efficiency and the performance according to the evaluation. Compared to the baseline, we obtain 20.5 and 29.3 percent speedup with accurate mode and predictive mode, and energy savings by 28.4 and 41.4 percent, respectively.
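The early-negative-detection idea in the abstract can be illustrated with a small software sketch. It is not the authors' hardware design. It assumes that "inverted two's complement" means the most significant bit of each weight carries a positive place value and all lower bits carry negative place values, that weights are processed bit-serially from the MSB down, and that activations are non-negative (as they are after a preceding ReLU). The function names `encode_inverted` and `relu_dot_with_end` are hypothetical. Under these assumptions the partial sum can only decrease after the first step, so a negative partial sum guarantees a negative final sum.

```python
def encode_inverted(w, n_bits=8):
    """Return bits (MSB first) with w = b[0]*2^(n-1) - sum(lower bits * 2^k).

    Assumed encoding: the MSB has positive weight, all other bits negative.
    Representable range is -(2^(n-1) - 1) .. 2^(n-1).
    """
    lo, hi = -(2 ** (n_bits - 1) - 1), 2 ** (n_bits - 1)
    if not lo <= w <= hi:
        raise ValueError(f"weight {w} outside [{lo}, {hi}]")
    # The bit pattern equals the ordinary two's complement pattern of -w.
    pattern = (-w) & ((1 << n_bits) - 1)
    return [(pattern >> k) & 1 for k in range(n_bits - 1, -1, -1)]


def relu_dot_with_end(weights, activations, n_bits=8):
    """Compute ReLU(sum(w * a)) bit-serially, stopping once the sum is negative.

    Returns (relu_output, bit_steps_used).
    """
    if any(a < 0 for a in activations):
        raise ValueError("activations must be non-negative")
    bit_rows = [encode_inverted(w, n_bits) for w in weights]
    partial = 0
    for step in range(n_bits):
        k = n_bits - 1 - step  # place value exponent of this bit plane
        # Sum the activations whose weight has a 1 in this bit position.
        plane = sum(a for bits, a in zip(bit_rows, activations) if bits[step])
        # Only the MSB plane adds; every later plane subtracts.
        partial += (plane << k) if step == 0 else -(plane << k)
        if partial < 0:
            # Later planes can only subtract more, so the result stays negative.
            return 0, step + 1
    return partial, n_bits  # partial >= 0 here, so ReLU leaves it unchanged


if __name__ == "__main__":
    print(relu_dot_with_end([3, -5], [2, 4]))  # negative sum, exits early
    print(relu_dot_with_end([3, 2], [2, 1]))   # positive sum, runs all steps
```

In this sketch the skipped bit steps stand in for the computation that the abstract says can be pruned; the predictive mode mentioned in the abstract is not modeled.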
Pages: 1537-1550
Page count: 14
Related papers
50 in total
  • [1] ComPEND: Computation Pruning through Early Negative Detection for ReLU in a Deep Neural Network Accelerator
    Lee, Dongwoo
    Kang, Sungbum
    Choi, Kiyoung
    [J]. INTERNATIONAL CONFERENCE ON SUPERCOMPUTING (ICS 2018), 2018, : 139 - 148
  • [2] A Novel Architecture for Early Detection of Negative Output Features in Deep Neural Network Accelerators
    Asadikouhanjani, Mohammadreza
    Ko, Seok-Bum
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2020, 67 (12) : 3332 - 3336
  • [3] Deep neural network compression through interpretability-based filter pruning
    Yao, Kaixuan
    Cao, Feilong
    Leung, Yee
    Liang, Jiye
    [J]. PATTERN RECOGNITION, 2021, 119
  • [4] Absorption Pruning of Deep Neural Network for Object Detection in Remote Sensing Imagery
    Wang, Jielei
    Cui, Zongyong
    Zang, Zhipeng
    Meng, Xiangjie
    Cao, Zongjie
    [J]. REMOTE SENSING, 2022, 14 (24)
  • [5] SnaPEA: Predictive Early Activation for Reducing Computation in Deep Convolutional Neural Networks
    Akhlaghi, Vahideh
    Yazdanbakhsh, Amir
    Samadi, Kambiz
    Gupta, Rajesh K.
    Esmaeilzadeh, Hadi
    [J]. 2018 ACM/IEEE 45TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2018, : 662 - 673
  • [6] Efficient computation of wireless sensor network lifetime through deep neural networks
    Muhammed Yilmaz
    Ahmet Murat Ozbayoglu
    Bulent Tavli
    [J]. Wireless Networks, 2021, 27 : 2055 - 2065
  • [7] Efficient computation of wireless sensor network lifetime through deep neural networks
    Yilmaz, Muhammed
    Ozbayoglu, Ahmet Murat
    Tavli, Bulent
    [J]. WIRELESS NETWORKS, 2021, 27 (03) : 2055 - 2065
  • [8] A Deep Convolutional Neural Network for the Early Detection of Heart Disease
    Arooj, Sadia
    Rehman, Saif Ur
    Imran, Azhar
    Almuhaimeed, Abdullah
    Alzahrani, A. Khuzaim
    Alzahrani, Abdulkareem
    [J]. BIOMEDICINES, 2022, 10 (11)
  • [9] Advancing energy efficiency of spiking neural network accelerator via dynamic predictive early stopping
    Miao, Yijie
    Ikeda, Makoto
    [J]. IEICE ELECTRONICS EXPRESS, 2024,
  • [10] P-DNN: An Effective Intrusion Detection Method based on Pruning Deep Neural Network
    Lei, Mingjian
    Li, Xiaoyong
    Cai, Binsi
    Li, Yunfeng
    Liu, Limengwei
    Kong, Wenping
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,