DUET: Boosting Deep Neural Network Efficiency on Dual-Module Architecture

被引:18
|
作者
Liu, Liu [1 ]
Qu, Zheng [1 ]
Deng, Lei [1 ]
Tu, Fengbin [1 ]
Li, Shuangchen [1 ]
Hu, Xing [1 ]
Gu, Zhenyu [2 ]
Ding, Yufei [1 ]
Xie, Yuan [1 ]
机构
[1] UC Santa Barbara, Santa Barbara, CA 93106 USA
[2] Alibaba DAMO Acad, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
Neural networks; accelerator architecture;
D O I
10.1109/MICRO50266.2020.00066
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deep Neural Networks (DNNs) have been driving the mainstream of Machine Learning applications. However, deploying DNNs on modern hardware with stringent latency requirements and energy constraints is challenging because of the compute-intensive and memory-intensive execution patterns of various DNN models. We propose an algorithm-architecture co-design to boost DNN execution efficiency. Leveraging the noise resilience of nonlinear activation functions in DNNs, we propose dual-module processing that uses approximate modules learned from original DNN layers to compute insensitive activations. Therefore, we can save expensive computations and data accesses of unnecessary sensitive activations. We then design an Executor-Speculator dual-module architecture with support for balance execution and memory access reduction. With acceptable model inference quality degradation, our accelerator design can achieve 2.24x speedup and 1.97x energy efficiency improvement for compute-bound Convolutional Neural Networks (CNNs) and memory-bound Recurrent Neural Networks (RNNs).
引用
收藏
页码:738 / 750
页数:13
相关论文
共 50 条
  • [1] Fault diagnosis of industrial robot based on dual-module attention convolutional neural network
    Lu K.
    Chen C.
    Wang T.
    Cheng L.
    Qin J.
    [J]. Autonomous Intelligent Systems, 2 (1):
  • [2] A Deep Convolutional Neural Network Architecture for Boosting Image Discrimination Accuracy of Rice Species
    Lin, P.
    Li, X. L.
    Chen, Y. M.
    He, Y.
    [J]. FOOD AND BIOPROCESS TECHNOLOGY, 2018, 11 (04) : 765 - 773
  • [3] A Deep Convolutional Neural Network Architecture for Boosting Image Discrimination Accuracy of Rice Species
    P. Lin
    X. L. Li
    Y. M. Chen
    Y. He
    [J]. Food and Bioprocess Technology, 2018, 11 : 765 - 773
  • [4] A dual-stream deep neural network integrated with adaptive boosting for sleep staging
    Fang, Yongkangjian
    Xia, Yi
    Chen, Peng
    Zhang, Jun
    Zhang, Yongliang
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 79
  • [5] Deep Learning-Enhanced Dual-Module Large-Throughput Microinjection System for Adherent Cells
    Pan, Fei
    Jiao, Yang
    Chen, Shuxun
    Xing, Liuxi
    Sun, Dong
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 20 (04) : 2409 - 2422
  • [6] A Deep Neural Network with Module Architecture for Model Reduction and its Application to Nonlinear System Identification
    Takano, Seiya
    Kawaguchi, Takahiro
    Asami, Satoshi
    Sasaki, Risako
    Sugimoto, Seiya
    Shinya, Yoshiyuki
    Adachi, Shuichi
    [J]. IFAC PAPERSONLINE, 2023, 56 (02): : 10650 - 10655
  • [7] Dual-module multi-head spatiotemporal joint network with SACGA for wind turbines fault detection
    Wang, Tian
    Yin, Linfei
    [J]. ENERGY, 2024, 308
  • [8] INTELLIGENT STOCK TRADING SYSTEM WITH PRICE TREND PREDICTION AND REVERSAL RECOGNITION USING DUAL-MODULE NEURAL NETWORKS
    JANG, GS
    LAI, FP
    JIANG, BW
    PARNG, TM
    CHIEN, LH
    [J]. APPLIED INTELLIGENCE, 1993, 3 (03) : 225 - 248
  • [9] Performance Analysis of a Dual Stage Deep Rain Streak Removal Convolution Neural Network Module with a Modified Deep Residual Dense Network
    Jayaraman, Thiyagarajan
    Chinnusamy, Gowri Shankar
    [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2022, 32 (01) : 111 - 123
  • [10] On a Unified Deep Neural Network Decoding Architecture
    Artemasov, Dmitry
    Andreev, Kirill
    Frolov, Alexey
    [J]. 2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,