Hardware-Driven Nonlinear Activation for Stochastic Computing Based Deep Convolutional Neural Networks

Authors
Li, Ji [1]
Yuan, Zihao [1]
Li, Zhe [2]
Ding, Caiwen [2]
Ren, Ao [2]
Qiu, Qinru [2]
Draper, Jeffrey [1, 3]
Wang, Yanzhi [2]
Affiliations
[1] Univ Southern Calif, Dept Elect Engn, Los Angeles, CA 90007 USA
[2] Syracuse Univ, Coll Engn & Comp Sci, Syracuse, NY USA
[3] Univ Southern Calif, Informat Sci Inst, Marina Del Rey, CA 90292 USA
Keywords
Deep Convolutional Neural Networks; Stochastic Computing; Deep Learning; Activation Function;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Recently, Deep Convolutional Neural Networks (DCNNs) have made unprecedented progress, achieving accuracy close to, or even better than, human-level perception in various tasks. There is a timely need to map the latest software DCNNs to application-specific hardware in order to achieve orders-of-magnitude improvements in performance, energy efficiency, and compactness. Stochastic Computing (SC), as a low-cost alternative to the conventional binary computing paradigm, has the potential to enable massively parallel and highly scalable hardware implementations of DCNNs. One major challenge in SC-based DCNNs is designing accurate nonlinear activation functions, which have a significant impact on network-level accuracy but cannot be implemented accurately by existing SC computing blocks. In this paper, we design and optimize SC-based neurons, and we propose highly accurate activation designs for the three most frequently used activation functions in software DCNNs, i.e., hyperbolic tangent, logistic, and rectified linear units (ReLU). Experimental results on LeNet-5 using the MNIST dataset demonstrate that, compared with a binary ASIC hardware DCNN, the DCNN with the proposed SC neurons can achieve up to 61X, 151X, and 2X improvement in terms of area, power, and energy, respectively, at the cost of a small precision degradation. In addition, the SC approach achieves up to 21X and 41X of the area, 41X and 72X of the power, and 198200X and 96443X of the energy, compared with CPU and GPU approaches, respectively, while the error is increased by less than 3.07%. ReLU activation is suggested for future SC-based DCNNs considering its superior performance under a small bit-stream length.
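As background for the abstract, the following is a minimal sketch of the general stochastic computing idea, not the neuron or activation designs proposed in the paper. It assumes the common bipolar encoding, in which a value x in [-1, 1] is represented by a random bit stream with P(bit = 1) = (x + 1) / 2, so that multiplication reduces to a bitwise XNOR of two independent streams. The function names, the stream length of 4096, and the example operands are illustrative assumptions.

```python
import random


def encode_bipolar(x, length, rng):
    """Encode x in [-1, 1] as a bit stream with P(bit = 1) = (x + 1) / 2."""
    p_one = (x + 1.0) / 2.0
    return [1 if rng.random() < p_one else 0 for _ in range(length)]


def decode_bipolar(bits):
    """Estimate the encoded value from the fraction of ones: x = 2 * P(1) - 1."""
    return 2.0 * sum(bits) / len(bits) - 1.0


def sc_multiply(a_bits, b_bits):
    """Bipolar SC multiplication: bitwise XNOR of two independent streams."""
    return [1 - (a ^ b) for a, b in zip(a_bits, b_bits)]


# Illustrative values (assumptions, not taken from the paper).
rng = random.Random(0)
length = 4096
a, b = 0.5, -0.6

product_bits = sc_multiply(
    encode_bipolar(a, length, rng),
    encode_bipolar(b, length, rng),
)
estimate = decode_bipolar(product_bits)
print(f"exact={a * b:+.3f} estimate={estimate:+.3f}")
```

The estimate is noisy, and its error shrinks roughly with the square root of the stream length. This is why the bit-stream length mentioned at the end of the abstract trades accuracy against latency and energy.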
Pages: 1230 - 1236
Number of pages: 7