SC-DCNN: Highly-Scalable Deep Convolutional Neural Network using Stochastic Computing

被引:41
|
作者
Ren, Ao [1 ]
Li, Zhe [1 ]
Ding, Caiwen [1 ]
Qiu, Qinru [1 ]
Wang, Yanzhi [1 ]
Li, Ji [2 ]
Qian, Xuehai [2 ]
Yuan, Bo [3 ]
机构
[1] Syracuse Univ, Dept Elect Engn & Comp Sci, Syracuse, NY 13244 USA
[2] Univ Southern Calif, Dept Elect Engn, Los Angeles, CA 90089 USA
[3] CUNY City Coll, Dept Elect Engn, New York, NY 10031 USA
关键词
IMPLEMENTATION;
D O I
10.1145/3093336.3037746
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
With the recent advance of wearable devices and Internet of Things (IoTs), it becomes attractive to implement the Deep Convolutional Neural Networks (DCNNs) in embedded and portable systems. Currently, executing the software-based DCNNs requires high-performance servers, restricting the widespread deployment on embedded and mobile IoT devices. To overcome this obstacle, considerable research efforts have been made to develop highly-parallel and specialized DCNN accelerators using GPGPUs, FPGAs or ASICs. Stochastic Computing (SC), which uses a bit-stream to represent a number within [-1, 1] by counting the number of ones in the bit-stream, has high potential for implementing DCNNs with high scalability and ultra-low hardware footprint. Since multiplications and additions can be calculated using AND gates and multiplexers in SC, significant reductions in power (energy) and hardware footprint can be achieved compared to the conventional binary arithmetic implementations. The tremendous savings in power (energy) and hardware resources allow immense design space for enhancing scalability and robustness for hardware DCNNs. This paper presents SC-DCNN, the first comprehensive design and optimization framework of SC-based DCNNs, using a bottom-up approach. We first present the designs of function blocks that perform the basic operations in DCNN, including inner product, pooling, and activation function. Then we propose four designs of feature extraction blocks, which are in charge of extracting features from input feature maps, by connecting different basic function blocks with joint optimization. Moreover, the efficient weight storage methods are proposed to reduce the area and power (energy) consumption. Putting all together, with feature extraction blocks carefully selected, SC-DCNN is holistically optimized to minimize area and power (energy) consumption while maintaining high network accuracy. Experimental results demonstrate that the LeNet5 implemented in SCDCNN consumes only 17 mm(2) area and 1.53 W power, achieves throughput of 781250 images/s, area efficiency of 45946 images/s/mm(2), and energy efficiency of 510734 images/J.
引用
收藏
页码:405 / 418
页数:14
相关论文
共 50 条
  • [21] Scalable FPGA Accelerator for Deep Convolutional Neural Networks with Stochastic Streaming
    Alawad, Mohammed
    Lin, Mingjie
    IEEE TRANSACTIONS ON MULTI-SCALE COMPUTING SYSTEMS, 2018, 4 (04): : 888 - 899
  • [22] Accurate and compact convolutional neural network based on stochastic computing
    Abdellatef, Hamdan
    Khalil-Hani, Mohamed
    Shaikh-Husin, Nasir
    Ayat, Sayed Omid
    NEUROCOMPUTING, 2022, 471 : 31 - 47
  • [23] FPGA Implementation of Convolutional Neural Network Based on Stochastic Computing
    Kim, Daewoo
    Moghaddam, Mansureh S.
    Moradian, Hossein
    Sim, Hyeonuk
    Lee, Jongeun
    Choi, Kiyoung
    2017 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY (ICFPT), 2017, : 287 - 290
  • [24] A novel scalable method for machine degradation assessment using deep convolutional neural network
    Li, Pin
    Jia, Xiaodong
    Feng, Jianshe
    Zhu, Feng
    Miller, Marcella
    Chen, Liang-Yu
    Lee, Jay
    MEASUREMENT, 2020, 151
  • [25] An Optimized Framework of Video Compression Using Deep Convolutional Neural Networks (DCNN)
    Sreelatha, M.
    Tulasi, R. Lakshmi
    Kumar, K. Siva
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (05): : 515 - 522
  • [26] Feasibility Study of Fish Disease Detection using Computer Vision and Deep Convolutional Neural Network (DCNN) Algorithm
    Yasruddin, Muhammad Luqman
    Ismail, Muhammad Amir Hakim
    Husin, Zulkifli
    Tan, Wei Keong
    2022 IEEE 18TH INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING & APPLICATIONS (CSPA 2022), 2022, : 272 - 276
  • [27] FPGA-based implementation of deep neural network using stochastic computing
    Nobari, Maedeh
    Jahanirad, Hadi
    APPLIED SOFT COMPUTING, 2023, 137
  • [28] A New Stochastic Computing Multiplier with Application to Deep Convolutional Neural Networks
    Sim, Hyeonuk
    Lee, Jongeun
    PROCEEDINGS OF THE 2017 54TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2017,
  • [29] Scalable flood level trend monitoring with surveillance cameras using a deep convolutional neural network
    de Vitry, Matthew Moy
    Kramer, Simon
    Wegner, Jan Dirk
    Leitao, Joao P.
    HYDROLOGY AND EARTH SYSTEM SCIENCES, 2019, 23 (11) : 4621 - 4634
  • [30] Parallel Convolutional Neural Network (CNN) Accelerators Based on Stochastic Computing
    Zhang, Yawen
    Zhang, Xinyue
    Song, Jiahao
    Wang, Yuan
    Huang, Ru
    Wang, Runsheng
    PROCEEDINGS OF THE 2019 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS 2019), 2019, : 19 - 24