An energy-efficient deep convolutional neural networks coprocessor for multi-object detection

被引:7
|
作者
Wu, Yuancong [1 ]
Wang, J. J. [1 ]
Qian, Kun [1 ]
Liu, Yanchen [1 ]
Guo, Rui [1 ]
Hu, S. G. [1 ]
Yu, Q. [1 ]
Chen, T. P. [2 ]
Liu, Y. [1 ]
Rong, Limei [1 ]
机构
[1] Univ Elect Sci & Technol China, State Key Lab Elect Thin Films & Integrated Devic, Chengdu 610054, Peoples R China
[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
关键词
Energy-efficient; Deep convolution neural networks; Coprocessor; Deep learning; Multi-object detection; Hardware accelerators; Mobile devices;
D O I
10.1016/j.mejo.2020.104737
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposed an energy-efficient deep convolution neural networks coprocessor (DCNNs-CP) architecture for multi-object detection applications based on deep learning algorithms. The DCNNs-CP can support both convolutional layers and fully connected layers to accelerate various mobile deep learning algorithms. It also supports maximum and mean pooling operations through a separate pooling module structure. Besides, a reconfigurable activation function module supporting four nonlinear functions is also realized in this coprocessor. The DCNNs-CP chip was implemented in 55 nm CMOS process technology and occupied the 4 mm(2) die area. The DCNNs-CP supports 8-bit and 16-bit fixed-point data precision and achieves a peak performance of 3.4 Tops/W at 1.2 V supply voltage and a maximum frequency of 500 MHz, represent 2.13x improvements over reported hardware accelerators. Besides, the chip achieves 0.85 Tops/W . mm(2) energy efficiency per area and 34.0 Tops/W . MB energy efficiency per memory (on-chip memory), making it suitable to be integrated with the mobile devices.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Spiking Deep Convolutional Neural Networks for Energy-Efficient Object Recognition
    Cao, Yongqiang
    Chen, Yang
    Khosla, Deepak
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 113 (01) : 54 - 66
  • [2] Spiking Deep Convolutional Neural Networks for Energy-Efficient Object Recognition
    Yongqiang Cao
    Yang Chen
    Deepak Khosla
    [J]. International Journal of Computer Vision, 2015, 113 : 54 - 66
  • [3] An Efficient Implementation of Deep Convolutional Neural Networks on a Mobile Coprocessor
    Jin, Jonghoon
    Gokhale, Vinayak
    Dundar, Aysegul
    Krishnamurthy, Bharadwaj
    Martini, Berin
    Culurciello, Eugenio
    [J]. 2014 IEEE 57TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2014, : 133 - 136
  • [4] ONLINE MULTI-OBJECT TRACKING WITH CONVOLUTIONAL NEURAL NETWORKS
    Chen, Long
    Ai, Haizhou
    Shang, Chong
    Zhuang, Zijie
    Bai, Bo
    [J]. 2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 645 - 649
  • [5] Multi-Object Tracking with Quadruplet Convolutional Neural Networks
    Son, Jeany
    Baek, Mooyeol
    Cho, Minsu
    Han, Bohyung
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3786 - 3795
  • [6] SCTN: Event-based object tracking with energy-efficient deep convolutional spiking neural networks
    Ji, Mingcheng
    Wang, Ziling
    Yan, Rui
    Liu, Qingjie
    Xu, Shu
    Tang, Huajin
    [J]. FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [7] MDCN: Multi-Scale, Deep Inception Convolutional Neural Networks for Efficient Object Detection
    Ma, Wenchi
    Wu, Yuanwei
    Wang, Zongbo
    Wang, Guanghui
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2510 - 2515
  • [8] A Pipelined Energy-efficient Hardware Accelaration for Deep Convolutional Neural Networks
    Alaeddine, Hmidi
    Jihene, Malek
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON DESIGN & TEST OF INTEGRATED MICRO & NANO-SYSTEMS (DTS), 2019,
  • [9] Eyeriss: An Energy-Efficient Reconfigurable Accelerator for Deep Convolutional Neural Networks
    Chen, Yu-Hsin
    Krishna, Tushar
    Emer, Joel S.
    Sze, Vivienne
    [J]. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2017, 52 (01) : 127 - 138
  • [10] Eyeriss: An Energy-Efficient Reconfigurable Accelerator for Deep Convolutional Neural Networks
    Chen, Yu-Hsin
    Krishna, Tushar
    Emer, Joel
    Sze, Vivienne
    [J]. 2016 IEEE INTERNATIONAL SOLID-STATE CIRCUITS CONFERENCE (ISSCC), 2016, 59 : 262 - U363