Efficient Zero-Activation-Skipping for On-Chip Low-Energy CNN Acceleration

被引:2
|
作者
Liu, Min [1 ]
He, Yifan [2 ]
Jiao, Hailong [1 ]
机构
[1] Peking Univ, Shenzhen Grad Sch, VLSI Lab, Sch Elect & Comp Engn, Shenzhen, Peoples R China
[2] Reconova Technol Co Ltd, Xiamen, Peoples R China
关键词
Sparse matrix; convolution; sparse storage; input feature map; speedup; energy savings; energy efficiency; PROCESSOR;
D O I
10.1109/AICAS51828.2021.9458578
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A new convolution paradigm is proposed for convolutional neural networks (CNNs) in this paper, which can efficiently skip the storage and computation of zeros in the input feature maps, by exploring the position information of the activations. Furthermore, by reusing the activations as much as possible, load balance is achieved among different processing elements without hardware cost. With the proposed sparse convolution technique, the calculation speed is enhanced by 7.29x for convolutional layers with a sparsity of 90% and by 2.59x for running VGG16 with ImageNet2012 dataset, compared to the traditional convolution method. Implemented in a UMC 55-nm low power CMOS technology, a CNN accelerator with the proposed technique achieves an effective energy efficiency of 1.94 TOPS/W while running at 100 MHz and 1.08 V supply voltage.
引用
收藏
页数:4
相关论文
共 50 条
  • [31] Delay and energy efficient design of on-chip encoded bus with repeaters
    Zhang, Qingh
    Wang, Jinxiang
    Ye, Yizheng
    21ST INTERNATIONAL CONFERENCE ON VLSI DESIGN: HELD JOINTLY WITH THE 7TH INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS, PROCEEDINGS, 2008, : 377 - 382
  • [32] Runtime Multi-Optimizations for Energy Efficient On-chip Interconnections
    He, Yuan
    Kondo, Masaaki
    Nakada, Takashi
    Sasaki, Hiroshi
    Miwa, Shinobu
    Nakamura, Hiroshi
    2015 33RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2015, : 455 - 458
  • [33] SNN Model for Highly Energy and Area Efficient On-Chip Classification
    Biswas, Anmol
    Shukla, Aditya
    Prasad, Sidharth
    Ganguly, Udayan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 763 - 764
  • [34] Energy Efficient On-Chip Communications Implementation Based on Power Slacks
    XiaoYu Xia
    WenMing Pan
    JiaChong Kan
    JournalofElectronicScienceandTechnology, 2014, 12 (04) : 354 - 360
  • [35] Flexible On-Chip Power Delivery for Energy Efficient Heterogeneous Systems
    Calhoun, Benton H.
    Craig, Kyle
    2013 50TH ACM / EDAC / IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2013,
  • [36] Energy efficient adaptive clustering of on-chip power delivery systems
    Vaisband, Inna
    Friedman, Eby G.
    INTEGRATION-THE VLSI JOURNAL, 2015, 48 : 1 - 9
  • [37] Energy Efficient On-Chip Communications Implementation Based on Power Slacks
    Xiao-Yu Xia
    Wen-Ming Pan
    Jia-Chong Kan
    Journal of Electronic Science and Technology, 2014, (04) : 354 - 360
  • [38] Heterogeneous Methodology for Energy Efficient Distribution of On-Chip Power Supplies
    Vaisband, Inna
    Friedman, Eby G.
    IEEE TRANSACTIONS ON POWER ELECTRONICS, 2013, 28 (09) : 4267 - 4280
  • [39] A low-energy chip-set for wireless intercom
    Ammer, MJ
    Sheets, M
    Karalar, T
    Kuulusa, M
    Rabaey, J
    40TH DESIGN AUTOMATION CONFERENCE, PROCEEDINGS 2003, 2003, : 916 - 919
  • [40] Zero- and low-energy housing for the Mediterranean climate
    Ferrante, Annarita
    ADVANCES IN BUILDING ENERGY RESEARCH, 2012, 6 (01) : 81 - 118