Recent advances in efficient computation of deep convolutional neural networks

被引:155
|
作者
Cheng, Jian [1 ,2 ]
Wang, Pei-song [1 ,2 ]
Li, Gang [1 ,2 ]
Hu, Qing-hao [1 ,2 ]
Lu, Han-qing [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
Deep neural networks; Acceleration; Compression; Hardware accelerator;
D O I
10.1631/FITEE.1700789
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep neural networks have evolved remarkably over the past few years and they are currently the fundamental tools of many intelligent systems. At the same time, the computational complexity and resource consumption of these networks continue to increase. This poses a significant challenge to the deployment of such networks, especially in real-time applications or on resource-limited devices. Thus, network acceleration has become a hot topic within the deep learning community. As for hardware implementation of deep neural networks, a batch of accelerators based on a field-programmable gate array (FPGA) or an application-specific integrated circuit (ASIC) have been proposed in recent years. In this paper, we provide a comprehensive survey of recent advances in network acceleration, compression, and accelerator design from both algorithm and hardware points of view. Specifically, we provide a thorough analysis of each of the following topics: network pruning, low-rank approximation, network quantization, teacher-student networks, compact network design, and hardware accelerators. Finally, we introduce and discuss a few possible future directions.
引用
收藏
页码:64 / 77
页数:14
相关论文
共 50 条
  • [1] Recent advances in efficient computation of deep convolutional neural networks
    Jian CHENG
    Pei-song WANG
    Gang LI
    Qing-hao HU
    Han-qing LU
    [J]. Frontiers of Information Technology & Electronic Engineering, 2018, 19 (01) : 64 - 77
  • [2] Recent advances in efficient computation of deep convolutional neural networks
    Jian Cheng
    Pei-song Wang
    Gang Li
    Qing-hao Hu
    Han-qing Lu
    [J]. Frontiers of Information Technology & Electronic Engineering, 2018, 19 : 64 - 77
  • [3] Recent advances in convolutional neural networks
    Gu, Jiuxiang
    Wang, Zhenhua
    Kuen, Jason
    Ma, Lianyang
    Shahroudy, Amir
    Shuai, Bing
    Liu, Ting
    Wang, Xingxing
    Wang, Gang
    Cai, Jianfei
    Chen, Tsuhan
    [J]. PATTERN RECOGNITION, 2018, 77 : 354 - 377
  • [4] Efficient Computation of Robustness of Convolutional Neural Networks
    Arcaini, Paolo
    Bombarda, Andrea
    Bonfanti, Silvia
    Gargantini, Angelo
    [J]. THIRD IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE TESTING (AITEST 2021), 2021, : 21 - 28
  • [5] An Efficient Accelerator for Deep Convolutional Neural Networks
    Kuo, Yi-Xian
    Lai, Yeong-Kang
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TAIWAN), 2020,
  • [6] Advances in Very Deep Convolutional Neural Networks for LVCSR
    Sercu, Tom
    Goel, Vaibhava
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3429 - 3433
  • [7] A survey of the recent architectures of deep convolutional neural networks
    Khan, Asifullah
    Sohail, Anabia
    Zahoora, Umme
    Qureshi, Aqsa Saeed
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (08) : 5455 - 5516
  • [8] A survey of the recent architectures of deep convolutional neural networks
    Asifullah Khan
    Anabia Sohail
    Umme Zahoora
    Aqsa Saeed Qureshi
    [J]. Artificial Intelligence Review, 2020, 53 : 5455 - 5516
  • [9] CEModule: A Computation Efficient Module for Lightweight Convolutional Neural Networks
    Liang, Yu
    Li, Maozhen
    Jiang, Changjun
    Liu, Guanjun
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (09) : 6069 - 6080
  • [10] Efficient Incremental Training for Deep Convolutional Neural Networks
    Tao, Yudong
    Tu, Yuexuan
    Shyu, Mei-Ling
    [J]. 2019 2ND IEEE CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2019), 2019, : 286 - 291