F-E3D: FPGA-based Acceleration of an Efficient 3D Convolutional Neural Network for Human Action Recognition

被引:34
|
作者
Fan, Hongxiang [1 ]
Luo, Cheng [2 ]
Zeng, Chenglong [3 ]
Ferianc, Martin [1 ]
Que, Zhiqiang [1 ]
Liu, Shuanglong [1 ]
Niu, Xinyu [4 ]
Luk, Wayne [1 ]
机构
[1] Imperial Coll London, Sch Engn, Dept Comp, London, England
[2] Fudan Univ, State Key Lab ASIC & Syst, Shanghai, Peoples R China
[3] Tianjin Univ, Sch Microelect, Tianjin, Peoples R China
[4] Corerain Technol Ltd, Shenzhen, Peoples R China
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.1109/ASAP.2019.00-44
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Three-dimensional convolutional neural networks (3D CNNs) have demonstrated their outstanding classification accuracy for human action recognition (HAR). However, the large number of computations and parameters in 3D CNNs limits their deployability in real-life applications. To address this challenge, this paper adopts an algorithm-hardware co-design method by proposing an efficient 3D CNN building unit called 3D-1 bottleneck residual block (3D-1 BRB) at the algorithm level, and a corresponding FPGA-based hardware architecture called F-E3D at hardware level. Based on 3D-1 BRB, a novel 3D CNN model called E3DNet is developed, which achieves nearly 37 times reduction in model size and 5% improvement in accuracy compared to standard 3D CNNs on the UCF101 dataset. Together with several hardware optimizations, including 3D fused BRB, online blocking and kernel reuse, the proposed F-E3D is nearly 13 times faster than a previous FPGA design for 3D CNNs, with performance and accuracy comparable to other state-of-the-art 3D CNN models on GPU platforms while requiring only 7% of their energy consumption.
引用
下载
收藏
页码:1 / 8
页数:8
相关论文
共 50 条
  • [11] An Improved Two-stream 3D Convolutional Neural Network for Human Action Recognition
    Chen, Jun
    Xu, Yuanping
    Zhang, Chaolong
    Xu, Zhijie
    Meng, Xiangxiang
    Wang, Jie
    2019 25TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC), 2019, : 135 - 140
  • [12] Improving human action recognition with two-stream 3D convolutional neural network
    Van-Minh Khong
    Thanh-Hai Tran
    2018 1ST INTERNATIONAL CONFERENCE ON MULTIMEDIA ANALYSIS AND PATTERN RECOGNITION (MAPR), 2018,
  • [13] An efficient attention module for 3d convolutional neural networks in action recognition
    Guanghao Jiang
    Xiaoyan Jiang
    Zhijun Fang
    Shanshan Chen
    Applied Intelligence, 2021, 51 : 7043 - 7057
  • [14] Temporal Residual Feature Learning for Efficient 3D Convolutional Neural Network on Action Recognition Task
    Wang, Haonan
    Mei, Yuchen
    Lin, Jun
    Wang, Zhongfeng
    2020 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2020, : 123 - 128
  • [15] Automatic 3D Pollen Recognition Based on Convolutional Neural Network
    Wang, Zhuo
    Wang, Zixuan
    Wang, Likai
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [16] 3D Convolutional Neural Network based on memristor for video recognition
    Liu, Jiaqi
    Li, Zhenghao
    Tang, Yongliang
    Hu, Wei
    Wu, Jun
    PATTERN RECOGNITION LETTERS, 2020, 130 (130) : 116 - 124
  • [17] Asymmetric 3D Convolutional Neural Networks for action recognition
    Yang, Hao
    Yuan, Chunfeng
    Li, Bing
    Du, Yang
    Xing, Junliang
    Hu, Weiming
    Maybank, Stephen J.
    PATTERN RECOGNITION, 2019, 85 : 1 - 12
  • [18] END-TO-END LEARNING OF DEEP CONVOLUTIONAL NEURAL NETWORK FOR 3D HUMAN ACTION RECOGNITION
    Li, Chao
    Sun, Shouqian
    Min, Xin
    Lin, Wenqian
    Nie, Binling
    Zhang, Xianfu
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,
  • [19] Action Recognition Based on Features Fusion and 3D Convolutional Neural Networks
    Liu, Lulu
    Hu, Fangyu
    Zhou, Jiahui
    PROCEEDINGS OF 2016 9TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 1, 2016, : 178 - 181
  • [20] 3D skeleton-based action recognition with convolutional neural networks
    Van-Nam Hoang
    Thi-Lan Le
    Thanh-Hai Tran
    Hai-Vu
    Van-Toi Nguyen
    2019 INTERNATIONAL CONFERENCE ON MULTIMEDIA ANALYSIS AND PATTERN RECOGNITION (MAPR), 2019,