F-E3D: FPGA-based Acceleration of an Efficient 3D Convolutional Neural Network for Human Action Recognition

被引:34
|
作者
Fan, Hongxiang [1 ]
Luo, Cheng [2 ]
Zeng, Chenglong [3 ]
Ferianc, Martin [1 ]
Que, Zhiqiang [1 ]
Liu, Shuanglong [1 ]
Niu, Xinyu [4 ]
Luk, Wayne [1 ]
机构
[1] Imperial Coll London, Sch Engn, Dept Comp, London, England
[2] Fudan Univ, State Key Lab ASIC & Syst, Shanghai, Peoples R China
[3] Tianjin Univ, Sch Microelect, Tianjin, Peoples R China
[4] Corerain Technol Ltd, Shenzhen, Peoples R China
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.1109/ASAP.2019.00-44
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Three-dimensional convolutional neural networks (3D CNNs) have demonstrated their outstanding classification accuracy for human action recognition (HAR). However, the large number of computations and parameters in 3D CNNs limits their deployability in real-life applications. To address this challenge, this paper adopts an algorithm-hardware co-design method by proposing an efficient 3D CNN building unit called 3D-1 bottleneck residual block (3D-1 BRB) at the algorithm level, and a corresponding FPGA-based hardware architecture called F-E3D at hardware level. Based on 3D-1 BRB, a novel 3D CNN model called E3DNet is developed, which achieves nearly 37 times reduction in model size and 5% improvement in accuracy compared to standard 3D CNNs on the UCF101 dataset. Together with several hardware optimizations, including 3D fused BRB, online blocking and kernel reuse, the proposed F-E3D is nearly 13 times faster than a previous FPGA design for 3D CNNs, with performance and accuracy comparable to other state-of-the-art 3D CNN models on GPU platforms while requiring only 7% of their energy consumption.
引用
下载
收藏
页码:1 / 8
页数:8
相关论文
共 50 条
  • [1] Human Action Recognition with 3D Convolutional Neural Network
    Lima, Tiago
    Fernandes, Bruno
    Barros, Pablo
    2017 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2017,
  • [2] 3D Convolutional Neural Network for Action Recognition
    Zhang, Junhui
    Chen, Li
    Tian, Jing
    COMPUTER VISION, PT I, 2017, 771 : 600 - 607
  • [3] F-C3D: FPGA-based 3-Dimensional Convolutional Neural Network
    Fan, Hongxiang
    Niu, Xinyu
    Liu, Qiang
    Luk, Wayne
    2017 27TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2017,
  • [4] 3D Convolutional Neural Networks for Human Action Recognition
    Ji, Shuiwang
    Xu, Wei
    Yang, Ming
    Yu, Kai
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (01) : 221 - 231
  • [5] Action Recognition by 3D Convolutional Network
    Brezovsky, Matus
    Sopiak, Dominik
    Oravec, Milos
    PROCEEDINGS OF ELMAR-2018: 60TH INTERNATIONAL SYMPOSIUM ELMAR-2018, 2018, : 71 - 74
  • [6] Skeleton-Based Square Grid for Human Action Recognition With 3D Convolutional Neural Network
    Ding, Wenwen
    Ding, Chongyang
    Li, Guang
    Liu, Kai
    IEEE ACCESS, 2021, 9 : 54078 - 54089
  • [7] An improved memristor-based 3D Convolutional Neural Network for action recognition
    Wang, Yining
    Li, Ke
    Shen, Siyuan
    Duan, Shukai
    Proceedings of SPIE - The International Society for Optical Engineering, 2023, 12707
  • [8] An efficient 3D convolutional neural network with informative 3D volumes for human activity recognition using wearable sensors‏
    Saeedeh Zebhi
    Multimedia Tools and Applications, 2024, 83 : 42233 - 42256
  • [9] An efficient 3D convolutional neural network with informative 3D volumes for human activity recognition using wearable sensors
    Zebhi, Saeedeh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (14) : 42233 - 42256
  • [10] An efficient attention module for 3d convolutional neural networks in action recognition
    Jiang, Guanghao
    Jiang, Xiaoyan
    Fang, Zhijun
    Chen, Shanshan
    APPLIED INTELLIGENCE, 2021, 51 (10) : 7043 - 7057