DC3D: A Video Action Recognition Network Based on Dense Connection

被引:1
|
作者
Mu, Xiaofang [1 ]
Liu, Zhenyu [1 ]
Liu, Jiaji [1 ]
Li, Hao [1 ]
Li, Yue [2 ]
Li, Yikun [3 ]
机构
[1] Taiyuan Normal Univ, Coll Comp Sci & Technol, Taiyuan, Peoples R China
[2] Tianjin Univ Technol, Sch Elect & Informat Engn, Tianjin, Peoples R China
[3] Massey Univ, Dept Informat Sci, Palmerston North, New Zealand
来源
2022 TENTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA, CBD | 2022年
关键词
Action recognition; 3D Convolutions; DenseNet; Fisher discriminant criterion;
D O I
10.1109/CBD58033.2022.00032
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Efficiently extracting the temporal and spatial information of motion in the video, and how to obtain the spatiotemporal features with high degree of differentiation, is the key issue to improve the accuracy of action recognition classification. In this paper, a dense 3D convolutional block is designed as the basic unit to construct a dense convolutional 3D network, the spatiotemporal features existing in the video are extracted at the same time, and the transmission and reuse of the features in the network are strengthened, effectively fuse the shallow and deep spatiotemporal features of the network. At the same time, in order to make the features extracted by the network sufficiently discriminative, this paper proposes a joint loss function based on the Fisher discriminant regularization term, it can make the trained network have the ability to increase the inter-class dispersion and reduce the intra-class dispersion of the classified samples, and improve the classification accuracy. Experiments on the UCF-101 human actions classes dataset show that the network recognition accuracy rate proposed in this paper reaches 92.4%, which is higher than 85.2% of the C3D network, which proves the effectiveness of the method proposed in this paper.
引用
收藏
页码:133 / 138
页数:6
相关论文
共 50 条
  • [41] Improving Action Units Recognition Using Dense Flow-based Face Registration in Video
    Yang, Songfan
    An, Le
    Bhanu, Bir
    Thakoor, Ninad
    2013 10TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG), 2013,
  • [42] 3D Convolutional Neural Network for Action Recognition
    Zhang, Junhui
    Chen, Li
    Tian, Jing
    COMPUTER VISION, PT I, 2017, 771 : 600 - 607
  • [43] Action recognition with motion map 3D network
    Sun, Yuchao
    Wu, Xinxiao
    Yu, Wennan
    Yu, Feiwu
    NEUROCOMPUTING, 2018, 297 : 33 - 39
  • [44] Workout Action Recognition in Video Streams Using an Attention Driven Residual DC-GRU Network
    Dey, Arnab
    Biswas, Samit
    Le, Dac-Nhuong
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (02): : 3067 - 3087
  • [45] 2D Deep Video Capsule Network with Temporal Shift for Action Recognition
    Voillemin, Theo
    Wannous, Hazem
    Vandeborre, Jean-Philippe
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3513 - 3519
  • [46] Two-Level Attention Model Based Video Action Recognition Network
    Sang, Haifeng
    Zhao, Ziyu
    He, Dakuo
    IEEE ACCESS, 2019, 7 : 118388 - 118401
  • [47] A Knowledge-Based Hierarchical Causal Inference Network for Video Action Recognition
    Liu, Yang
    Liu, Fang
    Jiao, Licheng
    Bao, Qianyue
    Li, Lingling
    Guo, Yuwei
    Chen, Puhua
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9135 - 9149
  • [48] Video-based action recognition using spurious-3D residual attention networks
    Chen, Bo
    Tang, Hongying
    Zhang, Zebin
    Tong, Guanjun
    Li, Baoqing
    IET IMAGE PROCESSING, 2022, 16 (11) : 3097 - 3111
  • [49] PA3D: Pose-Action 3D Machine for Video Recognition
    Yan, An
    Wang, Yali
    Li, Zhifeng
    Qiao, Yu
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7914 - 7923
  • [50] Residual attention fusion network for video action recognition
    Li, Ao
    Yi, Yang
    Liang, Daan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 98