Fusion of 2D CNN and 3D DenseNet for Dynamic Gesture Recognition

被引:28
|
作者
Zhang, Erhu [1 ]
Xue, Botao [1 ]
Cao, Fangzhou [1 ]
Duan, Jinghong [2 ]
Lin, Guangfeng [1 ]
Lei, Yifei [3 ]
机构
[1] Xian Univ Technol, Dept Informat Sci, Xian 710048, Peoples R China
[2] Xian Univ Technol, Sch Comp Sci & Engn, Xian 710048, Peoples R China
[3] Changan Univ, Sch Elect & Control Engn, Xian 710064, Peoples R China
基金
中国国家自然科学基金;
关键词
gesture recognition; motion representation; 2D CNN; 3D DenseNet; information fusion; FLOW;
D O I
10.3390/electronics8121511
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gesture recognition has been applied in many fields as it is a natural human-computer communication method. However, recognition of dynamic gesture is still a challenging topic because of complex disturbance information and motion information. In this paper, we propose an effective dynamic gesture recognition method by fusing the prediction results of a two-dimensional (2D) motion representation convolution neural network (CNN) model and three-dimensional (3D) dense convolutional network (DenseNet) model. Firstly, to obtain a compact and discriminative gesture motion representation, the motion history image (MHI) and pseudo-coloring technique were employed to integrate the spatiotemporal motion sequences into a frame image, before being fed into a 2D CNN model for gesture classification. Next, the proposed 3D DenseNet model was used to extract spatiotemporal features directly from Red, Green, Blue (RGB) gesture videos. Finally, the prediction results of the proposed 2D and 3D deep models were blended together to boost recognition performance. The experimental results on two public datasets demonstrate the effectiveness of our proposed method.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Hand Gesture Recognition in Multi-space of 2D/3D
    Lee, Hansaem
    Park, Junseok
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2015, 15 (06): : 12 - 16
  • [2] DYNAMIC HAND GESTURE RECOGNITION USING A CNN MODEL WITH 3D RECEPTIVE FIELDS
    Kim, Ho-Joon
    Lee, Joseph S.
    Park, Jin-Hui
    [J]. 2008 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND SIGNAL PROCESSING, VOLS 1 AND 2, 2007, : 14 - 19
  • [3] Face recognition based on 2D and 3D data fusion
    Krotewicz, Pawel
    Sankowski, Wojciech
    Nowak, Piotr Stefan
    [J]. INTERNATIONAL JOURNAL OF BIOMETRICS, 2015, 7 (01) : 69 - 81
  • [4] Dynamic gesture recognition based on 2D convolutional neural network and feature fusion
    Yu, Jimin
    Qin, Maowei
    Zhou, Shangbo
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [5] Dynamic gesture recognition based on 2D convolutional neural network and feature fusion
    Jimin Yu
    Maowei Qin
    Shangbo Zhou
    [J]. Scientific Reports, 12
  • [6] Dynamic Gesture Recognition using 3D Trajectory
    Wang, Qianqian
    Xu, Yuan-Rong
    Bai, Xiao
    Xu, Dan
    Chen, Yen-Lun
    Wu, Xinyu
    [J]. 2014 4TH IEEE INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2014, : 598 - 601
  • [7] Combined 2D and 3D Convolution Residual Attention Network for Hand Gesture Recognition
    Tsai, Chang-Ting
    Ding, Jian-Jiun
    [J]. PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 104 - 108
  • [8] AirMouse: Finger Gesture for 2D and 3D Interaction
    Ortega, Michael
    Nigay, Laurence
    [J]. HUMAN-COMPUTER INTERACTION - INTERACT 2009, PT II, PROCEEDINGS, 2009, 5727 : 214 - 227
  • [9] Spatiotemporal Action Detection Using 2D CNN and 3D CNN
    Liu, Hengshuai
    Li, Jianjun
    Tang, Yuhong
    Zhang, Ningfei
    Zhang, Ming
    Wang, Yaping
    Li, Guang
    [J]. Computers and Electrical Engineering, 2024, 120
  • [10] Hybrid Deep Feature Fusion of 2D CNN and 3D CNN for Vestibule Segmentation from CT Images
    Zhang, Ruicong
    Zhuo, Li
    Chen, Meijuan
    Yin, Hongxia
    Li, Xiaoguang
    Wang, Zhenchang
    [J]. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2022, 2022