Large-scale gesture recognition with a fusion of RGB-D data based on optical flow and the C3D model

Cited by: 32
Authors
Li, Yunan [1 ]
Miao, Qiguang [1 ]
Tian, Kuan [1 ]
Fan, Yingying [1 ]
Xu, Xin [1 ]
Ma, Zhenxin [1 ]
Song, Jianfeng [1 ]
Affiliations
[1] Xidian Univ, Xian, Shaanxi, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Gesture recognition; RGB-D data; Optical flow; 3D Convolutional Neural Networks;
DOI
10.1016/j.patrec.2017.12.003
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Gesture recognition has attracted great attention owing to its applications in many fields, such as human-computer interaction. However, in video-based gesture recognition, gesture-irrelevant factors such as the background hinder improvement of the recognition rate. In this paper, we propose an effective 3D Convolutional Neural Network based method for large-scale gesture recognition using RGB-D video data. To obtain data that are compact yet retain sufficient motion-path information for the network, the inputs are first unified into 32-frame videos. Optical flow images are then constructed from the RGB videos frame by frame to help eliminate the distracting background. Next, the spatiotemporal features of the background-suppressed RGB data and the depth data are extracted separately with the C3D model (a 3D CNN model) and fused according to discriminant correlation analysis to boost performance. Finally, the classes are predicted with a linear SVM classifier. Our proposed method achieves 54.50% accuracy on the validation subset and 60.93% on the testing subset of the Chalearn LAP IsoGD dataset, both of which outperform our results (ranked 1st place) in the Chalearn LAP Large-scale Gesture Recognition Challenge. (C) 2017 Published by Elsevier B.V.
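The abstract states that inputs are unified into 32-frame videos but does not say how. As a rough illustration only, the sketch below assumes evenly spaced temporal sampling, which drops frames from long clips and repeats frames in short ones. The function name `sample_frame_indices` and the rounding rule are hypothetical and are not taken from the paper.

```python
from typing import List


def sample_frame_indices(num_frames: int, target_len: int = 32) -> List[int]:
    """Return target_len frame indices spread evenly over a clip.

    Assumption (not from the paper): uniform sampling with rounding to the
    nearest frame. Clips shorter than target_len get repeated indices.
    """
    if num_frames <= 0 or target_len <= 0:
        raise ValueError("num_frames and target_len must be positive")
    if target_len == 1:
        return [0]
    # Distance, in source frames, between consecutive sampled positions.
    step = (num_frames - 1) / (target_len - 1)
    # Round to the nearest index and clamp to the last valid frame.
    return [min(num_frames - 1, int(i * step + 0.5)) for i in range(target_len)]


if __name__ == "__main__":
    # A 64-frame clip is thinned out; a 10-frame clip has frames repeated.
    print(sample_frame_indices(64))
    print(sample_frame_indices(10))
```

The returned indices could then be used to select frames from both the RGB and the depth stream so that the two modalities stay aligned in time.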
Pages: 187 - 194
Page count: 8
Related papers
50 in total
  • [21] Selection of Large-Scale 3D Point Cloud Data Using Gesture Recognition
    Burgess, Robin
    Falcao, Antonio J.
    Fernandes, Tiago
    Ribeiro, Rita A.
    Gomes, Miguel
    Krone-Martins, Alberto
    de Almeida, Andre Moitinho
    TECHNOLOGICAL INNOVATION FOR CLOUD-BASED ENGINEERING SYSTEMS, 2015, 450 : 188 - 195
  • [22] Static Hand Gesture Recognition Based on RGB-D Image and Arm Removal
    Xu, Bingyuan
    Zhou, Zhiheng
    Huang, Junchu
    Huang, Yu
    ADVANCES IN NEURAL NETWORKS, PT I, 2017, 10261 : 180 - 187
  • [23] A Comparison of 2D and 3D Convolutional Neural Networks for Hand Gesture Recognition from RGB-D Data
    Kurmanji, Meghdad
    Ghaderi, Foad
    2019 27TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE 2019), 2019, : 2022 - 2027
  • [24] Robust Hand Gesture Recognition Based on RGB-D Data for Natural Human-Computer Interaction
    Xu, Jun
    Wang, Hanchen
    Zhang, Jianrong
    Cai, Linqin
    IEEE ACCESS, 2022, 10 : 54549 - 54562
  • [25] Planar Clustering Algorithm Based on RGB-D Data Fusion
    Huang Zhongyi
    Li Jiansheng
    Hao Xiangyang
    Wang Teng
    Yan Libo
    2015 CHINESE AUTOMATION CONGRESS (CAC), 2015, : 757 - 761
  • [26] Infrared and 3D Skeleton Feature Fusion for RGB-D Action Recognition
    De Boissiere, Alban Main
    Noumeir, Rita
    IEEE ACCESS, 2020, 8 (08): : 168297 - 168308
  • [27] Context-Assisted 3D (C3D) Object Detection from RGB-D Images
    Ren, Yuzhuo
    Chen, Chen
    Li, Shangwen
    Kuo, C-C Jay
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 55 : 131 - 141
  • [28] RGB-D Data-Based Action Recognition: A Review
    Shaikh, Muhammad Bilal
    Chai, Douglas
    SENSORS, 2021, 21 (12)
  • [29] Rethinking RGB-D Salient Object Detection: Models, Data Sets, and Large-Scale Benchmarks
    Fan, Deng-Ping
    Lin, Zheng
    Zhang, Zhao
    Zhu, Menglong
    Cheng, Ming-Ming
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (05) : 2075 - 2089
  • [30] Regional Attention with Architecture-Rebuilt 3D Network for RGB-D Gesture Recognition
    Zhou, Benjia
    Li, Yunan
    Wan, Jun
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 3563 - 3571