Large-scale gesture recognition with a fusion of RGB-D data based on optical flow and the C3D model

Cited by: 32
Authors
Li, Yunan [1]
Miao, Qiguang [1]
Tian, Kuan [1]
Fan, Yingying [1]
Xu, Xin [1]
Ma, Zhenxin [1]
Song, Jianfeng [1]
Affiliations
[1] Xidian University, Xi'an, Shaanxi, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
Gesture recognition; RGB-D data; Optical flow; 3D Convolutional Neural Networks
DOI
10.1016/j.patrec.2017.12.003
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Gesture recognition has attracted great attention owing to its applications in many fields, such as human-computer interaction. However, in video-based gesture recognition, gesture-irrelevant factors such as the background limit the achievable recognition rate. In this paper, we propose an effective method based on a 3D Convolutional Neural Network for large-scale gesture recognition using RGB-D video data. To give the network inputs that are compact yet retain sufficient motion-path information, all videos are first unified to 32 frames. Optical flow images are then constructed from the RGB videos frame by frame to help eliminate the distracting background. Next, the spatiotemporal features of the background-suppressed RGB data and the depth data are extracted separately with the C3D model (a 3D CNN model) and fused using discriminant correlation analysis to boost performance. Finally, the classes are predicted with a linear SVM classifier. Our proposed method achieves 54.50% accuracy on the validation subset and 60.93% on the testing subset of the ChaLearn LAP IsoGD dataset, both of which surpass our first-place results in the ChaLearn LAP Large-scale Gesture Recognition Challenge. © 2017 Published by Elsevier B.V.
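The abstract says the inputs are unified into 32-frame videos but does not say how. As an illustration only, the sketch below assumes uniform temporal sampling: frames are dropped from longer clips and repeated in shorter ones. The function names `unify_frame_indices` and `unify_clip` are hypothetical and are not taken from the paper.

```python
def unify_frame_indices(n_frames, target=32):
    """Return `target` frame indices spread evenly over a clip of n_frames.

    Assumption: uniform temporal sampling. The paper's actual
    unification rule is not given in the abstract.
    """
    if n_frames <= 0:
        raise ValueError("clip must contain at least one frame")
    if target < 2:
        raise ValueError("target must be at least 2")
    step = (n_frames - 1) / (target - 1)
    # Short clips repeat indices; long clips skip frames.
    return [round(i * step) for i in range(target)]


def unify_clip(frames, target=32):
    """Resample a sequence of frames to exactly `target` frames."""
    return [frames[i] for i in unify_frame_indices(len(frames), target)]


if __name__ == "__main__":
    clip = list(range(100))          # stand-in for 100 video frames
    fixed = unify_clip(clip)
    print(len(fixed), fixed[0], fixed[-1])
```

Under this assumption the first and last frames are always kept, so the start and end of the motion path are retained whatever the original clip length.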
Pages: 187-194
Page count: 8