Large-scale gesture recognition with a fusion of RGB-D data based on optical flow and the C3D model

被引:32
|
作者
Li, Yunan [1 ]
Miao, Qiguang [1 ]
Tian, Kuan [1 ]
Fan, Yingying [1 ]
Xu, Xin [1 ]
Ma, Zhenxin [1 ]
Song, Jianfeng [1 ]
机构
[1] Xidian Univ, Xian, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Gesture recognition; RGB-D data; Optical flow; 3D Convolutional Neural Networks;
D O I
10.1016/j.patrec.2017.12.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gesture recognition has attracted great attention owing to its applications in many fields such as Human Computer Interaction. However, in video-based gesture recognition, some gesture-irrelevant factors like the background handicap the improvement of recognition rate. In this paper, we propose an effective 3D Convolutional Neural Network based method for large-scale gesture recognition using RGB-D video data. To obtain compact but with sufficient motion path information data for the network, the inputs are unified into 32-frame videos first. Then the optical flow images are constructed from the RGB videos frame by frame, to help with eliminating the disturbing background inside them. After that, the spatiotemporal features of de-background RGB and depth data are extracted with the C3D model (a 3D CNN model) respectively and blended together in the next stage according to the discriminant correlation analysis to boost the performance. Finally the classes are predicted with a linear SVM classifier. Our proposed method achieves 54.50% accuracy on the validation subset and 60.93% on the testing subset of the Chalearn LAP IsoGD dataset, both of which outperform our results (ranked 1st place) in the Chalearn LAP Large-scale Gesture Recognition Challenge. (C) 2017 Published by Elsevier B.V.
引用
收藏
页码:187 / 194
页数:8
相关论文
共 50 条
  • [41] ETRI-Activity3D: A Large-Scale RGB-D Dataset for Robots to Recognize Daily Activities of the Elderly
    Jang, Jinhyeok
    Kim, Dohyung
    Park, Cheonshu
    Jang, Minsu
    Lee, Jaeyeon
    Kim, Jaehong
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 10990 - 10997
  • [42] Multiple Classifiers-Based Feature Fusion for RGB-D Object Recognition
    Wu, Yan
    Li, Jiqian
    Bai, Jing
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2017, 31 (05)
  • [43] Object recognition and robot grasping technology based on RGB-D data
    Yu, Sheng
    Zhai, Di-Hua
    Wu, Haocun
    Yang, Hongda
    Xia, Yuanqing
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 3869 - 3874
  • [44] RETRACTION: Gesture recognition algorithm based on multi-scale feature fusion in RGB-D images (Retraction of Vol 14, Pg 3662, 2020)
    Sun, Ying
    Weng, Yaoqing
    Luo, Bowen
    Li, Gongfa
    Tao, Bo
    Jiang, Du
    Chen, Disi
    IET IMAGE PROCESSING, 2023, 17 (01) : 301 - 301
  • [45] Keypoint Fusion for RGB-D Based 3D Hand Pose Estimation
    Liu, Xingyu
    Ren, Pengfei
    Gao, Yuanyuan
    Wang, Jingyu
    Sun, Haifeng
    Qi, Qi
    Zhuang, Zirui
    Liao, Jianxin
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 4, 2024, : 3756 - 3764
  • [46] FlowFusion: Dynamic Dense RGB-D SLAM Based on Optical Flow
    Zhang, Tianwei
    Zhang, Huayan
    Li, Yang
    Nakamura, Yoshihiko
    Zhang, Lei
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 7322 - 7328
  • [47] A Smart TV Interaction System Based on Hand Gesture Recognition by Using RGB-D Sensor
    Feng, Qi
    Yang, Cheng
    Wu, Xiaoyu
    Li, Zhuojia
    PROCEEDINGS 2013 INTERNATIONAL CONFERENCE ON MECHATRONIC SCIENCES, ELECTRIC ENGINEERING AND COMPUTER (MEC), 2013, : 1319 - 1322
  • [48] Early or Late Fusion Matters: Efficient RGB-D Fusion in Vision Transformers for 3D Object Recognition
    Tziafas, Georgios
    Kasaei, Hamidreza
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 9558 - 9565
  • [49] One-shot Learning Gesture Recognition from RGB-D Data Using Bag of Features
    Wan, Jun
    Ruan, Qiuqi
    Li, Wei
    Deng, Shuang
    JOURNAL OF MACHINE LEARNING RESEARCH, 2013, 14 : 2549 - 2582
  • [50] Dynamic hand gesture recognition using RGB-D data for natural human-computer interaction
    Cai Linqin
    Cui Shuangjie
    Xiang Min
    Yu Jimin
    Zhang Jianrong
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2017, 32 (05) : 3495 - 3507