Dynamic Gesture Recognition Based on the Multi-modality Fusion Temporal Segment Networks

被引:0
|
作者
Zheng, Mingyao [1 ]
Tie, Yun [1 ]
Qi, Lin [1 ]
Jiang, Shengnan [1 ]
机构
[1] Zhengzhou Univ, Sch Informat Engn, Zhengzhou, Peoples R China
关键词
Dynamic gesture recognition; Multi-modality fusion; Temporal segment networks; Optical flow;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Gesture recognition is applied in various intelligent scenes. In this paper, we propose the multi-modality fusion temporal segment networks (MMFTSN) model to solve dynamic gestures recognition. Three gesture modalities: RGB, Depth and Optical flow (OF) video data are equally segmented and randomly sampled. Then, the sampling frames are classified using convolutional neural network. Finally, fusing three kinds of modality classification results. MMFTSN is used to obtain the recognition accuracy of 60.2% on the gesture database Chalearn LAP IsoGD, which is better than the result of related algorithms. The results show that the improved performance of our MMFTSN model.
引用
收藏
页数:3
相关论文
共 50 条
  • [1] Multi-modality Fusion Network for Action Recognition
    Huang, Kai
    Qin, Zheng
    Xu, Kaiping
    Ye, Shuxiong
    Wang, Guolong
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT II, 2018, 10736 : 139 - 149
  • [2] GCN-Based Multi-Modality Fusion Network for Action Recognition
    Liu, Shaocan
    Wang, Xingtao
    Xiong, Ruiqin
    Fan, Xiaopeng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 1242 - 1253
  • [3] Research on Dynamic Gesture Recognition Based on Multi Feature Fusion
    Yu, Meijuan
    Liu, Yankai
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2018, 423 : 733 - 738
  • [4] Multi-modality Gesture Detection and Recognition with Un-supervision, Randomization and Discrimination
    Chen, Guang
    Clarke, Daniel
    Giuliani, Manuel
    Gaschler, Andre
    Wu, Di
    Weikersdorfer, David
    Knoll, Alois
    COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 608 - 622
  • [5] Pilot Behavior Recognition Based on Multi-Modality Fusion Technology Using Physiological Characteristics
    Li, Yuhan
    Li, Ke
    Wang, Shaofan
    Chen, Xiaodan
    Wen, Dongsheng
    BIOSENSORS-BASEL, 2022, 12 (06):
  • [6] Open set HRRP recognition with few samples based on multi-modality prototypical networks
    Tian, Long
    Chen, Bo
    Guo, Zekun
    Du, Chuan
    Peng, Yang
    Liu, Hongwei
    SIGNAL PROCESSING, 2022, 193
  • [7] Open set HRRP recognition with few samples based on multi-modality prototypical networks
    Tian, Long
    Chen, Bo
    Guo, Zekun
    Du, Chuan
    Peng, Yang
    Liu, Hongwei
    Signal Processing, 2022, 193
  • [8] Equivariant Multi-Modality Image Fusion
    Zhao, Zixiang
    Hai, Haowen
    Zhang, Jiangshe
    Zhang, Yulun
    Zhane, Kai
    Xu, Shuang
    Chen, Dongdong
    Timofte, Radu
    Van Gool, Luc
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 25912 - 25921
  • [9] Skeleton Sequence and RGB Frame Based Multi-Modality Feature Fusion Network for Action Recognition
    Zhu, Xiaoguang
    Zhu, Ye
    Wang, Haoyu
    Wen, Honglin
    Yan, Yan
    Liu, Peilin
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (03)
  • [10] Fusion of multi-modality biomedical images using deep neural networks
    Gupta, Manish
    Kumar, Naresh
    Gupta, Neha
    Zaguia, Atef
    SOFT COMPUTING, 2022, 26 (16) : 8025 - 8036