Human gesture recognition of dynamic skeleton using graph convolutional networks

Cited by: 1
Authors
Liang, Wuyan [1]
Xu, Xiaolong [2]
Xiao, Fu [2]
Affiliations
[1] Nanjing Univ Posts & Telecommun, Jiangsu Key Lab Big Data Secur & Intelligent Proc, Nanjing, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
intelligent vision computing; graph convolutional networks; spatiotemporal correlations; dynamic gesture recognition; sign language recognition
DOI
10.1117/1.JEI.32.2.021402
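Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Subject classification code
0808; 0809
Abstract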
Intelligent vision computing remains a fascinating field. With the rapid development of computer vision, dynamic gesture-based recognition systems have attracted significant attention. However, automatically recognizing skeleton-based human gestures in the form of sign language is complex and challenging. Most existing methods treat skeleton-based human gesture recognition as a standard video recognition problem and do not consider the rich structure information among joints and among gesture frames. Graph convolutional networks (GCNs) are a promising way to leverage structure information to learn structure representations. However, adopting GCNs to handle such gesture sequences in both spatial and temporal spaces is challenging because the graphs can be highly nonlinear and complex. To overcome this issue, we propose a spatiotemporal GCN model, called Aegles, that leverages spatiotemporal correlations to adaptively construct spatiotemporal graphs. Our method dynamically attends to relatively significant spatiotemporal joints and constructs different graphs, including spatial, temporal, and spatiotemporal graphs, to capture the structure information in gesture sequences. In addition, we introduce the second-order information of the gesture skeleton data, i.e., the length and orientation of bones, to improve the representation of human hands and fingers. Using public sign language datasets, we apply OpenPose to extract human gesture skeletons and obtain human skeleton videos, building four skeleton-based sign language recognition datasets. Experimental results show that Aegles outperforms state-of-the-art methods and that the spatiotemporal correlations effectively boost the performance of human gesture recognition.
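The second-order information mentioned in the abstract (bone length and orientation) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy joint coordinates, the `BONES` list of (child, parent) pairs, and the function name `bone_features` are all hypothetical, and the sketch assumes 2D joint positions for a single frame, with each bone taken as the child joint minus its parent joint.

```python
import math

# Hypothetical toy skeleton for one frame: joint index -> (x, y) position.
JOINTS = {0: (0.0, 0.0), 1: (3.0, 4.0), 2: (3.0, 5.0)}

# Assumed (child, parent) joint pairs; not the skeleton layout used in the paper.
BONES = [(1, 0), (2, 1)]


def bone_features(joints, bones):
    """Return a (length, unit_direction) pair for each bone.

    A bone is the vector from the parent joint to the child joint.
    """
    features = []
    for child, parent in bones:
        dx = joints[child][0] - joints[parent][0]
        dy = joints[child][1] - joints[parent][1]
        length = math.hypot(dx, dy)
        # Guard against zero-length bones, e.g. when two joints coincide.
        if length > 0:
            direction = (dx / length, dy / length)
        else:
            direction = (0.0, 0.0)
        features.append((length, direction))
    return features


features = bone_features(JOINTS, BONES)
for (child, parent), (length, direction) in zip(BONES, features):
    print(f"bone {parent}->{child}: length={length:.2f}, direction={direction}")
```

In a skeleton-based pipeline, features of this kind would typically be computed per frame and supplied alongside the raw joint coordinates; how Aegles combines the two is not specified in this record.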
Pages: 21
Related papers
50 in total
  • [1] Spatial temporal graph convolutional networks for skeleton-based dynamic hand gesture recognition
    Li, Yong
    He, Zihang
    Ye, Xiang
    He, Zuguo
    Han, Kangrong
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2019, 2019 (01)
  • [2] Spatial temporal graph convolutional networks for skeleton-based dynamic hand gesture recognition
    Yong Li
    Zihang He
    Xiang Ye
    Zuguo He
    Kangrong Han
    EURASIP Journal on Image and Video Processing, 2019
  • [3] Gaze Gesture Recognition by Graph Convolutional Networks
    Shi, Lei
    Copot, Cosmin
    Vanlanduit, Steve
    FRONTIERS IN ROBOTICS AND AI, 2021, 8
  • [4] Spatio-Temporal Dynamic Attention Graph Convolutional Network Based on Skeleton Gesture Recognition
    Han, Xiaowei
    Cui, Ying
    Chen, Xingyu
    Lu, Yunjing
    Hu, Wen
    ELECTRONICS, 2024, 13 (18)
  • [5] Temporal Decoupling Graph Convolutional Network for Skeleton-Based Gesture Recognition
    Liu, Jinfu
    Wang, Xinshun
    Wang, Can
    Gao, Yuan
    Liu, Mengyuan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 811 - 823
  • [6] Spatiotemporal 2D Skeleton-based Image for Dynamic Gesture Recognition Using Convolutional Neural Networks
    Paulo, Joao Ruivo
    Garrote, Luis
    Peixoto, Paulo
    Nunes, Urbano J.
    2021 30TH IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2021, : 1138 - 1144
  • [7] Traffic Police Gesture Recognition by Pose Graph Convolutional Networks
    Fang, Zhijie
    Zhang, Wuqiang
    Guo, Zijie
    Zhi, Rong
    Wang, Baofeng
    Flohr, Fabian
    2020 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2020, : 1833 - 1838
  • [8] Enhancing human behavior recognition with spatiotemporal graph convolutional neural networks and skeleton sequences
    Xu, Jianmin
    Liu, Fenglin
    Wang, Qinghui
    Zou, Ruirui
    Wang, Ying
    Zheng, Junling
    Du, Shaoyi
    Zeng, Wei
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2024, 2024 (01)
  • [9] A comparative review of graph convolutional networks for human skeleton-based action recognition
    Liqi Feng
    Yaqin Zhao
    Wenxuan Zhao
    Jiaxi Tang
    Artificial Intelligence Review, 2022, 55 : 4275 - 4305
  • [10] A comparative review of graph convolutional networks for human skeleton-based action recognition
    Feng, Liqi
    Zhao, Yaqin
    Zhao, Wenxuan
    Tang, Jiaxi
    ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (05) : 4275 - 4305