HybridNet: Integrating GCN and CNN for skeleton-based action recognition

被引:16
|
作者
Yang, Wenjie [1 ,2 ]
Zhang, Jianlin [1 ]
Cai, Jingju [1 ]
Xu, Zhiyong [1 ]
机构
[1] Chinese Acad Sci, Inst Opt & Elect, Key Lab Opt Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing, Peoples R China
关键词
Action recognition; Human skeleton; Graph convolutional networks; CONVOLUTION NEURAL-NETWORKS;
D O I
10.1007/s10489-022-03436-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph convolutional networks (GCNs) can well-preserve the structure information of the human body. They have achieved outstanding performance in skeleton-based action recognition. Nevertheless, there are still some issues with existing GCN-based methods. First, all channels have the same adjacency matrix. However, the correlations between joints are complex and may drastically change depending on the actions. These correlations are difficult to fit by merely channel-shared adjacency matrices. Second, the interframe edges of graphs only connect the same joints, neglecting the dependencies between the different joints. Fortunately, convolutional neural networks (CNNs) can simultaneously establish the interdependence of all the points in a spatial-temporal patch. Furthermore, CNNs use different kernels among channels. They are more adaptable for modeling complicated dependencies. In this work, we design a hybrid network (HybridNet) to integrate GCNs and CNNs. The HybridNet not only utilizes structural information well but also models complicated relationships between interframe joints properly.Extensive experiments are conducted on three challenging datasets: NTU-RGB+D, NTU-RGB+D 120, and Skeleton-Kinetics. The proposed model achieves state-of-the-art performance on all these datasets by a considerable margin, demonstrating the superiority of our method.
引用
收藏
页码:574 / 585
页数:12
相关论文
共 50 条
  • [21] GAS-GCN: Gated Action-Specific Graph Convolutional Networks for Skeleton-Based Action Recognition
    Chan, Wensong
    Tian, Zhiqiang
    Wu, Yang
    SENSORS, 2020, 20 (12) : 1 - 13
  • [22] RELATIONAL NETWORK FOR SKELETON-BASED ACTION RECOGNITION
    Zheng, Wu
    Li, Lin
    Zhang, Zhaoxiang
    Huang, Yan
    Wang, Liang
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 826 - 831
  • [23] SpatioTemporal focus for skeleton-based action recognition
    Wu, Liyu
    Zhang, Can
    Zou, Yuexian
    PATTERN RECOGNITION, 2023, 136
  • [24] AFE-CNN: 3D Skeleton-based Action Recognition with Action Feature Enhancement
    Guan, Shannan
    Lu, Haiyan
    Zhu, Linchao
    Fang, Gengfa
    NEUROCOMPUTING, 2022, 514 : 256 - 267
  • [25] Skeleton-based action recognition using sparse spatio-temporal GCN with edge effective resistance
    Ahmad, Tasweer
    Jin, Lianwen
    Lin, Luojun
    Tang, GuoZhi
    NEUROCOMPUTING, 2021, 423 : 389 - 398
  • [26] DD-GCN: Directed Diffusion Graph Convolutional Network for Skeleton-based Human Action Recognition
    Li, Chang
    Huang, Qian
    Mao, Yingchi
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 786 - 791
  • [27] Si-GCN: Structure-induced Graph Convolution Network for Skeleton-based Action Recognition
    Liu, Rong
    Xu, Chunyan
    Zhang, Tong
    Zhao, Wenting
    Cui, Zhen
    Yang, Jian
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [28] Integrating vertex and edge features with Graph Convolutional Networks for skeleton-based action recognition
    Liu, Kai
    Gao, Lei
    Khan, Naimul Mefraz
    Qi, Lin
    Guan, Ling
    NEUROCOMPUTING, 2021, 466 : 190 - 201
  • [29] Generative Action Description Prompts for Skeleton-based Action Recognition
    Xiang, Wangmeng
    Li, Chao
    Zhou, Yuxuan
    Wang, Biao
    Zhang, Lei
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 10242 - 10251
  • [30] A Novel Skeleton Spatial Pyramid Model for Skeleton-based Action Recognition
    Li, Yanshan
    Guo, Tianyu
    Xia, Rongjie
    Liu, Xing
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP 2019), 2019, : 16 - 20