HybridNet: Integrating GCN and CNN for skeleton-based action recognition

被引：16

作者：

Yang, Wenjie ^{[1
,2
]}

Zhang, Jianlin ^{[1
]}

Cai, Jingju ^{[1
]}

Xu, Zhiyong ^{[1
]}

机构：

[1] Chinese Acad Sci, Inst Opt & Elect, Key Lab Opt Engn, Beijing, Peoples R China

[2] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing, Peoples R China

来源：

APPLIED INTELLIGENCE | 2023年 / 53卷 / 01期

关键词：

Action recognition; Human skeleton; Graph convolutional networks; CONVOLUTION NEURAL-NETWORKS;

D O I：

10.1007/s10489-022-03436-0

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Graph convolutional networks (GCNs) can well-preserve the structure information of the human body. They have achieved outstanding performance in skeleton-based action recognition. Nevertheless, there are still some issues with existing GCN-based methods. First, all channels have the same adjacency matrix. However, the correlations between joints are complex and may drastically change depending on the actions. These correlations are difficult to fit by merely channel-shared adjacency matrices. Second, the interframe edges of graphs only connect the same joints, neglecting the dependencies between the different joints. Fortunately, convolutional neural networks (CNNs) can simultaneously establish the interdependence of all the points in a spatial-temporal patch. Furthermore, CNNs use different kernels among channels. They are more adaptable for modeling complicated dependencies. In this work, we design a hybrid network (HybridNet) to integrate GCNs and CNNs. The HybridNet not only utilizes structural information well but also models complicated relationships between interframe joints properly.Extensive experiments are conducted on three challenging datasets: NTU-RGB+D, NTU-RGB+D 120, and Skeleton-Kinetics. The proposed model achieves state-of-the-art performance on all these datasets by a considerable margin, demonstrating the superiority of our method.

引用

页码：574 / 585

页数：12

共 50 条

[21] GAS-GCN: Gated Action-Specific Graph Convolutional Networks for Skeleton-Based Action Recognition
Chan, Wensong
Tian, Zhiqiang
Wu, Yang
SENSORS, 2020, 20 (12) : 1 - 13
[22] RELATIONAL NETWORK FOR SKELETON-BASED ACTION RECOGNITION
Zheng, Wu
Li, Lin
Zhang, Zhaoxiang
Huang, Yan
Wang, Liang
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 826 - 831
[23] SpatioTemporal focus for skeleton-based action recognition
Wu, Liyu
Zhang, Can
Zou, Yuexian
PATTERN RECOGNITION, 2023, 136
[24] AFE-CNN: 3D Skeleton-based Action Recognition with Action Feature Enhancement
Guan, Shannan
Lu, Haiyan
Zhu, Linchao
Fang, Gengfa
NEUROCOMPUTING, 2022, 514 : 256 - 267
[25] Skeleton-based action recognition using sparse spatio-temporal GCN with edge effective resistance
Ahmad, Tasweer
Jin, Lianwen
Lin, Luojun
Tang, GuoZhi
NEUROCOMPUTING, 2021, 423 : 389 - 398
[26] DD-GCN: Directed Diffusion Graph Convolutional Network for Skeleton-based Human Action Recognition
Li, Chang
Huang, Qian
Mao, Yingchi
2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 786 - 791
[27] Si-GCN: Structure-induced Graph Convolution Network for Skeleton-based Action Recognition
Liu, Rong
Xu, Chunyan
Zhang, Tong
Zhao, Wenting
Cui, Zhen
Yang, Jian
2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
[28] Integrating vertex and edge features with Graph Convolutional Networks for skeleton-based action recognition
Liu, Kai
Gao, Lei
Khan, Naimul Mefraz
Qi, Lin
Guan, Ling
NEUROCOMPUTING, 2021, 466 : 190 - 201
[29] Generative Action Description Prompts for Skeleton-based Action Recognition
Xiang, Wangmeng
Li, Chao
Zhou, Yuxuan
Wang, Biao
Zhang, Lei
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 10242 - 10251
[30] A Novel Skeleton Spatial Pyramid Model for Skeleton-based Action Recognition
Li, Yanshan
Guo, Tianyu
Xia, Rongjie
Liu, Xing
2019 IEEE 4TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP 2019), 2019, : 16 - 20

← 1 2 3 4 5 →