Tripool: Graph triplet pooling for 3D skeleton-based action recognition

被引:39
|
作者
Peng, Wei [1 ]
Hong, Xiaopeng [1 ,2 ]
Zhao, Guoying [1 ,3 ]
机构
[1] Univ Oulu, Ctr Machine Vis & Signal Anal, Oulu, Finland
[2] Xi An Jiao Tong Univ, Fac Elect & Informat Engn, Sch Cyber Sci & Engn, Xian, Peoples R China
[3] Northwest Univ, Sch Informat & Technol, Xian, Peoples R China
基金
芬兰科学院; 中国国家自然科学基金;
关键词
3D skeletal action recognition; ST-GCN; Graph pooling; Graph topology analysis;
D O I
10.1016/j.patcog.2021.107921
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph Convolutional Network (GCN) has already been successfully applied to skeleton-based action recog-nition. However, current GCNs in this task are lack of pooling operations such that the architectures are inherently flat, which not only increases the computational complexity but also requires larger memory space to keep the entire graph embedding. More seriously, a flat architecture forces the high-level seman-tic feature representations to have the same physical structure of the low-level input skeletons, which we argue is unreasonable and harmful for the final performance. To address these issues, we propose Tripool, a novel graph pooling method for 3D action recognition from skeleton data. Tripool provides to optimize a triplet pooling loss, in which both graph topology and global graph context are taken into considera-tion, to learn a hierarchical graph representation. The training process of graph pooling is efficient since it optimizes the graph topology by minimizing an upper bound of the pooling loss. Besides, Tripool also automatically generates an embedding matrix since the graph is changed after pooling. On one hand, Tripool reduces the computational cost by removing the redundant nodes. On the other hand it over-comes the limitation of the topology constrain for the high-level semantic representations, thus improves the final performance. Tripool can be combined with various graph neural networks in an end-to-end fashion. Comprehensive experiments on two current largest scale 3D datasets are conducted to evalu-ate our method. With our Tripool, we consistently get the best results in terms of various performance measures. (C) 2021 The Author(s). Published by Elsevier Ltd.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] 3D skeleton-based action recognition with convolutional neural networks
    Van-Nam Hoang
    Thi-Lan Le
    Thanh-Hai Tran
    Hai-Vu
    Van-Toi Nguyen
    [J]. 2019 INTERNATIONAL CONFERENCE ON MULTIMEDIA ANALYSIS AND PATTERN RECOGNITION (MAPR), 2019,
  • [2] Learning Clip Representations for Skeleton-Based 3D Action Recognition
    Ke, Qiuhong
    Bennamoun, Mohammed
    An, Senjian
    Sohel, Ferdous
    Boussaid, Farid
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (06) : 2842 - 2855
  • [3] A 3D graph convolutional networks model for 2D skeleton-based human action recognition
    Weng, Libo
    Lou, Weidong
    Shen, Xin
    Gao, Fei
    [J]. IET IMAGE PROCESSING, 2023, 17 (03) : 773 - 783
  • [4] Symbiotic Graph Neural Networks for 3D Skeleton-Based Human Action Recognition and Motion Prediction
    Li, Maosen
    Chen, Siheng
    Chen, Xu
    Zhang, Ya
    Wang, Yanfeng
    Tian, Qi
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (06) : 3316 - 3333
  • [5] Fast 3D-graph convolutional networks for skeleton-based action recognition
    Zhang, Guohao
    Wen, Shuhuan
    Li, Jiaqi
    Che, Haijun
    [J]. APPLIED SOFT COMPUTING, 2023, 145
  • [6] Mix Dimension in Poincare Geometry for 3D Skeleton-based Action Recognition
    Peng, Wei
    Shi, Jingang
    Xia, Zhaoqiang
    Zhao, Guoying
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1432 - 1440
  • [7] Multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition
    Yu, Lubin
    Tian, Lianfang
    Du, Qiliang
    Bhutto, Jameel Ahmed
    [J]. APPLIED INTELLIGENCE, 2023, 53 (12) : 14838 - 14854
  • [8] A Survey on 3D Skeleton-Based Action Recognition Using Learning Method
    Ren, Bin
    Liu, Mengyuan
    Ding, Runwei
    Liu, Hong
    [J]. CYBORG AND BIONIC SYSTEMS, 2024, 5
  • [9] Multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition
    Lubin Yu
    Lianfang Tian
    Qiliang Du
    Jameel Ahmed Bhutto
    [J]. Applied Intelligence, 2023, 53 : 14838 - 14854
  • [10] Skeleton-based Action Recognition with Graph Involution Network
    Tang, Zhihao
    Xia, Hailun
    Gao, Xinkai
    Gao, Feng
    Feng, Chunyan
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3348 - 3354