A Multi-Task Neural Network for Action Recognition with 3D Key-Points

被引:4
|
作者
Tang, Rongxiao [1 ]
Wang, Luyang [1 ]
Guo, Zhenhua [1 ]
机构
[1] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China
关键词
Multi-Task Deep Learning; 3D Pose Estimation; Stereo Inspired Neural Network; Action Recognition;
D O I
10.1109/ICPR48806.2021.9412348
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Action recognition and 3D human pose estimation are fundamental problems in computer vision and closely related areas. In this work, we propose a multi-task neural network for action recognition and 3D human pose estimation. Results of previous methods are usually error-prone especially when tested against the images taken in-the-wild, leading error results in action recognition. To solve this problem, we propose a principled approach to generate high quality 3D pose ground truth given any in-the-wild image with a person inside. We achieve this by first devising a novel stereo inspired neural network to directly map any 2D pose to high quality 3D counterpart Based on the high-quality 3D labels, w e carefully design the multi-task framework for action recognition and 3D human pose estimation. The proposed architecture can utilize shallow, deep features of images, and in-the-wild 3D human key-points to guide a more precise result High quality 3D key-points can fully reflect morphological features of motions, thus boost the performance on action recognition. Experimental results demonstrate that 3D pose estimation leads to significantly higher performance on action recognition than separated learning. We also evaluate the generalization ability of our method both quantitatively and qualitatively. The proposed architecture performs favorably against the baseline 3D pose estimation methods. In addition, the reported results on Penn Action and NTU datasets demonstrate the effectiveness of our method on the action recognition task.
引用
收藏
页码:3899 / 3906
页数:8
相关论文
共 50 条
  • [1] Three-Stream Convolutional Neural Network with Multi-task and Ensemble Learning for 3D Action Recognition
    Liang, Duohan
    Fan, Guoliang
    Lin, Guangfeng
    Chen, Wanjun
    Pan, Xiaorong
    Zhu, Hong
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 934 - 940
  • [2] Action recognition based on 3D motion history image and multi-task learning
    Wang S.
    Dang J.-W.
    Wang Y.-P.
    Jin J.
    Dang, Jian-Wu (dangjw@mail.lzjtu.cn), 1600, Editorial Board of Jilin University (50): : 1495 - 1502
  • [3] Predicting Taxi Demand Based on 3D Convolutional Neural Network and Multi-task Learning
    Kuang, Li
    Yan, Xuejin
    Tan, Xianhan
    Li, Shuqi
    Yang, Xiaoxian
    REMOTE SENSING, 2019, 11 (11)
  • [4] 3D Convolutional Neural Network for Action Recognition
    Zhang, Junhui
    Chen, Li
    Tian, Jing
    COMPUTER VISION, PT I, 2017, 771 : 600 - 607
  • [5] 3D human action recognition model based on image set and regularized multi-task leaning
    Gao, Z.
    Zhang, G. T.
    Zhang, H.
    Xue, Y. B.
    Xu, G. P.
    NEUROCOMPUTING, 2017, 252 : 67 - 76
  • [6] Multi-Task Convolutional Neural Network for Car Attribute Recognition
    Tian, Yunfei
    Zhang, Dongping
    Jing, Changxing
    Chu, Donghui
    Yang, Li
    2017 4TH INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2017, : 459 - 463
  • [7] Multi-task segmentation network for the plant on 3D point cloud
    Zeng A.
    Luo L.
    Pan D.
    Xian Z.
    Jiang X.
    Xian Y.
    Liu L.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2023, 39 (12): : 132 - 140
  • [8] 3D CONVOLUTIONAL NEURAL NETWORK WITH MULTI-MODEL FRAMEWORK FOR ACTION RECOGNITION
    Jing, Longlong
    Ye, Yuancheng
    Yang, Xiaodong
    Tian, Yingli
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1837 - 1841
  • [9] PedRecNet: Multi-task deep neural network for full 3D human pose and orientation estimation
    Burgermeister, Dennis
    Curio, Cristobal
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 441 - 448
  • [10] Weighted Multi-task Sparse Representation Classifier for 3D Face Recognition
    Tang, Linlin
    Li, Zhangyan
    Qian, Tao
    Qi, Shuhan
    Liu, Yang
    Zhang, Jiajia
    Shi, Shuaijie
    Liu, Churan
    Su, Jingyong
    ADVANCES IN INTELLIGENT SYSTEMS AND COMPUTING (ECC 2021), 2022, 268 : 105 - 116