A Multi-Task Neural Network for Action Recognition with 3D Key-Points

Cited by: 4
Authors
Tang, Rongxiao [1]
Wang, Luyang [1]
Guo, Zhenhua [1]
Affiliation
[1] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China
Keywords
Multi-Task Deep Learning; 3D Pose Estimation; Stereo Inspired Neural Network; Action Recognition;
DOI
10.1109/ICPR48806.2021.9412348
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Action recognition and 3D human pose estimation are fundamental problems in computer vision and closely related areas. In this work, we propose a multi-task neural network for action recognition and 3D human pose estimation. Previous methods are often error-prone, especially when tested on images taken in the wild, which leads to erroneous action recognition results. To solve this problem, we propose a principled approach to generate high-quality 3D pose ground truth given any in-the-wild image with a person inside. We achieve this by first devising a novel stereo-inspired neural network that directly maps any 2D pose to a high-quality 3D counterpart. Based on the high-quality 3D labels, we carefully design the multi-task framework for action recognition and 3D human pose estimation. The proposed architecture can utilize shallow and deep image features together with in-the-wild 3D human key-points to guide a more precise result. High-quality 3D key-points can fully reflect the morphological features of motions and thus boost performance on action recognition. Experimental results demonstrate that jointly learning 3D pose estimation leads to significantly higher performance on action recognition than separate learning. We also evaluate the generalization ability of our method both quantitatively and qualitatively. The proposed architecture performs favorably against the baseline 3D pose estimation methods. In addition, the reported results on the Penn Action and NTU datasets demonstrate the effectiveness of our method on the action recognition task.
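The abstract does not state the training objective, so the following is only a minimal sketch of how a joint objective for the two tasks is commonly formed, not the authors' method. It assumes a weighted sum of a 3D pose term (mean per-joint position error) and an action classification term (softmax cross-entropy). The function names and the weights `pose_weight` and `action_weight` are hypothetical.

```python
import math


def mpjpe(pred_joints, gt_joints):
    """Mean per-joint position error: average Euclidean distance
    between predicted and ground-truth 3D key-points."""
    assert len(pred_joints) == len(gt_joints)
    total = sum(math.dist(p, g) for p, g in zip(pred_joints, gt_joints))
    return total / len(pred_joints)


def cross_entropy(logits, label):
    """Softmax cross-entropy for a single sample, using the
    log-sum-exp trick for numerical stability."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[label]


def multi_task_loss(pred_joints, gt_joints, action_logits, action_label,
                    pose_weight=1.0, action_weight=1.0):
    """Assumed joint objective: weighted sum of the pose loss and the
    action loss. The weights are illustrative, not from the paper."""
    pose_term = mpjpe(pred_joints, gt_joints)
    action_term = cross_entropy(action_logits, action_label)
    return pose_weight * pose_term + action_weight * action_term


if __name__ == "__main__":
    # Two joints, one of them off by 2 units along z; two action classes.
    pred = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    gt = [(0.0, 0.0, 0.0), (1.0, 0.0, 2.0)]
    print(multi_task_loss(pred, gt, [0.0, 0.0], 0))
```

In a setup like this, both terms would share a common image backbone, so gradients from the pose term also shape the features used for action classification; that is the usual motivation for training the tasks jointly rather than separately.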
Pages: 3899-3906
Number of pages: 8