A Multi-Task Neural Network for Action Recognition with 3D Key-Points

被引：4

作者：

Tang, Rongxiao ^{[1
]}

Wang, Luyang ^{[1
]}

Guo, Zhenhua ^{[1
]}

机构：

[1] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China

来源：

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2021年

关键词：

Multi-Task Deep Learning; 3D Pose Estimation; Stereo Inspired Neural Network; Action Recognition;

D O I：

10.1109/ICPR48806.2021.9412348

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Action recognition and 3D human pose estimation are fundamental problems in computer vision and closely related areas. In this work, we propose a multi-task neural network for action recognition and 3D human pose estimation. Results of previous methods are usually error-prone especially when tested against the images taken in-the-wild, leading error results in action recognition. To solve this problem, we propose a principled approach to generate high quality 3D pose ground truth given any in-the-wild image with a person inside. We achieve this by first devising a novel stereo inspired neural network to directly map any 2D pose to high quality 3D counterpart Based on the high-quality 3D labels, w e carefully design the multi-task framework for action recognition and 3D human pose estimation. The proposed architecture can utilize shallow, deep features of images, and in-the-wild 3D human key-points to guide a more precise result High quality 3D key-points can fully reflect morphological features of motions, thus boost the performance on action recognition. Experimental results demonstrate that 3D pose estimation leads to significantly higher performance on action recognition than separated learning. We also evaluate the generalization ability of our method both quantitatively and qualitatively. The proposed architecture performs favorably against the baseline 3D pose estimation methods. In addition, the reported results on Penn Action and NTU datasets demonstrate the effectiveness of our method on the action recognition task.

引用

页码：3899 / 3906

页数：8

共 50 条

[1] Three-Stream Convolutional Neural Network with Multi-task and Ensemble Learning for 3D Action Recognition
Liang, Duohan
Fan, Guoliang
Lin, Guangfeng
Chen, Wanjun
Pan, Xiaorong
Zhu, Hong
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 934 - 940
[2] Action recognition based on 3D motion history image and multi-task learning
Wang S.
Dang J.-W.
Wang Y.-P.
Jin J.
Dang, Jian-Wu (dangjw@mail.lzjtu.cn), 1600, Editorial Board of Jilin University (50): : 1495 - 1502
[3] Predicting Taxi Demand Based on 3D Convolutional Neural Network and Multi-task Learning
Kuang, Li
Yan, Xuejin
Tan, Xianhan
Li, Shuqi
Yang, Xiaoxian
REMOTE SENSING, 2019, 11 (11)
[4] 3D Convolutional Neural Network for Action Recognition
Zhang, Junhui
Chen, Li
Tian, Jing
COMPUTER VISION, PT I, 2017, 771 : 600 - 607
[5] 3D human action recognition model based on image set and regularized multi-task leaning
Gao, Z.
Zhang, G. T.
Zhang, H.
Xue, Y. B.
Xu, G. P.
NEUROCOMPUTING, 2017, 252 : 67 - 76
[6] Multi-Task Convolutional Neural Network for Car Attribute Recognition
Tian, Yunfei
Zhang, Dongping
Jing, Changxing
Chu, Donghui
Yang, Li
2017 4TH INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2017, : 459 - 463
[7] Multi-task segmentation network for the plant on 3D point cloud
Zeng A.
Luo L.
Pan D.
Xian Z.
Jiang X.
Xian Y.
Liu L.
Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2023, 39 (12): : 132 - 140
[8] 3D CONVOLUTIONAL NEURAL NETWORK WITH MULTI-MODEL FRAMEWORK FOR ACTION RECOGNITION
Jing, Longlong
Ye, Yuancheng
Yang, Xiaodong
Tian, Yingli
2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1837 - 1841
[9] PedRecNet: Multi-task deep neural network for full 3D human pose and orientation estimation
Burgermeister, Dennis
Curio, Cristobal
2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 441 - 448
[10] Weighted Multi-task Sparse Representation Classifier for 3D Face Recognition
Tang, Linlin
Li, Zhangyan
Qian, Tao
Qi, Shuhan
Liu, Yang
Zhang, Jiajia
Shi, Shuaijie
Liu, Churan
Su, Jingyong
ADVANCES IN INTELLIGENT SYSTEMS AND COMPUTING (ECC 2021), 2022, 268 : 105 - 116

← 1 2 3 4 5 →