2D/3D Pose Estimation and Action Recognition using Multitask Deep Learning

被引:327
|
作者
Luvizon, Diogo C. [1 ]
Picard, David [1 ,2 ]
Tabia, Hedi [1 ]
机构
[1] Paris Seine Univ, CNRS, ENSEA, ETIS UMR 8051, F-95000 Cergy, France
[2] Sorbonne Univ, CNRS, Lab Informat Paris 6, F-75005 Paris, France
关键词
D O I
10.1109/CVPR.2018.00539
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Action recognition and human pose estimation are closely related but both problems are generally handled as distinct tasks in the literature. In this work, we propose a multitask framework for jointly 2D and 3D pose estimation from still images and human action recognition from video sequences. We show that a single architecture can be used to solve the two problems in an efficient way and still achieves state-of-the-art results. Additionally, we demonstrate that optimization from end-toend leads to significantly higher accuracy than separated learning. The proposed architecture can be trained with data from different categories simultaneously in a seamlessly way. The reported results on four datasets (MPII, Human3.6M, Penn Action and NTU) demonstrate the effectiveness of our method on the targeted tasks.
引用
收藏
页码:5137 / 5146
页数:10
相关论文
共 50 条
  • [1] 2D and 3D Human Pose Estimation and Analysis Using Deep Learning
    Yadav, Anju
    Saxena, Rahul
    Bhattacharya, Anubhav
    Pal, Vipin
    Pathak, Nitish
    [J]. ADVANCES IN INFORMATION COMMUNICATION TECHNOLOGY AND COMPUTING, AICTC 2021, 2022, 392 : 133 - 143
  • [2] 2D Action Recognition Serves 3D Human Pose Estimation
    Gall, Juergen
    Yao, Angela
    Van Gool, Luc
    [J]. COMPUTER VISION-ECCV 2010, PT III, 2010, 6313 : 425 - 438
  • [3] A review of deep learning techniques for 2D and 3D human pose estimation
    Ben Gamra, Miniar
    Akhloufi, Moulay A.
    [J]. IMAGE AND VISION COMPUTING, 2021, 114
  • [4] 3D Human Pose Estimation via Deep Learning from 2D annotations
    Brau, Ernesto
    Jiang, Hao
    [J]. PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, : 582 - 591
  • [5] Geometric Deep Learning on Skeleton Sequences for 2D/3D Action Recognition
    Friji, Rasha
    Drira, Hassen
    Chaieb, Faten
    [J]. PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 5: VISAPP, 2020, : 196 - 204
  • [6] A 3D Object Recognition and Pose estimation System Using Deep Learning Method
    Liang, Dong
    Weng, Kaijian
    Wang, Can
    Liang, Guoyuan
    Chen, Haoyao
    Wu, Xinyu
    [J]. 2014 4TH IEEE INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2014, : 401 - 404
  • [7] 2D and 3D Palmprint Recognition using Deep Learning Method
    Samai, Djamel
    Bensid, Khaled
    Meraoumia, Abdallah
    Taleb-Ahmed, Abdelmalik
    Bedda, Mouldi
    [J]. 2018 3RD INTERNATIONAL CONFERENCE ON PATTERN ANALYSIS AND INTELLIGENT SYSTEMS (PAIS), 2018, : 245 - 250
  • [8] Adaptive Loss Balancing for Multitask Learning of Object Instance Recognition and 3D Pose Estimation
    Hosono, Takashi
    Hoshi, Yuuna
    Shimamura, Jun
    Sagata, Atsushi
    [J]. 2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 2587 - 2592
  • [9] Recent Advances of Monocular 2D and 3D Human Pose Estimation: A Deep Learning Perspective
    Liu, Wu
    Bao, Qian
    Sun, Yu
    Mei, Tao
    [J]. ACM COMPUTING SURVEYS, 2023, 55 (04)
  • [10] 3D Human Pose Estimation from Deep Multi-View 2D Pose
    Schwarcz, Steven
    Pollard, Thomas
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2326 - 2331