3D Human Pose Estimation Using Two-Stream Architecture with Joint Training

被引:0
|
作者
Kang, Jian [1 ]
Fan, Wanshu [1 ]
Li, Yijing [2 ]
Liu, Rui [1 ]
Zhou, Dongsheng [1 ]
机构
[1] Dalian Univ, Sch Software Engn, Natl & Local Joint Engn Lab Comp Aided Design, Dalian 116622, Peoples R China
[2] Dalian Maritime Univ, Dalian 116023, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
3D human pose; improved TCN; GELU; kinematic structure;
D O I
10.32604/cmes.2023.024420
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
With the advancement of image sensing technology, estimating 3D human pose from monocular video has become a hot research topic in computer vision. 3D human pose estimation is an essential prerequisite for subsequent action analysis and understanding. It empowers a wide spectrum of potential applications in various areas, such as intelligent transportation, human-computer interaction, and medical rehabilitation. Currently, some methods for 3D human pose estimation in monocular video employ temporal convolutional network (TCN) to extract inter-frame feature relationships, but the majority of them suffer from insufficient inter-frame feature relationship extractions. In this paper, we decompose the 3D joint location regression into the bone direction and length, we propose the TCG, a temporal convolutional network incorporating Gaussian error linear units (GELU), to solve bone direction. It enables more inter-frame features to be captured and makes the utmost of the feature relationships between data. Furthermore, we adopt kinematic structural information to solve bone length enhancing the use of intra-frame joint features. Finally, we design a loss function for joint training of the bone direction estimation network with the bone length estimation network. The proposed method has extensively experimented on the public benchmark dataset Human 3.6M. Both quantitative and qualitative experimental results showed that the proposed method can achieve more accurate 3D human pose estimations.
引用
收藏
页码:607 / 629
页数:23
相关论文
共 50 条
  • [41] A survey on deep 3D human pose estimation
    Neupane, Rama Bastola
    Li, Kan
    Boka, Tesfaye Fenta
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 58 (01)
  • [42] Deep 3D human pose estimation: A review
    Wang, Jinbao
    Tan, Shujie
    Zhen, Xiantong
    Xu, Shuo
    Zheng, Feng
    He, Zhenyu
    Shao, Ling
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 210
  • [43] View Invariant 3D Human Pose Estimation
    Wei, Guoqiang
    Lan, Cuiling
    Zeng, Wenjun
    Chen, Zhibo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (12) : 4601 - 4610
  • [44] 3D human pose estimation by depth map
    Wu, Jianzhai
    Hu, Dewen
    Xiang, Fengtao
    Yuan, Xingsheng
    Su, Jiongming
    VISUAL COMPUTER, 2020, 36 (07): : 1401 - 1410
  • [45] 3D Human Pose Estimation With Adversarial Learning
    Meng, Wenming
    Hu, Tao
    Shuai, Li
    2019 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV), 2019, : 93 - 99
  • [46] MONOCULAR 3D HUMAN POSE ESTIMATION BY CLASSIFICATION
    Greif, Thomas
    Lienhart, Rainer
    Sengupta, Debabrata
    2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2011,
  • [47] 3D human pose estimation by depth map
    Jianzhai Wu
    Dewen Hu
    Fengtao Xiang
    Xingsheng Yuan
    Jiongming Su
    The Visual Computer, 2020, 36 : 1401 - 1410
  • [48] 3D Human Pose Estimation Using Convolutional Neural Networks with 2D Pose Information
    Park, Sungheon
    Hwang, Jihye
    Kwak, Nojun
    COMPUTER VISION - ECCV 2016 WORKSHOPS, PT III, 2016, 9915 : 156 - 169
  • [49] 3D human pose estimation based on 2D-3D consistency with synchronized adversarial training
    Deng, Yicheng
    Sun, Cheng
    Sun, Yongqi
    Zhu, Jiahui
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2024, 175
  • [50] 3D Human Pose Estimation in Weightless Environments Using a Fisheye Camera
    Minoda, Koji
    Yairi, Takehisa
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 4100 - 4105