Monocular Human Depth Estimation Via Pose Estimation

被引:3
|
作者
Jun, Jinyoung [1 ]
Lee, Jae-Han [1 ]
Lee, Chul [2 ]
Kim, Chang-Su [1 ]
机构
[1] Korea Univ, Sch Elect Engn, Seoul 02841, South Korea
[2] Dongguk Univ, Dept Multimedia Engn, Seoul 04620, South Korea
来源
IEEE ACCESS | 2021年 / 9卷
基金
新加坡国家研究基金会;
关键词
Monocular depth estimation; human pose estimation; human depth estimation; loss rebalancing strategy;
D O I
10.1109/ACCESS.2021.3126629
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a novel monocular depth estimator, which improves the prediction accuracy on human regions by utilizing pose information. The proposed algorithm consists of two networks - PoseNet and DepthNet - to estimate keypoint heatmaps and a depth map, respectively. We incorporate the pose information from PoseNet to improve the depth estimation performance of DepthNet. Specifically, we develop the feature blending block, which fuses the features from PoseNet and DepthNet and feeds them into the next layer of DepthNet, to make the networks learn to predict the depths of human regions more accurately. Furthermore, we develop a novel joint training scheme using partially labeled datasets, which balances multiple loss functions effectively by adjusting weights. Experimental results demonstrate that the proposed algorithm can improve depth estimation performance significantly, especially around human regions. For example, the proposed algorithm improves the depth estimation performance on the human regions of ResNet-50 by 2.8% and 7.0% in terms of delta 1 and RMSE, respectively, on the proposed HD + P dataset.
引用
收藏
页码:151444 / 151457
页数:14
相关论文
共 50 条
  • [1] Posebits for Monocular Human Pose Estimation
    Pons-Moll, Gerard
    Fleet, David J.
    Rosenhahn, Bodo
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 2345 - 2352
  • [2] Monocular 3D Human Pose Estimation by Predicting Depth on Joints
    Nie, Bruce Xiaohan
    Wei, Ping
    Zhu, Song-Chun
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 3467 - 3475
  • [3] Enhancing Self-supervised Monocular Depth Estimation via Piece-Wise Pose Estimation and Geometric Constraints
    Shyam, Pranjay
    Okon, Alexandre
    Yoo, HyunJin
    [J]. 2024 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS, WACVW 2024, 2024, : 221 - 231
  • [4] Pose estimation via structure-depth information from monocular endoscopy images sequence
    Liu, Shiyuan
    Fan, Jingfan
    Zang, Liugeng
    Yang, Yun
    Fu, Tianyu
    Song, Hong
    Wang, Yongtian
    Yang, Jian
    [J]. BIOMEDICAL OPTICS EXPRESS, 2024, 15 (01): : 460 - 478
  • [5] Monocular head pose estimation
    Martins, Pedro
    Batista, Jorge
    [J]. IMAGE ANALYSIS AND RECOGNITION, PROCEEDINGS, 2008, 5112 : 357 - 368
  • [6] HUMAN POSE ESTIMATION FROM MONOCULAR IMAGE CAPTURES
    Lin, Huei-Yung
    Chen, Ting-Wen
    Chen, Chih-Chang
    Hsieh, Chia-Hao
    Lie, Wen-Nung
    [J]. ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 994 - +
  • [7] A Unified Framework for Depth-Assisted Monocular Object Pose Estimation
    Hoang, Dinh-Cuong
    Tan, Phan Xuan
    Nguyen, Thu-Uyen
    Pham, Hai-Nam
    Nguyen, Chi-Minh
    Bui, Son-Anh
    Duong, Quang-Tri
    Vu, van-Duc
    Nguyen, van-Thiep
    Duong, van-Hiep
    Hoang, Ngoc-Anh
    Phan, Khanh-Toan
    Tran, Duc-Thanh
    Ho, Ngoc-Trung
    Tran, Cong-Trinh
    [J]. IEEE ACCESS, 2024, 12 : 111723 - 111740
  • [8] Human Pose Estimation Combined with Depth Information
    Sha, Yuan
    Shi, Ping
    Pan, Da
    Zhou, Shaojing
    [J]. PROCEEDINGS OF 2016 IEEE ADVANCED INFORMATION MANAGEMENT, COMMUNICATES, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IMCEC 2016), 2016, : 663 - 667
  • [9] Unsupervised Human Pose Estimation on Depth Images
    Blanc-Beyne, Thibault
    Carlier, Axel
    Mouysset, Sandrine
    Charvillat, Vincent
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE TRACK, ECML PKDD 2020, PT IV, 2021, 12460 : 358 - 373
  • [10] Depth Estimation of a Deformable Object via a Monocular Camera
    Jiang, Guolai
    Jin, Shaokun
    Ou, Yongsheng
    Zhou, Shoujun
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (07):