Semi-Supervised 3D Hand-Object Poses Estimation with Interactions in Time

被引:58
|
作者
Liu, Shaowei [1 ]
Jiang, Hanwen [1 ]
Xu, Jiarui [1 ]
Liu, Sifei [2 ]
Wang, Xiaolong [1 ]
机构
[1] Univ Calif San Diego, La Jolla, CA 92093 USA
[2] NVIDIA, Santa Clara, CA USA
关键词
D O I
10.1109/CVPR46437.2021.01445
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Estimating 3D hand and object pose from a single image is an extremely challenging problem: hands and objects are often self-occluded during interactions, and the 3D annotations are scarce as even humans cannot directly label the ground-truths from a single image perfectly. To tackle these challenges, we propose a unified framework for estimating the 3D hand and object poses with semi-supervised learning. We build a joint learning framework where we perform explicit contextual reasoning between hand and object representations. Going beyond limited 3D annotations in a single image, we leverage the spatial-temporal consistency in large-scale hand-object videos as a constraint for generating pseudo labels in semi-supervised learning. Our method not only improves hand pose estimation in challenging real-world dataset, but also substantially improve the object pose which has fewer ground-truths per instance. By training with large-scale diverse videos, our model also generalizes better across multiple out-of-domain datasets. Project page and code: https://stevenlsw.github.io/Semi-Hand-Object.
引用
收藏
页码:14682 / 14692
页数:11
相关论文
共 50 条
  • [1] SEMI-SUPERVISED 3D HAND-OBJECT POSE ESTIMATION VIA POSE DICTIONARY LEARNING
    Cheng, Zida
    Chen, Siheng
    Zhang, Ya
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3632 - 3636
  • [2] S2Contact: Graph-Based Network for 3D Hand-Object Contact Estimation with Semi-supervised Learning
    Tse, Tze Ho Elden
    Zhang, Zhongqun
    Kim, Kwang In
    Leonardis, Ales
    Zheng, Feng
    Chang, Hyung Jin
    COMPUTER VISION - ECCV 2022, PT I, 2022, 13661 : 568 - 584
  • [3] 3D Object Reconstruction from Hand-Object Interactions
    Tzionas, Dimitrios
    Gall, Juergen
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 729 - 737
  • [4] H plus O: Unified Egocentric Recognition of 3D Hand-Object Poses and Interactions
    Tekin, Bugra
    Bogo, Federica
    Pollefeys, Marc
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4506 - 4515
  • [5] Semi-Supervised 3D Hand Shape and Pose Estimation with Label Propagation
    Kaviani, Samira
    Rahimi, Amir
    Hartley, Richard
    2021 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA 2021), 2021, : 1 - 8
  • [6] REAL-TIME 3D HAND-OBJECT POSE ESTIMATION FOR MOBILE DEVICES
    Yin, Yue
    McCarthy, Chris
    Rezazadegan, Dana
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3288 - 3292
  • [7] Semi-supervised 3D Object Detection with PatchTeacher and PillarMix
    Wu, Xiaopei
    Peng, Liang
    Xie, Liang
    Hou, Yuenan
    Lin, Binbin
    Huang, Xiaoshui
    Liu, Haifeng
    Cai, Deng
    Ouyang, Wanli
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 6153 - 6161
  • [8] Semi-supervised 3D Object Detection with Proficient Teachers
    Yin, Junbo
    Fang, Jin
    Zhou, Dingfu
    Zhang, Liangjun
    Xu, Cheng-Zhong
    Shen, Jianbing
    Wang, Wenguan
    COMPUTER VISION, ECCV 2022, PT XXXVIII, 2022, 13698 : 727 - 743
  • [9] Interaction Fusion: Real-time Reconstruction of Hand Poses and Deformable Objects in Hand-object Interactions
    Zhang, Hao
    Bo, Zi-Hao
    Yong, Jun-Hai
    Xu, Feng
    ACM TRANSACTIONS ON GRAPHICS, 2019, 38 (04):
  • [10] Not Every Side Is Equal: Localization Uncertainty Estimation for Semi-Supervised 3D Object Detection
    Wang, Chuxin
    Yang, Wenfei
    Zhang, Tianzhu
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 3791 - 3801