Semi-Supervised 3D Hand-Object Poses Estimation with Interactions in Time

被引:58
|
作者
Liu, Shaowei [1 ]
Jiang, Hanwen [1 ]
Xu, Jiarui [1 ]
Liu, Sifei [2 ]
Wang, Xiaolong [1 ]
机构
[1] Univ Calif San Diego, La Jolla, CA 92093 USA
[2] NVIDIA, Santa Clara, CA USA
关键词
D O I
10.1109/CVPR46437.2021.01445
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Estimating 3D hand and object pose from a single image is an extremely challenging problem: hands and objects are often self-occluded during interactions, and the 3D annotations are scarce as even humans cannot directly label the ground-truths from a single image perfectly. To tackle these challenges, we propose a unified framework for estimating the 3D hand and object poses with semi-supervised learning. We build a joint learning framework where we perform explicit contextual reasoning between hand and object representations. Going beyond limited 3D annotations in a single image, we leverage the spatial-temporal consistency in large-scale hand-object videos as a constraint for generating pseudo labels in semi-supervised learning. Our method not only improves hand pose estimation in challenging real-world dataset, but also substantially improve the object pose which has fewer ground-truths per instance. By training with large-scale diverse videos, our model also generalizes better across multiple out-of-domain datasets. Project page and code: https://stevenlsw.github.io/Semi-Hand-Object.
引用
收藏
页码:14682 / 14692
页数:11
相关论文
共 50 条
  • [21] HOnnotate: A method for 3D Annotation of Hand and Object Poses
    Hampali, Shreyas
    Rad, Mahdi
    Oberweger, Markus
    Lepetit, Vincent
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3193 - 3203
  • [22] Semi-Supervised Online Continual Learning for 3D Object Detection in Mobile Robotics
    Liu, Binhong
    Yao, Dexin
    Yang, Rui
    Yan, Zhi
    Yang, Tao
    Journal of Intelligent and Robotic Systems: Theory and Applications, 2024, 110 (04):
  • [23] 3DIoUMatch: Leveraging IoU Prediction for Semi-Supervised 3D Object Detection
    Wang, He
    Cong, Yezhen
    Litany, Or
    Gao, Yue
    Guibas, Leonidas J.
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 14610 - 14619
  • [24] Semi-supervised Monocular 3D Object Detection by Multi-view Consistency
    Lian, Qing
    Xu, Yanbo
    Yao, Weilong
    Chen, Yingcong
    Zhang, Tong
    COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 715 - 731
  • [25] Semi-supervised 3D Object Detection via Temporal Graph Neural Networks
    Wang, Jianren
    Gang, Haiming
    Ancha, Siddarth
    Chen, Yi-Ting
    Held, David
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 413 - 422
  • [26] Hierarchical Supervision and Shuffle Data Augmentation for 3D Semi-Supervised Object Detection
    Liu, Chuandong
    Gao, Chenqiang
    Liu, Fangcen
    Li, Pengcheng
    Meng, Deyu
    Gao, Xinbo
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23819 - 23828
  • [27] SEMI-SUPERVISED 3D OBJECT DETECTION VIA ADAPTIVE PSEUDO-LABELING
    Xu, Hongyi
    Liu, Fengqi
    Zhou, Qianyu
    Hao, Jinkun
    Cao, Zhijie
    Feng, Zhengyang
    Ma, Lizhuang
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3183 - 3187
  • [28] Hardness-Aware Scene Synthesis for Semi-Supervised 3D Object Detection
    Zeng, Shuai
    Zheng, Wenzhao
    Lu, Jiwen
    Yan, Haibin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9644 - 9656
  • [29] SO-HandNet: Self-Organizing Network for 3D Hand Pose Estimation with Semi-supervised Learning
    Chen, Yujin
    Tu, Zhigang
    Ge, Liuhao
    Zhang, Dejun
    Chen, Ruizhi
    Yuan, Junsong
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6960 - 6969
  • [30] ArtiBoost: Boosting Articulated 3D Hand-Object Pose Estimation via Online Exploration and Synthesis
    Yang, Lixin
    Li, Kailin
    Zhan, Xinyu
    Lv, Jun
    Xu, Wenqiang
    Li, Jiefeng
    Lu, Cewu
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2740 - 2750