3D Reconstruction of Objects in Hands Without Real World 3D Supervision

被引:0
|
作者
Prakash, Aditya [1 ]
Chang, Matthew [1 ]
Jin, Matthew [1 ]
Tu, Ruisen [1 ]
Gupta, Saurabh [1 ]
机构
[1] Univ Illinois, Champaign, IL 61820 USA
来源
关键词
hand-held objects; shape priors; multiview supervision;
D O I
10.1007/978-3-031-73229-4_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Prior works for reconstructing hand-held objects from a single image train models on images paired with 3D shapes. Such data is challenging to gather in the real world at scale. Consequently, these approaches do not generalize well when presented with novel objects in-the-wild settings. While 3D supervision is a major bottleneck, there is an abundance of a) in-the-wild raw video data showing hand-object interactions and b) synthetic 3D shape collections. In this paper, we propose modules to leverage 3D supervision from these sources to scale up the learning of models for reconstructing hand-held objects. Specifically, we extract multiview 2D mask supervision from videos and 3D shape priors from shape collections. We use these indirect 3D cues to train occupancy networks that predict the 3D shape of objects from a single RGB image. Our experiments in the challenging object generalization setting on in-the-wild MOW dataset show 11.6% relative improvement over models trained with 3D supervision on existing datasets.
引用
收藏
页码:126 / 145
页数:20
相关论文
共 50 条
  • [1] Managing 3D objects for real world scenes reconstruction
    Amato, Flora
    Mazzeo, Antonino
    Moscato, Vincenzo
    Picariello, Antonio
    Sansone, Carlo
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2015, 11 (01) : 56 - 67
  • [2] Hands in Action: Real-Time 3D Reconstruction of Hands in Interaction with Objects
    Romero, Javier
    Kjellstrom, Hedvig
    Kragic, Danica
    2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2010, : 458 - 463
  • [3] What's in your hands? 3D Reconstruction of Generic Objects in Hands
    Ye, Yufei
    Gupta, Abhinav
    Tulsiani, Shubham
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 3885 - 3895
  • [4] SimpleRecon: 3D Reconstruction Without 3D Convolutions
    Sayed, Mohamed
    Gibson, John
    Watson, Jamie
    Prisacariu, Victor
    Firman, Michael
    Godard, Clement
    COMPUTER VISION - ECCV 2022, PT XXXIII, 2022, 13693 : 1 - 19
  • [5] Learning 3D Object Shape and Layout without 3D Supervision
    Gkioxari, Georgia
    Ravi, Nikhila
    Johnson, Justin
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1685 - 1694
  • [6] SURFACE RECONSTRUCTION OF 3D OBJECTS
    Li, Xiaokun
    Han, Chia Yung
    Wee, William G.
    COMPUTER VISION AND GRAPHICS (ICCVG 2004), 2006, 32 : 642 - 647
  • [7] Contactless 3D Fingerprint Identification Without 3D Reconstruction
    Zheng, Qian
    Kumar, Ajay
    Pan, Gang
    2018 6TH INTERNATIONAL WORKSHOP ON BIOMETRICS AND FORENSICS (IWBF), 2018,
  • [8] Perspective Transformer Nets: Learning Single-View 3D Object Reconstruction without 3D Supervision
    Yan, Xinchen
    Yang, Jimei
    Yumer, Ersin
    Guo, Yijie
    Lee, Honglak
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [9] 3D reconstruction of real objects with high resolution shape and texture
    Yemez, Y
    Schmitt, F
    IMAGE AND VISION COMPUTING, 2004, 22 (13) : 1137 - 1153
  • [10] 3D reconstruction of real world scenes using a low-cost 3D range scanner
    Dias, Paulo
    Matos, Miguel
    Santos, Vitor
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2006, 21 (07) : 486 - 497