3D Reconstruction of Objects in Hands Without Real World 3D Supervision

被引:0
|
作者
Prakash, Aditya [1 ]
Chang, Matthew [1 ]
Jin, Matthew [1 ]
Tu, Ruisen [1 ]
Gupta, Saurabh [1 ]
机构
[1] Univ Illinois, Champaign, IL 61820 USA
来源
关键词
hand-held objects; shape priors; multiview supervision;
D O I
10.1007/978-3-031-73229-4_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Prior works for reconstructing hand-held objects from a single image train models on images paired with 3D shapes. Such data is challenging to gather in the real world at scale. Consequently, these approaches do not generalize well when presented with novel objects in-the-wild settings. While 3D supervision is a major bottleneck, there is an abundance of a) in-the-wild raw video data showing hand-object interactions and b) synthetic 3D shape collections. In this paper, we propose modules to leverage 3D supervision from these sources to scale up the learning of models for reconstructing hand-held objects. Specifically, we extract multiview 2D mask supervision from videos and 3D shape priors from shape collections. We use these indirect 3D cues to train occupancy networks that predict the 3D shape of objects from a single RGB image. Our experiments in the challenging object generalization setting on in-the-wild MOW dataset show 11.6% relative improvement over models trained with 3D supervision on existing datasets.
引用
收藏
页码:126 / 145
页数:20
相关论文
共 50 条
  • [41] 3D Surface Reconstruction of Smooth and Textureless Objects
    Hafeez, Jahanzeb
    Kwon, Soon-Chul
    Lee, Seung-Hyun
    Hamacher, Alaric
    2017 INTERNATIONAL CONFERENCE ON EMERGING TRENDS & INNOVATION IN ICT (ICEI), 2017, : 145 - 149
  • [42] A 3D AVATAR MODELING OF REAL WORLD OBJECTS USING A DEPTH CAMERA
    Cho, Ji-Ho
    Kim, Hyun Soo
    Lee, Kwan H.
    2009 3DTV-CONFERENCE: THE TRUE VISION - CAPTURE, TRANSMISSION AND DISPLAY OF 3D VIDEO, 2009, : 165 - 168
  • [43] HOLD: Category-agnostic 3D Reconstruction of Interacting Hands and Objects from Video
    Fan, Zicong
    Parelli, Maria
    Kadoglou, Maria Eleni
    Chen, Xu
    Kocabas, Muhammed
    Black, Michael J.
    Hilliges, Otmar
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 494 - 504
  • [44] Enhancing Computational Thinking with 3D printing: Imagining, designing, and printing 3D objects to solve real-world problems
    Grizioti, Marianthi
    Kynigos, Chronis
    Nikolaou, Maria-Stella
    PROCEEDINGS OF ACM INTERACTION DESIGN AND CHILDREN CONFERENCE, IDC 2024, 2024, : 133 - 141
  • [45] COMMUNICATING IN THE REAL WORLD: 3D MIMO
    Cheng, Xiang
    Yu, Bo
    Yang, Liuqing
    Zhang, Jianhua
    Liu, Guangyi
    Wu, Yong
    Wan, Lei
    IEEE WIRELESS COMMUNICATIONS, 2014, 21 (04) : 136 - 144
  • [46] Real time tracking of 3D objects with occultations
    Jurie, F
    Dhome, M
    2001 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL I, PROCEEDINGS, 2001, : 413 - 416
  • [47] Color constancy improves for real 3D objects
    Hedrich, Monika
    Bloj, Marina
    Ruppertsberg, Alexa I.
    JOURNAL OF VISION, 2009, 9 (04):
  • [48] Robust real time tracking of 3D objects
    Masson, L
    Dhome, M
    Jurie, F
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, 2004, : 252 - 255
  • [49] Tracking 3D Deformable Objects in Real Time
    Silva, Tiago
    Magalhaes, Luis
    Ferreira, Manuel
    Khanal, Salik Ram
    Silva, Jorge
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 4, 2022, : 823 - 830
  • [50] 3D Interaction with Virtual Objects in Real Water
    Costa, Raphael
    Quarles, John
    2019 11TH INTERNATIONAL CONFERENCE ON VIRTUAL WORLDS AND GAMES FOR SERIOUS APPLICATIONS (VS-GAMES), 2019, : 48 - 54