3D Reconstruction of Objects in Hands Without Real World 3D Supervision

被引:0
|
作者
Prakash, Aditya [1 ]
Chang, Matthew [1 ]
Jin, Matthew [1 ]
Tu, Ruisen [1 ]
Gupta, Saurabh [1 ]
机构
[1] Univ Illinois, Champaign, IL 61820 USA
来源
关键词
hand-held objects; shape priors; multiview supervision;
D O I
10.1007/978-3-031-73229-4_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Prior works for reconstructing hand-held objects from a single image train models on images paired with 3D shapes. Such data is challenging to gather in the real world at scale. Consequently, these approaches do not generalize well when presented with novel objects in-the-wild settings. While 3D supervision is a major bottleneck, there is an abundance of a) in-the-wild raw video data showing hand-object interactions and b) synthetic 3D shape collections. In this paper, we propose modules to leverage 3D supervision from these sources to scale up the learning of models for reconstructing hand-held objects. Specifically, we extract multiview 2D mask supervision from videos and 3D shape priors from shape collections. We use these indirect 3D cues to train occupancy networks that predict the 3D shape of objects from a single RGB image. Our experiments in the challenging object generalization setting on in-the-wild MOW dataset show 11.6% relative improvement over models trained with 3D supervision on existing datasets.
引用
收藏
页码:126 / 145
页数:20
相关论文
共 50 条
  • [21] 3D objects
    Anon
    2002, F and W Publications (17):
  • [22] 3D objects
    HOW, 1999, 14 (05):
  • [23] Reconstruction and 3D visualisation based on objective real 3D based documentation
    Michael J. Bolliger
    Ursula Buck
    Michael J. Thali
    Stephan A. Bolliger
    Forensic Science, Medicine, and Pathology, 2012, 8 : 208 - 217
  • [24] Reconstruction and 3D visualisation based on objective real 3D based documentation
    Bolliger, Michael J.
    Buck, Ursula
    Thali, Michael J.
    Bolliger, Stephan A.
    FORENSIC SCIENCE MEDICINE AND PATHOLOGY, 2012, 8 (03) : 208 - 217
  • [25] Real-time active 3D shape reconstruction for 3D video
    Wu, X
    Matsuyama, T
    ISPA 2003: PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, PTS 1 AND 2, 2003, : 186 - 191
  • [26] Learning to Regress 3D Face Shape and Expression from an Image without 3D Supervision
    Sanyal, Soubhik
    Bolkart, Timo
    Feng, Haiwen
    Black, Michael J.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7755 - 7764
  • [27] 3D Motion Reconstruction for Real-World Camera Motion
    Zhu, Yingying
    Cox, Mark
    Lucey, Simon
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011,
  • [28] A Real World Dataset for Multi-view 3D Reconstruction
    Shrestha, Rakesh
    Hu, Siqi
    Gou, Minghao
    Liu, Ziyuan
    Tan, Ping
    COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 56 - 73
  • [29] 3D interval wavelet moments for 3D objects
    Cui, Li
    Li, Ying
    Li, Hua
    Journal of Information and Computational Science, 2009, 6 (04): : 1799 - 1810
  • [30] Photogrammetric 3D Reconstruction of Small Objects for a Real-Time Fruition
    De Paolis, Lucio Tommaso
    De Luca, Valerio
    Gatto, Carola
    D'Errico, Giovanni
    Paladini, Giovanna Ilenia
    AUGMENTED REALITY, VIRTUAL REALITY, AND COMPUTER GRAPHICS, AVR 2020, PT I, 2020, 12242 : 375 - 394