HDM-Net: Monocular Non-rigid 3D Reconstruction with Learned Deformation Model

被引:11
|
作者
Golyanik, Vladislav [1 ,2 ]
Shimada, Soshi [1 ,2 ]
Varanasi, Kiran [1 ]
Stricker, Didier [1 ,2 ]
机构
[1] DFKI, Augmented Vis, Kaiserslautern, Germany
[2] Univ Kaiserslautern, Kaiserslautern, Germany
关键词
Monocular non-rigid reconstruction; Hybrid deformation model; Deep neural network; STRUCTURE-FROM-MOTION; SHAPE; FACTORIZATION; SURFACES;
D O I
10.1007/978-3-030-01790-3_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Monocular dense 3D reconstruction of deformable objects is a hard ill-posed problem in computer vision. Current techniques either require dense correspondences and rely on motion and deformation cues, or assume a highly accurate reconstruction (referred to as a template) of at least a single frame given in advance and operate in the manner of non-rigid tracking. Accurate computation of dense point tracks often requires multiple frames and might be computationally expensive. Availability of a template is a very strong prior which restricts system operation to a pre-defined environment and scenarios. In this work, we propose a new hybrid approach for monocular non-rigid reconstruction which we call Hybrid Deformation Model Network (HDM-Net). In our approach, a deformation model is learned by a deep neural network, with a combination of domain-specific loss functions. We train the network with multiple states of a non-rigidly deforming structure with a known shape at rest. HDM-Net learns different reconstruction cues including texture-dependent surface deformations, shading and contours. We show generalisability of HDM-Net to states not presented in the training dataset, with unseen textures and under new illumination conditions. Experiments with noisy data and a comparison with other methods demonstrate the robustness and accuracy of the proposed approach and suggest possible application scenarios of the new technique in interventional diagnostics and augmented reality.
引用
收藏
页码:51 / 72
页数:22
相关论文
共 50 条
  • [1] State of the Art in Dense Monocular Non-Rigid 3D Reconstruction
    Tretschk, Edith
    Kairanda, Navami
    Mallikarjun, B. R.
    Dabral, Rishabh
    Kortylewski, Adam
    Egger, Bernhard
    Habermann, Marc
    Fua, Pascal
    Theobalt, Christian
    Golyanik, Vladislav
    [J]. COMPUTER GRAPHICS FORUM, 2023, 42 (02) : 485 - 520
  • [2] 3D Reconstruction of Non-Rigid Surfaces from Realistic Monocular Video
    Sepehrinour, Maryam
    Kasaei, Shohreh
    [J]. 2015 9TH IRANIAN CONFERENCE ON MACHINE VISION AND IMAGE PROCESSING (MVIP), 2015, : 199 - 202
  • [3] Beyond Feature Points: Structured Prediction for Monocular Non-rigid 3D Reconstruction
    Salzmann, Mathieu
    Urtasun, Raquel
    [J]. COMPUTER VISION - ECCV 2012, PT IV, 2012, 7575 : 245 - 259
  • [4] IsMo-GAN: Adversarial Learning for Monocular Non-Rigid 3D Reconstruction
    Shimada, Soshi
    Golyanik, Vladislav
    Theobalt, Christian
    Stricker, Didier
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 2876 - 2885
  • [5] Root Pose Decomposition Towards Generic Non-rigid 3D Reconstruction with Monocular Videos
    Wang, Yikai
    Dong, Yinpeng
    Sun, Fuchun
    Yang, Xiao
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 13844 - 13854
  • [6] Accurate reconstruction of non-rigid 3D shapes
    Koh, Sung Shik
    Zin, Thi Thi
    Hama, Hiromitsu
    [J]. ICCE: 2007 DIGEST OF TECHNICAL PAPERS INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, 2007, : 369 - +
  • [7] Unsupervised 3D Reconstruction and Grouping of Rigid and Non-Rigid Categories
    Agudo, Antonio
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (01) : 519 - 532
  • [8] Template-Based 3D Reconstruction of Non-rigid Deformable Object from Monocular Video
    Liu, Yang
    Peng, Xiaodong
    Zhou, Wugen
    Liu, Bo
    Gerndt, Andreas
    [J]. 3D RESEARCH, 2018, 9 (02):
  • [9] KillingFusion: Non-rigid 3D Reconstruction without Correspondences
    Slavcheva, Miroslava
    Baust, Maximilian
    Cremers, Daniel
    Ilic, Slobodan
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5474 - 5483
  • [10] Non-rigid 3D Object Retrieval with a Learned Shape Descriptor
    Shi, Xiangfu
    Zhao, Jieyu
    Zhang, Long
    Ye, Xulun
    [J]. IMAGE AND GRAPHICS (ICIG 2017), PT II, 2017, 10667 : 24 - 37