HDM-Net: Monocular Non-rigid 3D Reconstruction with Learned Deformation Model

被引：11

作者：

Golyanik, Vladislav ^{[1
,2
]}

Shimada, Soshi ^{[1
,2
]}

Varanasi, Kiran ^{[1
]}

Stricker, Didier ^{[1
,2
]}

机构：

[1] DFKI, Augmented Vis, Kaiserslautern, Germany

[2] Univ Kaiserslautern, Kaiserslautern, Germany

来源：

VIRTUAL REALITY AND AUGMENTED REALITY, EUROVR 2018 | 2018年 / 11162卷

关键词：

Monocular non-rigid reconstruction; Hybrid deformation model; Deep neural network; STRUCTURE-FROM-MOTION; SHAPE; FACTORIZATION; SURFACES;

D O I：

10.1007/978-3-030-01790-3_4

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Monocular dense 3D reconstruction of deformable objects is a hard ill-posed problem in computer vision. Current techniques either require dense correspondences and rely on motion and deformation cues, or assume a highly accurate reconstruction (referred to as a template) of at least a single frame given in advance and operate in the manner of non-rigid tracking. Accurate computation of dense point tracks often requires multiple frames and might be computationally expensive. Availability of a template is a very strong prior which restricts system operation to a pre-defined environment and scenarios. In this work, we propose a new hybrid approach for monocular non-rigid reconstruction which we call Hybrid Deformation Model Network (HDM-Net). In our approach, a deformation model is learned by a deep neural network, with a combination of domain-specific loss functions. We train the network with multiple states of a non-rigidly deforming structure with a known shape at rest. HDM-Net learns different reconstruction cues including texture-dependent surface deformations, shading and contours. We show generalisability of HDM-Net to states not presented in the training dataset, with unseen textures and under new illumination conditions. Experiments with noisy data and a comparison with other methods demonstrate the robustness and accuracy of the proposed approach and suggest possible application scenarios of the new technique in interventional diagnostics and augmented reality.

引用

页码：51 / 72

页数：22

共 50 条

[1] State of the Art in Dense Monocular Non-Rigid 3D Reconstruction
Tretschk, Edith
Kairanda, Navami
Mallikarjun, B. R.
Dabral, Rishabh
Kortylewski, Adam
Egger, Bernhard
Habermann, Marc
Fua, Pascal
Theobalt, Christian
Golyanik, Vladislav
[J]. COMPUTER GRAPHICS FORUM, 2023, 42 (02) : 485 - 520
[2] 3D Reconstruction of Non-Rigid Surfaces from Realistic Monocular Video
Sepehrinour, Maryam
Kasaei, Shohreh
[J]. 2015 9TH IRANIAN CONFERENCE ON MACHINE VISION AND IMAGE PROCESSING (MVIP), 2015, : 199 - 202
[3] Beyond Feature Points: Structured Prediction for Monocular Non-rigid 3D Reconstruction
Salzmann, Mathieu
Urtasun, Raquel
[J]. COMPUTER VISION - ECCV 2012, PT IV, 2012, 7575 : 245 - 259
[4] IsMo-GAN: Adversarial Learning for Monocular Non-Rigid 3D Reconstruction
Shimada, Soshi
Golyanik, Vladislav
Theobalt, Christian
Stricker, Didier
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 2876 - 2885
[5] Root Pose Decomposition Towards Generic Non-rigid 3D Reconstruction with Monocular Videos
Wang, Yikai
Dong, Yinpeng
Sun, Fuchun
Yang, Xiao
[J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 13844 - 13854
[6] Accurate reconstruction of non-rigid 3D shapes
Koh, Sung Shik
Zin, Thi Thi
Hama, Hiromitsu
[J]. ICCE: 2007 DIGEST OF TECHNICAL PAPERS INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, 2007, : 369 - +
[7] Unsupervised 3D Reconstruction and Grouping of Rigid and Non-Rigid Categories
Agudo, Antonio
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (01) : 519 - 532
[8] Template-Based 3D Reconstruction of Non-rigid Deformable Object from Monocular Video
Liu, Yang
Peng, Xiaodong
Zhou, Wugen
Liu, Bo
Gerndt, Andreas
[J]. 3D RESEARCH, 2018, 9 (02):
[9] KillingFusion: Non-rigid 3D Reconstruction without Correspondences
Slavcheva, Miroslava
Baust, Maximilian
Cremers, Daniel
Ilic, Slobodan
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5474 - 5483
[10] Non-rigid 3D Object Retrieval with a Learned Shape Descriptor
Shi, Xiangfu
Zhao, Jieyu
Zhang, Long
Ye, Xulun
[J]. IMAGE AND GRAPHICS (ICIG 2017), PT II, 2017, 10667 : 24 - 37

← 1 2 3 4 5 →