Learning-Based Non-rigid Video Depth Estimation Using Invariants to Generalized Bas-Relief Transformations

Cited by: 0

Authors:

Matteo Pedone

Abdelrahman Mostafa

Janne Heikkilä

Affiliations:

[1] University of Oulu, Center for Machine Vision and Signal Analysis

Source:

Journal of Mathematical Imaging and Vision | 2022 / Vol. 64

Keywords:

Depth video estimation; Invariant; Moving frame; Deep learning; Bas-relief ambiguity

DOI:

Not available

Chinese Library Classification number:

Subject classification number:

Abstract:

We present a method to locally reconstruct dense video depth maps of a non-rigidly deformable object directly from a video sequence acquired by a static orthographic camera. The estimation of depth is performed locally on spatiotemporal patches of the video, and then, the full depth video is recovered by combining them together. Since the geometric complexity of a local spatiotemporal patch of a deforming non-rigid object is often simple enough to be faithfully represented with a parametric model, we artificially generate a database of small deforming rectangular meshes rendered with different material properties and light conditions, along with their corresponding depth videos, and use such data to train a convolutional neural network. Since the database images are rendered with an orthographic camera model, linear deformations along the optical axis cannot be recovered from the training images. These are known in the literature as generalized bas-relief (GBR) transformations. We address this ambiguity problem by employing the invariant-theoretic normalization procedure in order to obtain complete invariants with respect to this group of transformations, and use them in the loss function of a neural network. We tested our method on both synthetic and Kinect data and experimentally observed that the reconstruction error is significantly lower than the one obtained using conventional non-rigid structure from motion approaches and state-of-the-art video depth estimation techniques.
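The abstract describes removing the GBR ambiguity by normalization. As a rough illustration only, the sketch below assumes a single depth frame on a zero-centered pixel grid and a GBR map of the form z' = lam * z + mu * x + nu * y with lam > 0. It subtracts the least-squares plane and rescales to unit norm, so the result is unchanged under such maps. This is not the authors' moving-frame procedure, which works on spatiotemporal patches and is not detailed in this record; the function names (`centered_grid`, `apply_gbr`, `normalize_depth`) are hypothetical, and removing the constant offset is an extra assumption.

```python
import math


def centered_grid(h, w):
    """Zero-centered pixel coordinates, so that x, y and the constant 1
    are mutually orthogonal over the grid."""
    xs = [j - (w - 1) / 2 for j in range(w)]
    ys = [i - (h - 1) / 2 for i in range(h)]
    return xs, ys


def apply_gbr(depth, lam, mu, nu):
    """Hypothetical helper: apply z' = lam * z + mu * x + nu * y."""
    h, w = len(depth), len(depth[0])
    xs, ys = centered_grid(h, w)
    return [[lam * depth[i][j] + mu * xs[j] + nu * ys[i] for j in range(w)]
            for i in range(h)]


def normalize_depth(depth):
    """Subtract the least-squares plane, then rescale to unit norm.

    Assumes a patch of at least 2 x 2 pixels and lam > 0.
    """
    h, w = len(depth), len(depth[0])
    xs, ys = centered_grid(h, w)
    mean = sum(sum(row) for row in depth) / (h * w)
    # On a centered grid the plane coefficients decouple into simple ratios.
    sxx = h * sum(x * x for x in xs)
    syy = w * sum(y * y for y in ys)
    a = sum(depth[i][j] * xs[j] for i in range(h) for j in range(w)) / sxx
    b = sum(depth[i][j] * ys[i] for i in range(h) for j in range(w)) / syy
    resid = [[depth[i][j] - mean - a * xs[j] - b * ys[i] for j in range(w)]
             for i in range(h)]
    norm = math.sqrt(sum(r * r for row in resid for r in row))
    if norm == 0.0:
        raise ValueError("planar patch: normalization is undefined")
    return [[r / norm for r in row] for row in resid]
```

Under these assumptions, `normalize_depth(patch)` and `normalize_depth(apply_gbr(patch, lam, mu, nu))` agree up to floating-point error for any lam > 0, which is the kind of property that lets a loss compare predictions and ground truth without penalizing the unrecoverable component.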

Citation

Pages: 993-1009

Number of pages: 16

