Learning-Based Non-rigid Video Depth Estimation Using Invariants to Generalized Bas-Relief Transformations

被引:0
|
作者
Matteo Pedone
Abdelrahman Mostafa
Janne Heikkilä
机构
[1] University of Oulu,Center for Machine Vision and Signal Analysis
关键词
depth video estimation; Invariant; Moving frame; Deep learning; Bas-relief ambiguity;
D O I
暂无
中图分类号
学科分类号
摘要
We present a method to locally reconstruct dense video depth maps of a non-rigidly deformable object directly from a video sequence acquired by a static orthographic camera. The estimation of depth is performed locally on spatiotemporal patches of the video, and then, the full depth video is recovered by combining them together. Since the geometric complexity of a local spatiotemporal patch of a deforming non-rigid object is often simple enough to be faithfully represented with a parametric model, we artificially generate a database of small deforming rectangular meshes rendered with different material properties and light conditions, along with their corresponding depth videos, and use such data to train a convolutional neural network. Since the database images are rendered with an orthographic camera model, linear deformations along the optical axis cannot be recovered from the training images. These are known in the literature as generalized bas-relief (GBR) transformations. We address this ambiguity problem by employing the invariant-theoretic normalization procedure in order to obtain complete invariants with respect to this group of transformations, and use them in the loss function of a neural network. We tested our method on both synthetic and Kinect data and experimentally observed that the reconstruction error is significantly lower than the one obtained using conventional non-rigid structure from motion approaches and state-of-the-art video depth estimation techniques.
引用
收藏
页码:993 / 1009
页数:16
相关论文
共 45 条
  • [41] MR-MOTUS: model-based non-rigid motion estimation for MR-guided radiotherapy using a reference image and minimal k-space data
    Huttinga, Niek R. F.
    van den Berg, Cornelis A. T.
    Luijten, Peter R.
    Sbrizzi, Alessandro
    PHYSICS IN MEDICINE AND BIOLOGY, 2020, 65 (01):
  • [42] Depth estimation of surface-opening crack in concrete beams using impact-echo and non-contact video-based methods
    Yamin Sun
    Pingming Huang
    Jufeng Su
    Tao Wang
    EURASIP Journal on Image and Video Processing, 2018
  • [43] Depth estimation of surface-opening crack in concrete beams using impact-echo and non-contact video-based methods
    Sun, Yamin
    Huang, Pingming
    Su, Jufeng
    Wang, Tao
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2018,
  • [44] Deep Learning-Based Energy Expenditure Estimation in Assisted and Non-Assisted Gait Using Inertial, EMG, and Heart Rate Wearable Sensors
    Lopes, Joao M.
    Figueiredo, Joana
    Fonseca, Pedro
    Cerqueira, Joao J.
    Vilas-Boas, Joao P.
    Santos, Cristina P.
    SENSORS, 2022, 22 (20)
  • [45] Maize Silage Kernel Fragment Estimation Using Deep Learning-Based Object Recognition in Non-Separated Kernel/Stover RGB Images
    Rasmussen, Christoffer Bogelund
    Moeslund, Thomas B.
    SENSORS, 2019, 19 (16)