Fully Convolutional Denoising Autoencoder for 3D Scene Reconstruction from a single depth image

Authors
Palla, Alessandro [1]
Moloney, David [1]
Fanucci, Luca [2]
Affiliations
[1] Intel, Movidius, Comp Vis & Machine Learning Grp, Dublin, Ireland
[2] Univ Pisa, Dept Informat Engn, Pisa, Italy
Keywords
AI; CNN; Voxel; Point Cloud; Scene Reconstruction; Deep Learning; Autoencoder; Neural Network;
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202 ;
Abstract
In this work, we propose a 3D scene reconstruction algorithm based on a fully convolutional 3D denoising autoencoder neural network. The network is capable of reconstructing a full scene from a single depth image by creating a 3D representation of it and automatically filling holes and inserting hidden elements. We exploit the fact that our neural network is capable of generalizing object shapes by inferring similarities in geometry. Our fully convolutional architecture means the network is not constrained to a fixed 3D shape, so it can successfully reconstruct scenes of arbitrary size. Our algorithm was evaluated on a real-world dataset of tabletop scenes acquired using a Kinect and processed using KinectFusion software in order to obtain ground truth for network training and evaluation. Extensive measurements show that our deep neural network architecture outperforms the previous state of the art in terms of both precision and recall for the scene reconstruction task. The network has been broadly profiled in terms of memory footprint, number of floating-point operations, inference time and power consumption on CPU, GPU and embedded devices. Its small memory footprint and low computational requirements enable low-power, memory-constrained, real-time, always-on embedded applications such as autonomous vehicles, warehouse robots, interactive gaming controllers and drones.
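The abstract reports precision and recall for the reconstruction task without giving their exact definition. As a minimal sketch, assuming both the reconstructed scene and the KinectFusion ground truth are represented as sets of occupied voxel coordinates (an assumption, not something the record states), the two metrics could be computed as follows. The function name `voxel_precision_recall` and the toy scene are hypothetical and for illustration only.

```python
from typing import Set, Tuple

# A voxel is identified by its integer (x, y, z) grid index.
Voxel = Tuple[int, int, int]


def voxel_precision_recall(
    predicted: Set[Voxel], ground_truth: Set[Voxel]
) -> Tuple[float, float]:
    """Return (precision, recall) over occupied voxels.

    Assumed definitions (not taken from the paper):
      precision = correctly predicted occupied voxels / all predicted occupied voxels
      recall    = correctly predicted occupied voxels / all ground-truth occupied voxels
    An empty denominator yields 0.0 rather than raising an error.
    """
    true_positives = len(predicted & ground_truth)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(ground_truth) if ground_truth else 0.0
    return precision, recall


# Toy scene: four occupied ground-truth voxels along the x axis.
ground_truth = {(0, 0, 0), (1, 0, 0), (2, 0, 0), (3, 0, 0)}
# The hypothetical reconstruction recovers three of them and adds one spurious voxel.
predicted = {(0, 0, 0), (1, 0, 0), (2, 0, 0), (0, 1, 0)}

precision, recall = voxel_precision_recall(predicted, ground_truth)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

Under these assumed definitions, a reconstruction that fills holes correctly raises recall, while one that inserts geometry not present in the ground truth lowers precision.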
Pages: 566-575
Number of pages: 10
Related papers
50 in total
  • [1] Face Denoising and 3D Reconstruction from A Single Depth Image
    Zhong, Yicheng
    Pei, Yuru
    Li, Peixin
    Guo, Yuke
    Ma, Gengyu
    Liu, Meng
    Bai, Wei
    Wu, WenHai
    Zha, Hongbin
    [J]. 2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020), 2020, : 117 - 124
  • [2] Panoptic 3D Scene Reconstruction From a Single RGB Image
    Dahnert, Manuel
    Hou, Ji
    Niessner, Matthias
    Dai, Angela
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [3] Learning 3D Scene Semantics and Structure from a Single Depth Image
    Yang, Bo
    Lai, Zihang
    Lu, Xiaoxuan
    Lin, Shuyu
    Wen, Hongkai
    Markham, Andrew
    Trigoni, Niki
    [J]. PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 422 - 425
  • [4] Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image
    Huang, Siyuan
    Qi, Siyuan
    Zhu, Yixin
    Xiao, Yinxue
    Xu, Yuanlu
    Zhu, Song-Chun
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 194 - 211
  • [5] Stage-Based 3D Scene Reconstruction from Single Image
    Liu, Yixian
    Hao, Pengwei
    Izquierdo, Ebroul
    [J]. 2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 1034 - 1037
  • [6] Fused voxel autoencoder for single image to 3D object reconstruction
    Turhan, C. Guzel
    Bilge, H. S.
    [J]. ELECTRONICS LETTERS, 2020, 56 (03) : 134 - 136
  • [7] An Unsupervised Approach for 3D Face Reconstruction from a Single Depth Image
    Li, Peixin
    Pei, Yuru
    Zhong, Yicheng
    Guo, Yuke
    Ma, Gengyu
    Liu, Meng
    Bai, Wei
    Wu, Wenhai
    Zha, Hongbin
    [J]. ADVANCES IN COMPUTER GRAPHICS, CGI 2020, 2020, 12221 : 206 - 219
  • [8] CGAN-Based Forest Scene 3D Reconstruction from a Single Image
    Li, Yuan
    Kan, Jiangming
    [J]. FORESTS, 2024, 15 (01):
  • [9] Towards Accurate Reconstruction of 3D Scene Shape From A Single Monocular Image
    Yin, Wei
    Zhang, Jianming
    Wang, Oliver
    Niklaus, Simon
    Chen, Simon
    Liu, Yifan
    Shen, Chunhua
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (05) : 6480 - 6494
  • [10] Machine learning for scene 3D reconstruction using a single image
    Knyaz, Vladimir
    [J]. OPTICS, PHOTONICS AND DIGITAL TECHNOLOGIES FOR IMAGING APPLICATIONS VI, 2021, 11353