Fully Convolutional Denoising Autoencoder for 3D Scene Reconstruction from a single depth image

被引：0

作者：

Palla, Alessandro ^{[1
]}

Moloney, David ^{[1
]}

Fanucci, Luca ^{[2
]}

机构：

[1] Intel, Movidius, Comp Vis & Machine Learing Grp, Dublin, Ireland

[2] Univ Pisa, Dept Informat Engn, Pisa, Italy

来源：

2017 4TH INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI) | 2017年

关键词：

AI; CNN; Voxel; Point Cloud; Scene Reconstruction; Deep Learning; Autoencoder; Neural Network;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

In this work, we propose a 3D scene reconstruction algorithm based on a fully convolutional 3D denoising autoen-coder neural network. The network is capable of reconstructing a full scene from a single depth image by creating a 3D representation of it and automatically filling holes and inserting hidden elements. We exploit the fact that our neural network is capable of generalizing object shapes by inferring similarities in geometry. Our fully convolutional architecture enables the network to be unconstrained by a fixed 3D shape, and so it is capable of successfully reconstructing arbitrary scene sizes. Our algorithm was evaluated on a real word dataset of tabletop scenes acquired using a Kinect and processed using KinectFusion software in order to obtain ground truth for network training and evaluation. Extensive measurements show that our deep neural network architecture outperforms the previous state of the art both in terms of precision and recall for the scene reconstruction task. The network has been broadly profiled in terms of memory footprint, number of floating point operations, inference time and power consumption in CPU, GPU and embedded devices. Its small memory footprint and its low computation requirements enable low power, memory constrained, real time always-on embedded applications such as autonomous vehicles, warehouse robots, interactive gaming controllers and drones.

引用

页码：566 / 575

页数：10

共 50 条

[1] Face Denoising and 3D Reconstruction from A Single Depth Image
Zhong, Yicheng
Pei, Yuru
Li, Peixin
Guo, Yuke
Ma, Gengyu
Liu, Meng
Bai, Wei
Wu, WenHai
Zha, Hongbin
[J]. 2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020), 2020, : 117 - 124
[2] Panoptic 3D Scene Reconstruction From a Single RGB Image
Dahnert, Manuel
Hou, Ji
Niessner, Matthias
Dai, Angela
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[3] Learning 3D Scene Semantics and Structure from a Single Depth Image
Yang, Bo
Lai, Zihang
Lu, Xiaoxuan
Lin, Shuyu
Wen, Hongkai
Markham, Andrew
Trigoni, Niki
[J]. PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 422 - 425
[4] Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image
Huang, Siyuan
Qi, Siyuan
Zhu, Yixin
Xiao, Yinxue
Xu, Yuanlu
Zhu, Song-Chun
[J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 194 - 211
[5] Stage-Based 3D Scene Reconstruction from Single Image
Liu, Yixian
Hao, Pengwei
Izquierdo, Ebroul
[J]. 2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 1034 - 1037
[6] Fused voxel autoencoder for single image to 3D object reconstruction
Turhan, C. Guzel
Bilge, H. S.
[J]. ELECTRONICS LETTERS, 2020, 56 (03) : 134 - 136
[7] An Unsupervised Approach for 3D Face Reconstruction from a Single Depth Image
Li, Peixin
Pei, Yuru
Zhong, Yicheng
Guo, Yuke
Ma, Gengyu
Liu, Meng
Bai, Wei
Wu, Wenhai
Zha, Hongbin
[J]. ADVANCES IN COMPUTER GRAPHICS, CGI 2020, 2020, 12221 : 206 - 219
[8] CGAN-Based Forest Scene 3D Reconstruction from a Single Image
Li, Yuan
Kan, Jiangming
[J]. FORESTS, 2024, 15 (01):
[9] Towards Accurate Reconstruction of 3D Scene Shape From A Single Monocular Image
Yin, Wei
Zhang, Jianming
Wang, Oliver
Niklaus, Simon
Chen, Simon
Liu, Yifan
Shen, Chunhua
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (05) : 6480 - 6494
[10] Machine learning for scene 3D reconstruction using a single image
Knyaz, Vladimir
[J]. OPTICS, PHOTONICS AND DIGITAL TECHNOLOGIES FOR IMAGING APPLICATIONS VI, 2021, 11353

← 1 2 3 4 5 →