Recovering 3D Planes from a Single Image via Convolutional Neural Networks

Cited by: 47
Authors
Yang, Fengting [1]
Zhou, Zihan [1]
Affiliation
[1] Penn State Univ, University Pk, PA 16802 USA
Source
Keywords
3D reconstruction; Plane segmentation; Deep learning
DOI
10.1007/978-3-030-01249-6_6
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we study the problem of recovering 3D planar surfaces from a single image of a man-made environment. We show that it is possible to directly train a deep neural network to achieve this goal. A novel plane structure-induced loss is proposed to train the network to simultaneously predict a plane segmentation map and the parameters of the 3D planes. Further, to avoid the tedious manual labeling process, we show how to leverage an existing large-scale RGB-D dataset to train our network without explicit 3D plane annotations, and how to take advantage of the semantic labels that come with the dataset for accurate planar and non-planar classification. Experimental results demonstrate that our method significantly outperforms existing methods, both qualitatively and quantitatively. The recovered planes could potentially benefit many important visual tasks such as vision-based navigation and human-robot interaction.
Pages: 87-103
Number of pages: 17