Adaptive octree 3D image reconstruction based on plane patch

被引:0
|
作者
Yao C. [1 ,2 ]
Ma C. [1 ,2 ]
机构
[1] Xi'an Institute of Optics and Precision Mechanics, Chinese Academy of Sciences, Xi'an
[2] University of Chinese Academy of Sciences, Beijing
关键词
Computer vision; Convolutional neural network; Neural network; Three-dimensional reconstruction;
D O I
10.37188/OPE.20223009.1113
中图分类号
学科分类号
摘要
In this study, an adaptive octree convolutional neural network based on plane patches is proposed for effective 3D shape encoding and decoding. Unlike volume-based or octree-based convolutional neural network (CNN) methods, which represent 3D shapes with the same voxel resolution, the proposed method can use planes and adaptively represent the 3D shapes of octree nodes with different levels. The patch models the 3D shape within each octree node, whereby the patch-based adaptive representation is utilized in the proposed adaptive patch octree convolutional neural network (O-CNN) encoder and decoder for the encoding and decoding of 3D shapes. The adaptive patch O-CNN encoder takes the plane patch normal and displacement as input and performs three-dimensional convolution on the octree nodes of each level, whereas the adaptive patch O-CNN decoder infers each level. The shape occupancy rate and subdivision state of the octree node as well as the best plane normal and displacement of each leaf octree node are estimated. As a general framework for 3D shape analysis and generation, adaptive patch O-CNN not only reduces memory and computational costs but also exhibits better shape generation capabilities than existing 3D-CNN methods. Shape prediction is performed using a single image to verify the efficiency and effectiveness of the generation task of the adaptive O-CNN. The chamfer distance error is 0.274, which is lower than that of OctGen (0.294), resulting in a better reconstruction effect. © 2022, Science Press. All right reserved.
引用
收藏
页码:1113 / 1122
页数:9
相关论文
共 28 条
  • [1] FAN L L, ZHAO H W, ZHAO H Y, Et al., Survey of target detection based on deep convolutional neural networks, Opt. Precision Eng, 28, 5, pp. 1152-1164, (2020)
  • [2] LUN Z L, GADELHA M, KALOGERAKIS E, Et al., 3D shape reconstruction from sketches via multi-view convolutional networks, 2017 International Conference on 3D Vision (3DV), pp. 67-77, (2017)
  • [3] PAN X ZH, ZHANG SH Q, GUO W P., Video-based facial expression recognition using multimodal deep convolutional neural networks, Opt. Precision Eng, 27, 4, pp. 963-970, (2019)
  • [4] NIU Z J, LIU W, ZHAO J Y, Et al., DeepLab-based spatial feature extraction for hyperspectral image classification, IEEE Geoscience and Remote Sensing Letters, 16, 2, pp. 251-255, (2019)
  • [5] WU J J, ZHANG C K, XUE T F, Et al., Learning a probabilistic latent space of object shapes via 3D generative-adversarial modeling, NIPS'16: Proceedings of the 30th International Conference on Neural Information Processing Systems, pp. 82-90, (2016)
  • [6] WU Z R, SONG S R, KHOSLA A, Et al., 3D ShapeNets: a deep representation for volumetric shapes, 2015 IEEE Conference on Computer Vision and Pattern Recognition, 712, pp. 1912-1920, (2015)
  • [7] BRONSTEIN M M, BRUNA J, LECUN Y, Et al., Geometric deep learning: going beyond euclidean data, IEEE Signal Processing Magazine, 34, 4, pp. 18-42, (2017)
  • [8] GROUEIX T, FISHER M, KIM V G, Et al., AtlasNet: a papier-mché approach to learning 3D surface generation, (2018)
  • [9] KATO H, USHIKU Y, HARADA T., Neural 3D mesh renderer, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 1823, pp. 3907-3916, (2018)
  • [10] WU X T, YANG H, SUN X L., Image restoring method based on region selection network and its application in computational imaging, Opt. Precision Eng, 29, 4, pp. 864-876, (2021)