Adaptive octree 3D image reconstruction based on plane patch

被引：0

作者：

Yao C. ^{[1
,2
]}

Ma C. ^{[1
,2
]}

机构：

[1] Xi'an Institute of Optics and Precision Mechanics, Chinese Academy of Sciences, Xi'an

[2] University of Chinese Academy of Sciences, Beijing

来源：

Guangxue Jingmi Gongcheng/Optics and Precision Engineering | 2022年 / 30卷 / 09期

关键词：

Computer vision; Convolutional neural network; Neural network; Three-dimensional reconstruction;

D O I：

10.37188/OPE.20223009.1113

中图分类号：

学科分类号：

摘要：

In this study, an adaptive octree convolutional neural network based on plane patches is proposed for effective 3D shape encoding and decoding. Unlike volume-based or octree-based convolutional neural network (CNN) methods, which represent 3D shapes with the same voxel resolution, the proposed method can use planes and adaptively represent the 3D shapes of octree nodes with different levels. The patch models the 3D shape within each octree node, whereby the patch-based adaptive representation is utilized in the proposed adaptive patch octree convolutional neural network (O-CNN) encoder and decoder for the encoding and decoding of 3D shapes. The adaptive patch O-CNN encoder takes the plane patch normal and displacement as input and performs three-dimensional convolution on the octree nodes of each level, whereas the adaptive patch O-CNN decoder infers each level. The shape occupancy rate and subdivision state of the octree node as well as the best plane normal and displacement of each leaf octree node are estimated. As a general framework for 3D shape analysis and generation, adaptive patch O-CNN not only reduces memory and computational costs but also exhibits better shape generation capabilities than existing 3D-CNN methods. Shape prediction is performed using a single image to verify the efficiency and effectiveness of the generation task of the adaptive O-CNN. The chamfer distance error is 0.274, which is lower than that of OctGen (0.294), resulting in a better reconstruction effect. © 2022, Science Press. All right reserved.

引用

页码：1113 / 1122

页数：9

共 28 条

[1] FAN L L, ZHAO H W, ZHAO H Y, Et al., Survey of target detection based on deep convolutional neural networks, Opt. Precision Eng, 28, 5, pp. 1152-1164, (2020)
[2] LUN Z L, GADELHA M, KALOGERAKIS E, Et al., 3D shape reconstruction from sketches via multi-view convolutional networks, 2017 International Conference on 3D Vision (3DV), pp. 67-77, (2017)
[3] PAN X ZH, ZHANG SH Q, GUO W P., Video-based facial expression recognition using multimodal deep convolutional neural networks, Opt. Precision Eng, 27, 4, pp. 963-970, (2019)
[4] NIU Z J, LIU W, ZHAO J Y, Et al., DeepLab-based spatial feature extraction for hyperspectral image classification, IEEE Geoscience and Remote Sensing Letters, 16, 2, pp. 251-255, (2019)
[5] WU J J, ZHANG C K, XUE T F, Et al., Learning a probabilistic latent space of object shapes via 3D generative-adversarial modeling, NIPS'16: Proceedings of the 30th International Conference on Neural Information Processing Systems, pp. 82-90, (2016)
[6] WU Z R, SONG S R, KHOSLA A, Et al., 3D ShapeNets: a deep representation for volumetric shapes, 2015 IEEE Conference on Computer Vision and Pattern Recognition, 712, pp. 1912-1920, (2015)
[7] BRONSTEIN M M, BRUNA J, LECUN Y, Et al., Geometric deep learning: going beyond euclidean data, IEEE Signal Processing Magazine, 34, 4, pp. 18-42, (2017)
[8] GROUEIX T, FISHER M, KIM V G, Et al., AtlasNet: a papier-mché approach to learning 3D surface generation, (2018)
[9] KATO H, USHIKU Y, HARADA T., Neural 3D mesh renderer, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 1823, pp. 3907-3916, (2018)
[10] WU X T, YANG H, SUN X L., Image restoring method based on region selection network and its application in computational imaging, Opt. Precision Eng, 29, 4, pp. 864-876, (2021)

← 1 2 3 →