Adaptive octree 3D image reconstruction based on plane patch

被引：0

作者：

Yao C. ^{[1
,2
]}

Ma C. ^{[1
,2
]}

机构：

[1] Xi'an Institute of Optics and Precision Mechanics, Chinese Academy of Sciences, Xi'an

[2] University of Chinese Academy of Sciences, Beijing

来源：

Guangxue Jingmi Gongcheng/Optics and Precision Engineering | 2022年 / 30卷 / 09期

关键词：

Computer vision; Convolutional neural network; Neural network; Three-dimensional reconstruction;

D O I：

10.37188/OPE.20223009.1113

中图分类号：

学科分类号：

摘要：

In this study, an adaptive octree convolutional neural network based on plane patches is proposed for effective 3D shape encoding and decoding. Unlike volume-based or octree-based convolutional neural network (CNN) methods, which represent 3D shapes with the same voxel resolution, the proposed method can use planes and adaptively represent the 3D shapes of octree nodes with different levels. The patch models the 3D shape within each octree node, whereby the patch-based adaptive representation is utilized in the proposed adaptive patch octree convolutional neural network (O-CNN) encoder and decoder for the encoding and decoding of 3D shapes. The adaptive patch O-CNN encoder takes the plane patch normal and displacement as input and performs three-dimensional convolution on the octree nodes of each level, whereas the adaptive patch O-CNN decoder infers each level. The shape occupancy rate and subdivision state of the octree node as well as the best plane normal and displacement of each leaf octree node are estimated. As a general framework for 3D shape analysis and generation, adaptive patch O-CNN not only reduces memory and computational costs but also exhibits better shape generation capabilities than existing 3D-CNN methods. Shape prediction is performed using a single image to verify the efficiency and effectiveness of the generation task of the adaptive O-CNN. The chamfer distance error is 0.274, which is lower than that of OctGen (0.294), resulting in a better reconstruction effect. © 2022, Science Press. All right reserved.

引用

页码：1113 / 1122

页数：9

共 28 条

[11] ATTENE M, CAMPEN M, KOBBELT L., Polygon mesh repairing, ACM Computing Surveys, 45, 2, pp. 1-33, (2013)
[12] BOSCAINI D, MASCI J, MELZI S, Et al., Learning class-specific descriptors for deformable shapes using localized spectral convolutional networks, Computer Graphics Forum, 34, 5, pp. 13-23, (2015)
[13] BROCK A, LIM T, RITCHIE J M, Et al., Generative and discriminative voxel modeling with convolutional neural networks, (2016)
[14] CHOY C B, XU D F, GWAK J, Et al., 3D-R2N2: a unified approach for single and multi-view 3D object reconstruction, Computer Vision-ECCV, 2016, pp. 628-644, (2016)
[15] GRAHAM B., Sparse 3D convolutional neural networks, Proceedings ofthe British Machine Vision Conference 2015, pp. 1-1509, (2015)
[16] HANE C, TULSIANI S, MALIK J., Hierarchical surface prediction, IEEE Transactions on Pattern Analysis and Machine Intelligence, 42, 6, pp. 1348-1361, (2020)
[17] FUHRMANN S, GOESELE M., Floating scale surface reconstruction, ACM Transactions on Graphics, 33, 4, pp. 1-11, (2014)
[18] DENG J, DONG W, SOCHER R, Et al., ImageNet: a large-scale hierarchical image database[C], 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2025, pp. 248-255
[19] CHARLES R Q, HAO S, MO K C, Et al., PointNet: deep learning on point sets for 3D classification and segmentation[C], 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2126, pp. 77-85, (2017)
[20] ACHLIOPTAS P, DIAMANTI O, MITLIAGKAS I, Et al., Learning representations and generative models for 3D point clouds, ICML, (2017)

← 1 2 3 →