Adaptive octree 3D image reconstruction based on plane patch

被引:0
|
作者
Yao C. [1 ,2 ]
Ma C. [1 ,2 ]
机构
[1] Xi'an Institute of Optics and Precision Mechanics, Chinese Academy of Sciences, Xi'an
[2] University of Chinese Academy of Sciences, Beijing
关键词
Computer vision; Convolutional neural network; Neural network; Three-dimensional reconstruction;
D O I
10.37188/OPE.20223009.1113
中图分类号
学科分类号
摘要
In this study, an adaptive octree convolutional neural network based on plane patches is proposed for effective 3D shape encoding and decoding. Unlike volume-based or octree-based convolutional neural network (CNN) methods, which represent 3D shapes with the same voxel resolution, the proposed method can use planes and adaptively represent the 3D shapes of octree nodes with different levels. The patch models the 3D shape within each octree node, whereby the patch-based adaptive representation is utilized in the proposed adaptive patch octree convolutional neural network (O-CNN) encoder and decoder for the encoding and decoding of 3D shapes. The adaptive patch O-CNN encoder takes the plane patch normal and displacement as input and performs three-dimensional convolution on the octree nodes of each level, whereas the adaptive patch O-CNN decoder infers each level. The shape occupancy rate and subdivision state of the octree node as well as the best plane normal and displacement of each leaf octree node are estimated. As a general framework for 3D shape analysis and generation, adaptive patch O-CNN not only reduces memory and computational costs but also exhibits better shape generation capabilities than existing 3D-CNN methods. Shape prediction is performed using a single image to verify the efficiency and effectiveness of the generation task of the adaptive O-CNN. The chamfer distance error is 0.274, which is lower than that of OctGen (0.294), resulting in a better reconstruction effect. © 2022, Science Press. All right reserved.
引用
收藏
页码:1113 / 1122
页数:9
相关论文
共 28 条
  • [11] ATTENE M, CAMPEN M, KOBBELT L., Polygon mesh repairing, ACM Computing Surveys, 45, 2, pp. 1-33, (2013)
  • [12] BOSCAINI D, MASCI J, MELZI S, Et al., Learning class-specific descriptors for deformable shapes using localized spectral convolutional networks, Computer Graphics Forum, 34, 5, pp. 13-23, (2015)
  • [13] BROCK A, LIM T, RITCHIE J M, Et al., Generative and discriminative voxel modeling with convolutional neural networks, (2016)
  • [14] CHOY C B, XU D F, GWAK J, Et al., 3D-R2N2: a unified approach for single and multi-view 3D object reconstruction, Computer Vision-ECCV, 2016, pp. 628-644, (2016)
  • [15] GRAHAM B., Sparse 3D convolutional neural networks, Proceedings ofthe British Machine Vision Conference 2015, pp. 1-1509, (2015)
  • [16] HANE C, TULSIANI S, MALIK J., Hierarchical surface prediction, IEEE Transactions on Pattern Analysis and Machine Intelligence, 42, 6, pp. 1348-1361, (2020)
  • [17] FUHRMANN S, GOESELE M., Floating scale surface reconstruction, ACM Transactions on Graphics, 33, 4, pp. 1-11, (2014)
  • [18] DENG J, DONG W, SOCHER R, Et al., ImageNet: a large-scale hierarchical image database[C], 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2025, pp. 248-255
  • [19] CHARLES R Q, HAO S, MO K C, Et al., PointNet: deep learning on point sets for 3D classification and segmentation[C], 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2126, pp. 77-85, (2017)
  • [20] ACHLIOPTAS P, DIAMANTI O, MITLIAGKAS I, Et al., Learning representations and generative models for 3D point clouds, ICML, (2017)