O-CNN: Octree-based Convolutional Neural Networks for 3D Shape Analysis

被引：738

作者：

Wang, Peng-Shuai ^{[1
,2
]}

Liu, Yang ^{[2
]}

Guo, Yu-Xiao ^{[2
,3
]}

Sun, Chun-Yu ^{[1
,2
]}

Tong, Xin ^{[2
]}

机构：

[1] Tsinghua Univ, Beijing, Peoples R China

[2] Microsoft Res Asia, Beijing, Peoples R China

[3] Univ Elect Sci & Technol China, Chengdu, Sichuan, Peoples R China

来源：

ACM TRANSACTIONS ON GRAPHICS | 2017年 / 36卷 / 04期

关键词：

octree; convolutional neural network; object classification; shape retrieval; shape segmentation;

D O I：

10.1145/3072959.3073608

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

We present O-CNN, an Octree-based Convolutional Neural Network (CNN) for 3D shape analysis. Built upon the octree representation of 3D shapes, our method takes the average normal vectors of a 3D model sampled in the finest leaf octants as input and performs 3D CNN operations on the octants occupied by the 3D shape surface. We design a novel octree data structure to efficiently store the octant information and CNN features into the graphics memory and execute the entire O-CNN training and evaluation on the GPU. O-CNN supports various CNN structures and works for 3D shapes in different representations. By restraining the computations on the octants occupied by 3D surfaces, the memory and computational costs of the O-CNN grow quadratically as the depth of the octree increases, which makes the 3D CNN feasible for high-resolution 3D models. We compare the performance of the O-CNN with other existing 3D CNN solutions and demonstrate the efficiency and efficacy of O-CNN in three shape analysis tasks, including object classification, shape retrieval, and shape segmentation.

引用

页数：11

共 50 条

[21] Similarity Analysis of 3D Models Based on Convolutional Neural Networks with Threshold
Qin, Shengwei
Li, Zhong
Chen, Zihao
PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING (ICVIP 2018), 2018, : 95 - 102
[22] 3D Semantic Mapping Based on Convolutional Neural Networks
Li, Jing
Liu, Yanyu
Wang, Junzheng
Yan, Min
Yao, Yanzhi
2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 9303 - 9308
[23] Multi-view Convolutional Neural Networks for 3D Shape Recognition
Su, Hang
Maji, Subhransu
Kalogerakis, Evangelos
Learned-Miller, Erik
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 945 - 953
[24] Web3D Learning Framework for 3D Shape Retrieval Based on Hybrid Convolutional Neural Networks
Wen Zhou
Jinyuan Jia
Chengxi Huang
Yongqing Cheng
Tsinghua Science and Technology, 2020, 25 (01) : 93 - 102
[25] Web3D Learning Framework for 3D Shape Retrieval Based on Hybrid Convolutional Neural Networks
Zhou, Wen
Jia, Jinyuan
Huang, Chengxi
Cheng, Yongqing
TSINGHUA SCIENCE AND TECHNOLOGY, 2020, 25 (01) : 93 - 102
[26] ParallelNN: A Parallel Octree-based Nearest Neighbor Search Accelerator for 3D Point Clouds
Chen, Faquan
Ying, Rendong
Xue, Jianwei
Wen, Fei
Liu, Peilin
2023 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, HPCA, 2023, : 403 - 414
[27] Voxel-Based 3D Shape Segmentation Using Deep Volumetric Convolutional Neural Networks
Liu, Yuqi
Long, Wei
Shu, Zhenyu
Yi, Shun
Xin, Shiqing
ADVANCES IN COMPUTER GRAPHICS, CGI 2022, 2022, 13443 : 489 - 500
[28] Neural 3D Morphable Models: Spiral Convolutional Networks for 3D Shape Representation Learning and Generation
Bouritsas, Giorgos
Bokhnyak, Sergiy
Ploumpis, Stylianos
Bronstein, Michael
Zafeiriou, Stefanos
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7212 - 7221
[29] 3D object understanding with 3D Convolutional Neural Networks
Leng, Biao
Liu, Yu
Yu, Kai
Zhang, Xiangyang
Xiong, Zhang
INFORMATION SCIENCES, 2016, 366 : 188 - 201
[30] Octree-Based 3D Logic and Computation of Spatial Relationships in Live Video Query Processing
Ye, Jun
Hua, Kien A.
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2014, 11 (02)

← 1 2 3 4 5 →