O-CNN: Octree-based Convolutional Neural Networks for 3D Shape Analysis

被引：738

作者：

Wang, Peng-Shuai ^{[1
,2
]}

Liu, Yang ^{[2
]}

Guo, Yu-Xiao ^{[2
,3
]}

Sun, Chun-Yu ^{[1
,2
]}

Tong, Xin ^{[2
]}

机构：

[1] Tsinghua Univ, Beijing, Peoples R China

[2] Microsoft Res Asia, Beijing, Peoples R China

[3] Univ Elect Sci & Technol China, Chengdu, Sichuan, Peoples R China

来源：

ACM TRANSACTIONS ON GRAPHICS | 2017年 / 36卷 / 04期

关键词：

octree; convolutional neural network; object classification; shape retrieval; shape segmentation;

D O I：

10.1145/3072959.3073608

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

We present O-CNN, an Octree-based Convolutional Neural Network (CNN) for 3D shape analysis. Built upon the octree representation of 3D shapes, our method takes the average normal vectors of a 3D model sampled in the finest leaf octants as input and performs 3D CNN operations on the octants occupied by the 3D shape surface. We design a novel octree data structure to efficiently store the octant information and CNN features into the graphics memory and execute the entire O-CNN training and evaluation on the GPU. O-CNN supports various CNN structures and works for 3D shapes in different representations. By restraining the computations on the octants occupied by 3D surfaces, the memory and computational costs of the O-CNN grow quadratically as the depth of the octree increases, which makes the 3D CNN feasible for high-resolution 3D models. We compare the performance of the O-CNN with other existing 3D CNN solutions and demonstrate the efficiency and efficacy of O-CNN in three shape analysis tasks, including object classification, shape retrieval, and shape segmentation.

引用

页数：11

共 50 条

[31] Octree-based language and optimization algorithm for 3D-packing
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao, 1 (67):
[32] H-CNN: Spatial Hashing Based CNN for 3D Shape Analysis
Shao, Tianjia
Yang, Yin
Weng, Yanlin
Hou, Qiming
Zhou, Kun
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2020, 26 (07) : 2403 - 2416
[33] GVCNN: Group-View Convolutional Neural Networks for 3D Shape Recognition
Feng, Yifan
Zhang, Zizhao
Zhao, Xibin
Ji, Rongrong
Gao, Yue
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 264 - 272
[34] Analysis on Temporal Dimension of Inputs for 3D Convolutional Neural Networks
Koepueklue, Okan
Rigoll, Gerhard
2018 IEEE THIRD INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, APPLICATIONS AND SYSTEMS (IPAS), 2018, : 79 - 84
[35] Training deep convolutional neural networks to acquire the best view of a 3D shape
Zhou, Wen
Jia, Jinyuan
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (1-2) : 581 - 601
[36] Directionally Convolutional Networks for 3D Shape Segmentation
Xu, Haotian
Dong, Ming
Zhong, Zichun
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2717 - 2726
[37] Large-Scale Shape Retrieval with Sparse 3D Convolutional Neural Networks
Notchenko, Alexandr
Kapushev, Yermek
Burnaev, Evgeny
ANALYSIS OF IMAGES, SOCIAL NETWORKS AND TEXTS, AIST 2017, 2018, 10716 : 245 - 254
[38] Balanced principal component for 3D shape recognition using convolutional neural networks
Luo, Wenjie
Zhang, Han
Ni, Peng
Tian, Xuedong
IET IMAGE PROCESSING, 2020, 14 (17) : 4468 - 4476
[39] Training deep convolutional neural networks to acquire the best view of a 3D shape
Wen Zhou
Jinyuan Jia
Multimedia Tools and Applications, 2020, 79 : 581 - 601
[40] 3D Shape Segmentation with Projective Convolutional Networks
Kalogerakis, Evangelos
Averkiou, Melinos
Maji, Subhransu
Chaudhuri, Siddhartha
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6630 - 6639

← 1 2 3 4 5 →