O-CNN: Octree-based Convolutional Neural Networks for 3D Shape Analysis

Cited by: 738
Authors
Wang, Peng-Shuai [1, 2]
Liu, Yang [2]
Guo, Yu-Xiao [2, 3]
Sun, Chun-Yu [1, 2]
Tong, Xin [2]
Affiliations
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
[3] Univ Elect Sci & Technol China, Chengdu, Sichuan, Peoples R China
Source
ACM TRANSACTIONS ON GRAPHICS | 2017 / Vol. 36 / No. 04
Keywords
octree; convolutional neural network; object classification; shape retrieval; shape segmentation;
DOI
10.1145/3072959.3073608
Chinese Library Classification (CLC) number
TP31 [Computer Software];
Discipline classification code
081202 ; 0835 ;
Abstract
We present O-CNN, an Octree-based Convolutional Neural Network (CNN) for 3D shape analysis. Built upon the octree representation of 3D shapes, our method takes the average normal vectors of a 3D model sampled in the finest leaf octants as input and performs 3D CNN operations on the octants occupied by the 3D shape surface. We design a novel octree data structure to efficiently store the octant information and CNN features into the graphics memory and execute the entire O-CNN training and evaluation on the GPU. O-CNN supports various CNN structures and works for 3D shapes in different representations. By restraining the computations on the octants occupied by 3D surfaces, the memory and computational costs of the O-CNN grow quadratically as the depth of the octree increases, which makes the 3D CNN feasible for high-resolution 3D models. We compare the performance of the O-CNN with other existing 3D CNN solutions and demonstrate the efficiency and efficacy of O-CNN in three shape analysis tasks, including object classification, shape retrieval, and shape segmentation.
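The two ideas the abstract names, averaging normals inside the finest leaf octants and working only on octants the surface occupies, can be illustrated with a small sketch. This is not the authors' GPU data structure. The function names, the unit-cube normalisation, and the planar test surface are all assumptions made for illustration.

```python
from collections import defaultdict


def octant_key(point, depth):
    """Integer (i, j, k) index of the depth-`depth` octant containing a point in [0, 1]^3."""
    res = 1 << depth  # octants per axis at this depth
    # Clamp so that a coordinate of exactly 1.0 stays inside the grid.
    return tuple(min(int(c * res), res - 1) for c in point)


def average_normals_per_octant(points, normals, depth):
    """Average the sample normals falling in each occupied octant (hypothetical helper)."""
    sums = defaultdict(lambda: [0.0, 0.0, 0.0])
    counts = defaultdict(int)
    for p, n in zip(points, normals):
        key = octant_key(p, depth)
        counts[key] += 1
        for axis in range(3):
            sums[key][axis] += n[axis]
    # Only occupied octants appear as keys; empty space is never stored.
    return {k: tuple(s / counts[k] for s in sums[k]) for k in sums}


def occupied_octants(points, depth):
    """Number of distinct octants at `depth` that contain at least one sample."""
    return len({octant_key(p, depth) for p in points})


# Assumed test surface: a 64 x 64 grid of samples on the plane z = 0.5, normals along +z.
points = [((i + 0.5) / 64, (j + 0.5) / 64, 0.5) for i in range(64) for j in range(64)]
normals = [(0.0, 0.0, 1.0)] * len(points)

for depth in range(1, 6):
    print(depth, occupied_octants(points, depth), 8 ** depth)
```

For this planar example the number of occupied octants at depth d is 4^d, the square of the per-axis resolution 2^d, whereas a dense voxel grid at the same depth has 8^d cells. Real shapes will differ in the constant, but the sketch shows why restricting computation to surface octants is cheaper than a full grid.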
Pages: 11
Related papers
50 in total
  • [41] 3D Object Detection Based on Convolutional Neural Networks: A Survey
    Wang Y.
    Tian Y.
    Li G.
    Wang K.
    Li D.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2021, 34 (12): 1103 - 1119
  • [42] 3D CONVOLUTIONAL NEURAL NETWORKS BASED SPEAKER IDENTIFICATION AND AUTHENTICATION
    Liao, Jianguo
    Wang, Shilin
    Zhang, Xingxuan
    Liu, Gongshen
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2042 - 2046
  • [43] 3D Object Classification Based on Multi Convolutional Neural Networks
    Lu, Mei-qi
    Li, Wei
    Ning, Ya-guang
    INTERNATIONAL CONFERENCE ON APPLIED MECHANICS AND MECHANICAL AUTOMATION (AMMA 2017), 2017, : 204 - 208
  • [44] Sign Language Recognition Based on 3D Convolutional Neural Networks
    Ramos Neto, Geovane M.
    Braz Junior, Geraldo
    Sousa de Almeida, Joao Dallyson
    de Paiva, Anselmo Cardoso
    IMAGE ANALYSIS AND RECOGNITION (ICIAR 2018), 2018, 10882 : 399 - 407
  • [45] Lung Nodule Detection Based on 3D Convolutional Neural Networks
    Fan, Lei
    Xia, Zhaoqiang
    Zhang, Xiaobiao
    Feng, Xiaoyi
    2017 INTERNATIONAL CONFERENCE ON THE FRONTIERS AND ADVANCES IN DATA SCIENCE (FADS), 2017, : 7 - 10
  • [46] ExMeshCNN: An Explainable Convolutional Neural Network Architecture for 3D Shape Analysis
    Kim, Seonggyeom
    Chae, Dong-Kyu
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 795 - 803
  • [47] 3D shape segmentation via shape fully convolutional networks
    Wang, Pengyu
    Gan, Yuan
    Shui, Panpan
    Yu, Fenggen
    Zhang, Yan
    Chen, Songle
    Sun, Zhengxing
    COMPUTERS & GRAPHICS-UK, 2018, 70 : 128 - 139
  • [48] 3D shape segmentation via shape fully convolutional networks
    Wang, Pengyu
    Gan, Yuan
    Shui, Panpan
    Yu, Fenggen
    Zhang, Yan
    Chen, Songle
    Sun, Zhengxing
    COMPUTERS & GRAPHICS-UK, 2018, 76 : 182 - 192
  • [49] A Sparse Voxel Octree-Based Framework for Computing Solar Radiation Using 3D City Models
    Liang, Jianming
    Gong, Jianhua
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2017, 6 (04)
  • [50] 3D CONVOLUTIONAL NEURAL NETWORKS BY MODAL FUSION
    Yoshiyasu, Yusuke
    Yoshida, Eiichi
    Pirk, Soeren
    Guibas, Leonidas
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1777 - 1781