O-CNN: Octree-based Convolutional Neural Networks for 3D Shape Analysis

被引:738
|
作者
Wang, Peng-Shuai [1 ,2 ]
Liu, Yang [2 ]
Guo, Yu-Xiao [2 ,3 ]
Sun, Chun-Yu [1 ,2 ]
Tong, Xin [2 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
[3] Univ Elect Sci & Technol China, Chengdu, Sichuan, Peoples R China
来源
ACM TRANSACTIONS ON GRAPHICS | 2017年 / 36卷 / 04期
关键词
octree; convolutional neural network; object classification; shape retrieval; shape segmentation;
D O I
10.1145/3072959.3073608
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We present O-CNN, an Octree-based Convolutional Neural Network (CNN) for 3D shape analysis. Built upon the octree representation of 3D shapes, our method takes the average normal vectors of a 3D model sampled in the finest leaf octants as input and performs 3D CNN operations on the octants occupied by the 3D shape surface. We design a novel octree data structure to efficiently store the octant information and CNN features into the graphics memory and execute the entire O-CNN training and evaluation on the GPU. O-CNN supports various CNN structures and works for 3D shapes in different representations. By restraining the computations on the octants occupied by 3D surfaces, the memory and computational costs of the O-CNN grow quadratically as the depth of the octree increases, which makes the 3D CNN feasible for high-resolution 3D models. We compare the performance of the O-CNN with other existing 3D CNN solutions and demonstrate the efficiency and efficacy of O-CNN in three shape analysis tasks, including object classification, shape retrieval, and shape segmentation.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Octree-based language and optimization algorithm for 3D-packing
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao, 1 (67):
  • [32] H-CNN: Spatial Hashing Based CNN for 3D Shape Analysis
    Shao, Tianjia
    Yang, Yin
    Weng, Yanlin
    Hou, Qiming
    Zhou, Kun
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2020, 26 (07) : 2403 - 2416
  • [33] GVCNN: Group-View Convolutional Neural Networks for 3D Shape Recognition
    Feng, Yifan
    Zhang, Zizhao
    Zhao, Xibin
    Ji, Rongrong
    Gao, Yue
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 264 - 272
  • [34] Analysis on Temporal Dimension of Inputs for 3D Convolutional Neural Networks
    Koepueklue, Okan
    Rigoll, Gerhard
    2018 IEEE THIRD INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, APPLICATIONS AND SYSTEMS (IPAS), 2018, : 79 - 84
  • [35] Training deep convolutional neural networks to acquire the best view of a 3D shape
    Zhou, Wen
    Jia, Jinyuan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (1-2) : 581 - 601
  • [36] Directionally Convolutional Networks for 3D Shape Segmentation
    Xu, Haotian
    Dong, Ming
    Zhong, Zichun
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2717 - 2726
  • [37] Large-Scale Shape Retrieval with Sparse 3D Convolutional Neural Networks
    Notchenko, Alexandr
    Kapushev, Yermek
    Burnaev, Evgeny
    ANALYSIS OF IMAGES, SOCIAL NETWORKS AND TEXTS, AIST 2017, 2018, 10716 : 245 - 254
  • [38] Balanced principal component for 3D shape recognition using convolutional neural networks
    Luo, Wenjie
    Zhang, Han
    Ni, Peng
    Tian, Xuedong
    IET IMAGE PROCESSING, 2020, 14 (17) : 4468 - 4476
  • [39] Training deep convolutional neural networks to acquire the best view of a 3D shape
    Wen Zhou
    Jinyuan Jia
    Multimedia Tools and Applications, 2020, 79 : 581 - 601
  • [40] 3D Shape Segmentation with Projective Convolutional Networks
    Kalogerakis, Evangelos
    Averkiou, Melinos
    Maji, Subhransu
    Chaudhuri, Siddhartha
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6630 - 6639