O-CNN: Octree-based Convolutional Neural Networks for 3D Shape Analysis

Cited by: 738
Authors
Wang, Peng-Shuai [1, 2]
Liu, Yang [2]
Guo, Yu-Xiao [2, 3]
Sun, Chun-Yu [1, 2]
Tong, Xin [2]
Affiliations
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
[3] Univ Elect Sci & Technol China, Chengdu, Sichuan, Peoples R China
Source
ACM TRANSACTIONS ON GRAPHICS | 2017 / Vol. 36 / No. 04
Keywords
octree; convolutional neural network; object classification; shape retrieval; shape segmentation;
DOI
10.1145/3072959.3073608
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202; 0835;
Abstract
We present O-CNN, an Octree-based Convolutional Neural Network (CNN) for 3D shape analysis. Built upon the octree representation of 3D shapes, our method takes the average normal vectors of a 3D model sampled in the finest leaf octants as input and performs 3D CNN operations on the octants occupied by the 3D shape surface. We design a novel octree data structure to efficiently store the octant information and CNN features into the graphics memory and execute the entire O-CNN training and evaluation on the GPU. O-CNN supports various CNN structures and works for 3D shapes in different representations. By restraining the computations on the octants occupied by 3D surfaces, the memory and computational costs of the O-CNN grow quadratically as the depth of the octree increases, which makes the 3D CNN feasible for high-resolution 3D models. We compare the performance of the O-CNN with other existing 3D CNN solutions and demonstrate the efficiency and efficacy of O-CNN in three shape analysis tasks, including object classification, shape retrieval, and shape segmentation.
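The cost claim in the abstract can be illustrated with a small counting experiment. The sketch below is not the authors' GPU octree; it is a minimal stand-in that assumes a sphere sampled as a point cloud and a plain Python set of integer cell coordinates in place of the octree. The names `sample_sphere` and `occupied_octants` are hypothetical. It counts the cells at each depth that contain surface samples and compares that count with the size of the full voxel grid, which is 8^depth. If the surface-restricted cost grows quadratically with grid resolution, the occupied count should grow by roughly a factor of 4 per added level, whereas the dense grid grows by a factor of 8.

```python
import math
import random


def sample_sphere(n, seed=0):
    """Sample n points on a sphere of radius 0.4 centred in the unit cube."""
    rng = random.Random(seed)
    points = []
    for _ in range(n):
        # A normalised Gaussian vector gives a uniformly distributed direction.
        x, y, z = (rng.gauss(0.0, 1.0) for _ in range(3))
        r = math.sqrt(x * x + y * y + z * z)
        points.append((0.5 + 0.4 * x / r, 0.5 + 0.4 * y / r, 0.5 + 0.4 * z / r))
    return points


def occupied_octants(points, depth):
    """Return the set of cells at the given depth that contain a surface sample."""
    res = 1 << depth  # cells per axis at this depth
    return {
        (min(int(x * res), res - 1),
         min(int(y * res), res - 1),
         min(int(z * res), res - 1))
        for x, y, z in points
    }


# Assumption: 200,000 samples are dense enough to hit most surface cells
# up to depth 6 (a 64^3 grid).
points = sample_sphere(200_000)
counts = {d: len(occupied_octants(points, d)) for d in range(3, 7)}

for d in sorted(counts):
    full = 8 ** d
    print(f"depth {d}: occupied {counts[d]:6d}  full grid {full:7d}  "
          f"fraction {counts[d] / full:.4f}")
```

The occupied fraction shrinks as the depth increases, which is why restricting storage and convolution to surface octants becomes more valuable at higher resolutions. The exact counts depend on the sampling density and the shape, so the printed numbers should be read as indicative only.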
Pages: 11