3D ShapeNets: A Deep Representation for Volumetric Shapes

Authors
Wu, Zhirong [1, 2]
Song, Shuran [1]
Khosla, Aditya [3]
Yu, Fisher [1]
Zhang, Linguang [1]
Tang, Xiaoou [2]
Xiao, Jianxiong [1]
Affiliations
[1] Princeton Univ, Princeton, NJ 08544 USA
[2] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
[3] MIT, Cambridge, MA 02139 USA
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
3D shape is a crucial but heavily underutilized cue in today's computer vision systems, mostly due to the lack of a good generic shape representation. With the recent availability of inexpensive 2.5D depth sensors (e.g., Microsoft Kinect), it is becoming increasingly important to have a powerful 3D shape representation in the loop. Apart from category recognition, recovering full 3D shapes from view-based 2.5D depth maps is also a critical part of visual understanding. To this end, we propose to represent a geometric 3D shape as a probability distribution of binary variables on a 3D voxel grid, using a Convolutional Deep Belief Network. Our model, 3D ShapeNets, learns the distribution of complex 3D shapes across different object categories and arbitrary poses from raw CAD data, and automatically discovers a hierarchical compositional part representation. It naturally supports joint object recognition and shape completion from 2.5D depth maps, and it enables active object recognition through view planning. To train our 3D deep learning model, we construct ModelNet, a large-scale 3D CAD model dataset. Extensive experiments show that our 3D deep representation enables significant performance improvement over the state of the art in a variety of tasks.
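The binary voxel-grid representation mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function names `voxelize` and `occupied_count`, the grid resolution, and the assumption that input points lie in the unit cube are all hypothetical choices made for illustration. It only shows how a shape sampled as points becomes a grid of binary occupancy variables, which is the kind of input a model over binary voxels would consume.

```python
def voxelize(points, resolution):
    """Map points in the unit cube [0, 1]^3 to a binary occupancy grid.

    Assumption (not from the source): coordinates are already normalized
    to [0, 1]; `resolution` is an illustrative parameter.
    """
    grid = [[[0] * resolution for _ in range(resolution)]
            for _ in range(resolution)]
    for x, y, z in points:
        # Clamp so that a coordinate of exactly 1.0 falls into the last cell.
        i = min(int(x * resolution), resolution - 1)
        j = min(int(y * resolution), resolution - 1)
        k = min(int(z * resolution), resolution - 1)
        grid[i][j][k] = 1  # voxel is occupied
    return grid


def occupied_count(grid):
    """Count the voxels whose binary variable is 1."""
    return sum(v for plane in grid for row in plane for v in row)


# Three sample points; the first two fall into the same voxel.
points = [(0.1, 0.1, 0.1), (0.12, 0.11, 0.1), (0.9, 0.5, 0.25)]
grid = voxelize(points, resolution=4)
print(occupied_count(grid))
```

Each cell of `grid` plays the role of one binary variable; a probabilistic model such as the one described above would assign a probability to every such configuration of zeros and ones.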
Pages: 1912-1920
Page count: 9