MVTN: Multi-View Transformation Network for 3D Shape Recognition

被引:87
|
作者
Hamdi, Abdullah [1 ]
Giancola, Silvio [1 ]
Ghanem, Bernard [1 ]
机构
[1] King Abdullah Univ Sci & Technol KAUST, Thuwal, Saudi Arabia
关键词
NEURAL-NETWORK;
D O I
10.1109/ICCV48922.2021.00007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-view projection methods have demonstrated their ability to reach state-of-the-art performance on 3D shape recognition. Those methods learn different ways to aggregate information from multiple views. However, the camera view-points for those views tend to be heuristically set and fixed for all shapes. To circumvent the lack of dynamism of current multi-view methods, we propose to learn those viewpoints. In particular, we introduce the Multi-View Transformation Network (MVTN) that regresses optimal view-points for 3D shape recognition, building upon advances in differentiable rendering. As a result, MVTN can be trained end-to-end along with any multi-view network for 3D shape classification. We integrate MVTN in a novel adaptive multi-view pipeline that can render either 3D meshes or point clouds. MVTN exhibits clear performance gains in the tasks of 3D shape classification and 3D shape retrieval without the need for extra training supervision. In these tasks, MVTN achieves state-of-the-art performance on ModelNet40, ShapeNet Core55, and the most recent and realistic ScanObjectNN dataset (up to 6% improvement). Interestingly, we also show that MVTN can provide network robustness against rotation and occlusion in the 3D domain. The code is available at https://github.com/ajhamdi/MVTN.
引用
收藏
页码:1 / 11
页数:11
相关论文
共 50 条
  • [21] MHFP: Multi-view based hierarchical fusion pooling method for 3D shape recognition
    Liang, Qi
    Li, Qiang
    Zhang, Lihu
    Mi, Haixiao
    Nie, Weizhi
    Li, Xuanya
    [J]. PATTERN RECOGNITION LETTERS, 2021, 150 : 214 - 220
  • [22] Multi-view Manhole Detection, Recognition, and 3D Localisation
    Timofte, Radu
    Van Gool, Luc
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
  • [23] Learning Relationships for Multi-View 3D Object Recognition
    Yang, Ze
    Wang, Liwei
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7504 - 7513
  • [24] VFMVAC: View-filtering-based multi-view aggregating convolution for 3D shape recognition and retrieval
    Liu, Zehua
    Zhang, Yuhe
    Gao, Jian
    Wang, Shurui
    [J]. PATTERN RECOGNITION, 2022, 129
  • [25] AirPose: Multi-View Fusion Network for Aeria 3D Human Pose and Shape Estimation
    Saini, Nitin
    Bonetto, Elia
    Price, Eric
    Ahmad, Aamir
    Black, Michael J.
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 4805 - 4812
  • [26] Walk in Views: Multi-view Path Aggregation Graph Network for 3D Shape Analysis
    Xu, Lixiang
    Cui, Qingzhe
    Xu, Wei
    Chen, Enhong
    Tong, He
    Tang, Yuanyan
    [J]. INFORMATION FUSION, 2024, 103
  • [27] Drcnn: Dynamic routing convolutional neural network for multi-view 3d object recognition
    Sun, Kai
    Zhang, Jiangshe
    Liu, Junmin
    Yu, Ruixuan
    Song, Zengjie
    [J]. IEEE Transactions on Image Processing, 2021, 30 : 868 - 877
  • [28] DRCNN: Dynamic Routing Convolutional Neural Network for Multi-View 3D Object Recognition
    Sun, Kai
    Zhang, Jiangshe
    Liu, Junmin
    Yu, Ruixuan
    Song, Zengjie
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 868 - 877
  • [29] Multi-view Fusion with Deep Learning for 3D Shape Classification
    Huang, Xiang
    Wang, Mantao
    Zhang, Dejun
    Zhu, Yu
    Zou, Lu
    Sun, Jun
    Han, Fei
    He, Linchao
    [J]. 2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2018, : 189 - 194
  • [30] Emphasizing 3D Properties in Recurrent Multi-View Aggregation for 3D Shape Retrieval
    Xu, Cheng
    Leng, Biao
    Zhang, Cheng
    Zhou, Xiaochen
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7428 - 7435