MVTN: Multi-View Transformation Network for 3D Shape Recognition

被引:87
|
作者
Hamdi, Abdullah [1 ]
Giancola, Silvio [1 ]
Ghanem, Bernard [1 ]
机构
[1] King Abdullah Univ Sci & Technol KAUST, Thuwal, Saudi Arabia
关键词
NEURAL-NETWORK;
D O I
10.1109/ICCV48922.2021.00007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-view projection methods have demonstrated their ability to reach state-of-the-art performance on 3D shape recognition. Those methods learn different ways to aggregate information from multiple views. However, the camera view-points for those views tend to be heuristically set and fixed for all shapes. To circumvent the lack of dynamism of current multi-view methods, we propose to learn those viewpoints. In particular, we introduce the Multi-View Transformation Network (MVTN) that regresses optimal view-points for 3D shape recognition, building upon advances in differentiable rendering. As a result, MVTN can be trained end-to-end along with any multi-view network for 3D shape classification. We integrate MVTN in a novel adaptive multi-view pipeline that can render either 3D meshes or point clouds. MVTN exhibits clear performance gains in the tasks of 3D shape classification and 3D shape retrieval without the need for extra training supervision. In these tasks, MVTN achieves state-of-the-art performance on ModelNet40, ShapeNet Core55, and the most recent and realistic ScanObjectNN dataset (up to 6% improvement). Interestingly, we also show that MVTN can provide network robustness against rotation and occlusion in the 3D domain. The code is available at https://github.com/ajhamdi/MVTN.
引用
收藏
页码:1 / 11
页数:11
相关论文
共 50 条
  • [1] MVPN: Multi-View Prototype Network for 3D Shape Recognition
    Wu, Zizhao
    Yang, Ping
    Wang, Yigang
    [J]. IEEE ACCESS, 2019, 7 : 130363 - 130372
  • [2] Multi-view Moments Embedding Network for 3D Shape Recognition
    Xiao, Jun
    Zhang, Yuanxing
    Zhao, Pengyu
    Xiao, Kecheng
    Bian, Kaigui
    Zhang, Chunli
    Yan, Wei
    [J]. PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2257 - 2260
  • [3] Multi-view 3D shape style transformation
    Dalian University of Technology, Dalian, China
    [J]. Visual Comput, 1600, 2 (669-684):
  • [4] Multi-view 3D shape style transformation
    Xiuping Liu
    Hua Huang
    Weiming Wang
    Jun Zhou
    [J]. The Visual Computer, 2022, 38 : 669 - 684
  • [5] Multi-view 3D shape style transformation
    Liu, Xiuping
    Huang, Hua
    Wang, Weiming
    Zhou, Jun
    [J]. VISUAL COMPUTER, 2022, 38 (02): : 669 - 684
  • [6] Dynamic View Aggregation for Multi-View 3D Shape Recognition
    Zhou, Yuan
    Sun, Zhongqi
    Huo, Shuwei
    Kung, Sun-Yuan
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9163 - 9174
  • [7] Multi-view Convolutional Neural Networks for 3D Shape Recognition
    Su, Hang
    Maji, Subhransu
    Kalogerakis, Evangelos
    Learned-Miller, Erik
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 945 - 953
  • [8] PVNet: A Joint Convolutional Network of Point Cloud and Multi-View for 3D Shape Recognition
    You, Haoxuan
    Feng, Yifan
    Ji, Rongrong
    Gao, Yue
    [J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1310 - 1318
  • [9] MHSAN: Multi-view hierarchical self-attention network for 3D shape recognition
    Cao, Jiangzhong
    Yu, Lianggeng
    Ling, Bingo Wing-Kuen
    Yao, Zijie
    Dai, Qingyun
    [J]. PATTERN RECOGNITION, 2024, 150
  • [10] Multi-view dual attention network for 3D object recognition
    Wenju Wang
    Yu Cai
    Tao Wang
    [J]. Neural Computing and Applications, 2022, 34 : 3201 - 3212