3D model retrieval based on multi-view attentional convolutional neural network

被引:7
|
作者
Liu, An-An [1 ]
Zhou, He-Yu [1 ]
Li, Meng-Jie [1 ]
Nie, Wei-Zhi [1 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
基金
中国国家自然科学基金;
关键词
3D model retrieval; Multi-view; CNN; LSTM; SHAPE DESCRIPTOR;
D O I
10.1007/s11042-019-7521-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a discriminative Multi-View Attentional Convolutional Neural Network, dubbed as MVA-CNN, which takes the multiple views of an shape as input and output the object category. Unlike previous view-based approaches that simply "compile" the view features into a compact 3D descriptors, our method can discover the context among multiple views in both the visual and spatial domain. First, we extract multiple rendered images from a 3D object by virtual cameras, and then we use Convolutional Neural Network (CNN) to abstract the information of the views. Second, we aggregate the visual views by two steps: 1). an element-wise maximum operation across the view features is adopted to discover discriminative features. 2). a soft attention mechanism is used to dynamically adjust the shape descriptors for better representing the spatial information. The entire network can be trained in an end-to-end way with the standard backpropagation. We verify the effectiveness of MVA-CNN on two widely used datasets: ModelNet10, ModelNet40 by comparing our method with state-of-the-art methods.
引用
下载
收藏
页码:4699 / 4711
页数:13
相关论文
共 50 条
  • [1] 3D model retrieval based on multi-view attentional convolutional neural network
    An-An Liu
    He-Yu Zhou
    Meng-Jie Li
    Wei-Zhi Nie
    Multimedia Tools and Applications, 2020, 79 : 4699 - 4711
  • [2] An Improved Multi-View Convolutional Neural Network for 3D Object Retrieval
    He, Xinwei
    Bai, Song
    Chu, Jiajia
    Bai, Xiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 7917 - 7930
  • [3] 3D object retrieval based on multi-view convolutional neural networks
    Xi-Xi Li
    Qun Cao
    Sha Wei
    Multimedia Tools and Applications, 2017, 76 : 20111 - 20124
  • [4] 3D object retrieval based on multi-view convolutional neural networks
    Li, Xi-Xi
    Cao, Qun
    Wei, Sha
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (19) : 20111 - 20124
  • [5] Learning-Based Multiple Pooling Fusion in Multi-View Convolutional Neural Network for 3D Model Classification and Retrieval
    Zeng, Hui
    Wang, Qi
    Li, Chen
    Song, Wei
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2019, 15 (05): : 1179 - 1191
  • [6] 3D Point Cloud Recognition Based on a Multi-View Convolutional Neural Network
    Zhang, Le
    Sun, Jian
    Zheng, Qiang
    SENSORS, 2018, 18 (11)
  • [7] Impression Estimation Model of 3D Objects Using Multi-View Convolutional Neural Network
    Sakashita, Keisuke
    Tobitani, Kensuke
    Taguchi, Koichi
    Hashimoto, Manabu
    Tani, Iori
    Hashimoto, Sho
    Katahira, Kenji
    Nagata, Noriko
    FRONTIERS OF COMPUTER VISION (IW-FCV 2022), 2022, 1578 : 343 - 355
  • [8] Aggregated Deep Convolutional Neural Networks for Multi-View 3D Object Retrieval
    Alzu'bi, Ahmad
    Abuarqoub, Abdelrahman
    Al-Hmouz, Ahmed
    2019 11TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT), 2019,
  • [9] Group-Pair Convolutional Neural Networks for Multi-View Based 3D Object Retrieval
    Gao, Zan
    Wang, Deyu
    He, Xiangnan
    Zhang, Hua
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 2223 - 2231
  • [10] Multi-view-based siamese convolutional neural network for 3D object retrieval
    Li, Haisheng
    Zheng, Yanping
    Cao, Jian
    Cai, Qiang
    COMPUTERS & ELECTRICAL ENGINEERING, 2019, 78 : 11 - 21