3D Object retrieval based on non-local graph neural networks

被引:2
|
作者
Li, Yin-min [1 ,2 ]
Gao, Zan [2 ]
Tao, Ya-bin [3 ]
Wang, Li-li [4 ]
Xue, Yan-bing [1 ]
机构
[1] Tianjin Univ Technol, Tianjin Key Lab Intelligence Comp & Novel Softwar, Key Lab Comp Vis & Syst, Minist Educ, Tianjin 300384, Peoples R China
[2] Qilu Univ Technol, Shandong Artif Intelligence Inst, Shandong Acad Sci, Jinan 250014, Peoples R China
[3] Jiangxi Vocat Tech Coll Ind Trade, Nanchang 330038, Jiangxi, Peoples R China
[4] China Unicorn Yantai Branch, Yantai 264006, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
3D object retrieval; Non-local graph neural network; 3D shape descriptors; MODEL RETRIEVAL; DISCRIMINATION; SEARCH;
D O I
10.1007/s11042-020-09248-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
3D object retrieval is a hot research field in computer vision and multimedia analysis domain. Since the appearance feature and points of view of 3D objects are very different, thus, the distribution of the training set and test set are variant which is very suitable for transfer learning or cross-domain learning. In the transfer learning or cross-domain learning, the feature extraction is very important which should have good robust for different domains. Thus, in this work, we pay attention to the feature extraction of 3D objects. So far, different feature representations and object retrieval approaches have been proposed. Among them, view-based deep learning retrieval methods achieve state-of-the-art performance, but the existing deep learning retrieval methods only simply use a deep neural network to extract features from each view and directly obtain the view-level shape descriptors without utilizing the spatial relationship between the views. In order to mine the spatial relationship among different views and obtain more discriminative 3D shape descriptors, in this work, 3D object retrieval based on non-local graph neural networks (NGNN) is proposed. In detail, the residual network is firstly utilized as the infrastructure, and then the non-local structure is embedded in the resnet to learn the intrinsic relationship between the views. Finally, the view pooling layer is employed to further fuse the information from different views, and obtain the discriminate feature for the 3D object. Experimental results on two public MVRED and NTU 3D datasets show that the non-local graph network is very efficient for exploring the latent relationship among different views, and the performance ofNGNNsignificantly outperforms state-of-the-art approaches whose improvement can reaches 12.4%-22.7% on ANMRR.
引用
收藏
页码:34011 / 34027
页数:17
相关论文
共 50 条
  • [21] Ensembling Neural Networks-based 3d model retrieval
    Liu, Yujie
    Li, Zongmin
    Li, Hua
    2008 3RD INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND APPLICATIONS, VOLS 1 AND 2, 2008, : 458 - +
  • [22] 3D MODEL RETRIEVAL BASED ON DEEP AUTOENCODER NEURAL NETWORKS
    Liu, Zhao-Ming
    Chen, Yung-Yao
    Hidayati, S.
    Chien, Shih-Che
    Chang, Feng-Chia
    Hua, Kai-Lung
    2017 INTERNATIONAL CONFERENCE ON SIGNALS AND SYSTEMS (ICSIGSYS), 2017, : 290 - 296
  • [23] Group-Pair Convolutional Neural Networks for Multi-View Based 3D Object Retrieval
    Gao, Zan
    Wang, Deyu
    He, Xiangnan
    Zhang, Hua
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 2223 - 2231
  • [24] 3D Object Detection in Substation Scene Based on Graph Neural Network
    Zhang, Ting
    Zhang, Xingzhong
    Wang, Huimin
    Yang, Gang
    Wang, Dawei
    Computer Engineering and Applications, 2023, 59 (09) : 329 - 336
  • [25] HYPERSPECTRAL IMAGE CLASSIFICATION BASED ON NON-LOCAL NEURAL NETWORKS
    Wang, Chen
    Bai, Xiao
    Zhou, Lei
    Zhou, Jun
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 584 - 587
  • [26] 3D Object Detection Based on Convolutional Neural Networks: A Survey
    Wang Y.
    Tian Y.
    Li G.
    Wang K.
    Li D.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2021, 34 (12): : 1103 - 1119
  • [27] 3D Object Classification Based on Multi Convolutional Neural Networks
    Lu, Mei-qi
    Li, Wei
    Ning, Ya-guang
    INTERNATIONAL CONFERENCE ON APPLIED MECHANICS AND MECHANICAL AUTOMATION (AMMA 2017), 2017, : 204 - 208
  • [28] 3D object retrieval with stacked local convolutional autoencoder
    Leng, Biao
    Guo, Shuang
    Zhang, Xiangyang
    Xiong, Zhang
    SIGNAL PROCESSING, 2015, 112 : 119 - 128
  • [29] Reeb graph path dissimilarity for 3D object matching and retrieval
    Mohamed, Waleed
    Ben Hamza, A.
    VISUAL COMPUTER, 2012, 28 (03): : 305 - 318
  • [30] Reeb graph path dissimilarity for 3D object matching and retrieval
    Waleed Mohamed
    A. Ben Hamza
    The Visual Computer, 2012, 28 : 305 - 318