3D Object retrieval based on non-local graph neural networks

Cited by: 2

Authors:

Li, Yin-min [1,2]

Gao, Zan [2]

Tao, Ya-bin [3]

Wang, Li-li [4]

Xue, Yan-bing [1]

Affiliations:

[1] Tianjin Univ Technol, Tianjin Key Lab Intelligence Comp & Novel Softwar, Key Lab Comp Vis & Syst, Minist Educ, Tianjin 300384, Peoples R China

[2] Qilu Univ Technol, Shandong Artif Intelligence Inst, Shandong Acad Sci, Jinan 250014, Peoples R China

[3] Jiangxi Vocat Tech Coll Ind Trade, Nanchang 330038, Jiangxi, Peoples R China

[4] China Unicom Yantai Branch, Yantai 264006, Peoples R China

Source:

MULTIMEDIA TOOLS AND APPLICATIONS | 2020 / Vol. 79 / Issue 45-46

Funding:

National Key Research and Development Program of China; National Natural Science Foundation of China;

Keywords:

3D object retrieval; Non-local graph neural network; 3D shape descriptors; MODEL RETRIEVAL; DISCRIMINATION; SEARCH;

DOI:

10.1007/s11042-020-09248-z

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Subject classification code:

0812 ;

Abstract:

3D object retrieval is an active research field in computer vision and multimedia analysis. Because the appearance features and viewpoints of 3D objects vary widely, the distributions of the training set and the test set differ, which makes the task well suited to transfer learning or cross-domain learning. In transfer learning or cross-domain learning, feature extraction is very important and should be robust across domains. This work therefore focuses on feature extraction for 3D objects. Many feature representations and object retrieval approaches have been proposed so far. Among them, view-based deep learning retrieval methods achieve state-of-the-art performance, but existing methods simply use a deep neural network to extract features from each view and directly obtain view-level shape descriptors, without exploiting the spatial relationships between views. To mine the spatial relationships among different views and obtain more discriminative 3D shape descriptors, this work proposes 3D object retrieval based on non-local graph neural networks (NGNN). Specifically, a residual network is first used as the backbone, and a non-local structure is then embedded in the ResNet to learn the intrinsic relationships between views. Finally, a view pooling layer is employed to further fuse the information from different views and obtain a discriminative feature for the 3D object. Experimental results on two public datasets, MVRED and NTU 3D, show that the non-local graph network is very effective at exploring the latent relationships among different views, and that NGNN significantly outperforms state-of-the-art approaches, with improvements of 12.4%-22.7% on ANMRR.
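
The pipeline outlined in the abstract (per-view features, a non-local step relating the views, then view pooling) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names are hypothetical, identity embeddings stand in for the learned projections of a trained non-local block, the per-view feature vectors are assumed to come from some backbone such as a ResNet, and element-wise max is assumed as the pooling operator, which the abstract does not specify.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def non_local_over_views(views):
    """Refine each view feature with a softmax-weighted sum over all views.

    Assumption: identity embeddings replace the learned projections
    that a trained non-local block would apply before the dot product.
    """
    refined = []
    for query in views:
        # Dot-product affinity between this view and every view (itself included).
        scores = [sum(q * k for q, k in zip(query, key)) for key in views]
        weights = softmax(scores)
        # Aggregate all views, weighted by their affinity to the query view.
        context = [sum(w * v[d] for w, v in zip(weights, views))
                   for d in range(len(query))]
        # Residual connection keeps the original view feature.
        refined.append([x + c for x, c in zip(query, context)])
    return refined

def view_pool(views):
    """Element-wise max across views (assumed pooling operator)."""
    return [max(column) for column in zip(*views)]

def shape_descriptor(views):
    """Fuse per-view features into a single object-level descriptor."""
    return view_pool(non_local_over_views(views))
```

Calling `shape_descriptor` on a list of equal-length per-view feature vectors returns one vector of the same length, which could then be compared across objects with any distance measure for retrieval.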

Pages: 34011-34027

Page count: 17
