Recognition of 3D Object Based on Multi-View Recurrent Neural Networks

被引：0

作者：

Dong S. ^{[1
]}

Li W.-S. ^{[1
]}

Zhang W.-Q. ^{[1
]}

Zou K. ^{[1
]}

机构：

[1] Zhongshan Institute, University of Electronic Science and Technology of China, Zhongshan, 528406, Guangdong

来源：

Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China | 2020年 / 49卷 / 02期

关键词：

3D object; Feature extraction; Feature fusion; Image retrieval; Multi-view;

D O I：

10.12178/1001-0548.2019017

中图分类号：

学科分类号：

摘要：

Multi-view convolutional neural networks (MVCNN) is more accurate and faster than those methods based on state-of-the-art 3D shape descriptors in 3D object recognition tasks. However, the input of MVCNN are views rendered from cameras at fixed positions, which is not the case of most applications. Furthermore, MVCNN uses max-pooling operation to fuse multi-view features and the information of original features may be lost. To address those two problems, a new recognition method of 3D objects based on multi-view recurrent neural networks (MVRNN) is proposed based on MVCNN with improvements on three aspects. First, a new item which is defined as the measure of discrimination is introduced into the cross-entropy loss function to enhance the discrimination of features from different objects. Second, a recurrent neural networks (RNN) is used to fuse multi-view features from free positions into a compact one, instead of the max-pooling operation in MVCNN. RNN can keep the completeness of information about appearance feature. At last, single view feature from free positon is matched with fused features via a bi-classification network to attain fine-grained recognition of 3D objects. Experiments are conducted on the open dataset ModelNet and the private dataset MV3D separately to validate the performance of MVRNN. The results show that MVRNN can exact multi-view features with higher degree of discrimination, and achieve higher accuracy than MVCNN on both datasets. © 2020, Editorial Board of Journal of the University of Electronic Science and Technology of China. All right reserved.

引用

页码：269 / 275

页数：6

共 50 条

[21] Deep models for multi-view 3D object recognition: a review
Alzahrani, Mona
Usman, Muhammad
Jarraya, Salma Kammoun
Anwar, Saeed
Helmy, Tarek
[J]. Artificial Intelligence Review, 2024, 57 (12)
[22] Multi-view dual attention network for 3D object recognition
Wenju Wang
Yu Cai
Tao Wang
[J]. Neural Computing and Applications, 2022, 34 : 3201 - 3212
[23] Multi-view dual attention network for 3D object recognition
Wang, Wenju
Cai, Yu
Wang, Tao
[J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (04): : 3201 - 3212
[24] Multi-view Harmonized Bilinear Network for 3D Object Recognition
Yu, Tan
Meng, Jingjing
Yuan, Junsong
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 186 - 194
[25] Review of multi-view 3D object recognition methods based on deep learning
Qi, Shaohua
Ning, Xin
Yang, Guowei
Zhang, Liping
Long, Peng
Cai, Weiwei
Li, Weijun
[J]. DISPLAYS, 2021, 69
[26] Joint Multi-view 2D Convolutional Neural Networks for 3D Object Classification
Xu, Jinglin
Zhang, Xiangsen
Li, Wenbin
Liu, Xinwang
Han, Junwei
[J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3202 - 3208
[27] A multi-view recurrent neural network for 3D mesh segmentation
Le, Truc
Bui, Giang
Duan, Ye
[J]. COMPUTERS & GRAPHICS-UK, 2017, 66 : 103 - 112
[28] 3D Point Cloud Recognition Based on a Multi-View Convolutional Neural Network
Zhang, Le
Sun, Jian
Zheng, Qiang
[J]. SENSORS, 2018, 18 (11)
[29] Object-based encoding for multi-view sequences of 3D object
Yi, J
Rhee, K
Kim, S
[J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2002, 17 (03) : 293 - 304
[30] MORE: simultaneous multi-view 3D object recognition and pose estimation
Parisotto, Tommaso
Mukherjee, Subhaditya
Kasaei, Hamidreza
[J]. INTELLIGENT SERVICE ROBOTICS, 2023, 16 (04) : 497 - 508

← 1 2 3 4 5 →