Recognition of 3D Object Based on Multi-View Recurrent Neural Networks

被引:0
|
作者
Dong S. [1 ]
Li W.-S. [1 ]
Zhang W.-Q. [1 ]
Zou K. [1 ]
机构
[1] Zhongshan Institute, University of Electronic Science and Technology of China, Zhongshan, 528406, Guangdong
关键词
3D object; Feature extraction; Feature fusion; Image retrieval; Multi-view;
D O I
10.12178/1001-0548.2019017
中图分类号
学科分类号
摘要
Multi-view convolutional neural networks (MVCNN) is more accurate and faster than those methods based on state-of-the-art 3D shape descriptors in 3D object recognition tasks. However, the input of MVCNN are views rendered from cameras at fixed positions, which is not the case of most applications. Furthermore, MVCNN uses max-pooling operation to fuse multi-view features and the information of original features may be lost. To address those two problems, a new recognition method of 3D objects based on multi-view recurrent neural networks (MVRNN) is proposed based on MVCNN with improvements on three aspects. First, a new item which is defined as the measure of discrimination is introduced into the cross-entropy loss function to enhance the discrimination of features from different objects. Second, a recurrent neural networks (RNN) is used to fuse multi-view features from free positions into a compact one, instead of the max-pooling operation in MVCNN. RNN can keep the completeness of information about appearance feature. At last, single view feature from free positon is matched with fused features via a bi-classification network to attain fine-grained recognition of 3D objects. Experiments are conducted on the open dataset ModelNet and the private dataset MV3D separately to validate the performance of MVRNN. The results show that MVRNN can exact multi-view features with higher degree of discrimination, and achieve higher accuracy than MVCNN on both datasets. © 2020, Editorial Board of Journal of the University of Electronic Science and Technology of China. All right reserved.
引用
收藏
页码:269 / 275
页数:6
相关论文
共 50 条
  • [21] Deep models for multi-view 3D object recognition: a review
    Alzahrani, Mona
    Usman, Muhammad
    Jarraya, Salma Kammoun
    Anwar, Saeed
    Helmy, Tarek
    [J]. Artificial Intelligence Review, 2024, 57 (12)
  • [22] Multi-view dual attention network for 3D object recognition
    Wenju Wang
    Yu Cai
    Tao Wang
    [J]. Neural Computing and Applications, 2022, 34 : 3201 - 3212
  • [23] Multi-view dual attention network for 3D object recognition
    Wang, Wenju
    Cai, Yu
    Wang, Tao
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (04): : 3201 - 3212
  • [24] Multi-view Harmonized Bilinear Network for 3D Object Recognition
    Yu, Tan
    Meng, Jingjing
    Yuan, Junsong
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 186 - 194
  • [25] Review of multi-view 3D object recognition methods based on deep learning
    Qi, Shaohua
    Ning, Xin
    Yang, Guowei
    Zhang, Liping
    Long, Peng
    Cai, Weiwei
    Li, Weijun
    [J]. DISPLAYS, 2021, 69
  • [26] Joint Multi-view 2D Convolutional Neural Networks for 3D Object Classification
    Xu, Jinglin
    Zhang, Xiangsen
    Li, Wenbin
    Liu, Xinwang
    Han, Junwei
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3202 - 3208
  • [27] A multi-view recurrent neural network for 3D mesh segmentation
    Le, Truc
    Bui, Giang
    Duan, Ye
    [J]. COMPUTERS & GRAPHICS-UK, 2017, 66 : 103 - 112
  • [28] 3D Point Cloud Recognition Based on a Multi-View Convolutional Neural Network
    Zhang, Le
    Sun, Jian
    Zheng, Qiang
    [J]. SENSORS, 2018, 18 (11)
  • [29] Object-based encoding for multi-view sequences of 3D object
    Yi, J
    Rhee, K
    Kim, S
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2002, 17 (03) : 293 - 304
  • [30] MORE: simultaneous multi-view 3D object recognition and pose estimation
    Parisotto, Tommaso
    Mukherjee, Subhaditya
    Kasaei, Hamidreza
    [J]. INTELLIGENT SERVICE ROBOTICS, 2023, 16 (04) : 497 - 508