View-relation constrained global representation learning for multi-view-based 3D object recognition

被引:4
|
作者
Xu, Ruchang [1 ]
Mi, Qing [1 ]
Ma, Wei [1 ]
Zha, Hongbin [2 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100020, Peoples R China
[2] Peking Univ, Sch Elect Engn & Comp Sci, Key Lab Machine Percept MOE, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
3D object recognition; Multi-views; View-relation constraints; 3D global representation;
D O I
10.1007/s10489-022-03949-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-view observations provide complementary clues for 3D object recognition, but also include redundant information that appears different across views due to view-dependent projection, light reflection and self-occlusions. This paper presents a view-relation constrained global representation network (VCGR-Net) for 3D object recognition that can mitigate the view interference problem at all phases, from view-level source feature generation to multi-view feature aggregation. Specifically, we determine inter-view relations via LSTM implicitly. Based on the relations, we construct a two-stage feature selection module to filter features at each view according to their importance to the global representation and their reliability as observations at specific views. The selected features are then aggregated by referring to intra- and inter-view spatial context to generate global representation for 3D object recognition. Experiments on the ModelNet40 and ModelNet10 datasets demonstrate that the proposed method can suppress view interference and therefore outperform state-of-the-art methods in 3D object recognition.
引用
收藏
页码:7741 / 7750
页数:10
相关论文
共 50 条
  • [1] View-relation constrained global representation learning for multi-view-based 3D object recognition
    Ruchang Xu
    Qing Mi
    Wei Ma
    Hongbin Zha
    Applied Intelligence, 2023, 53 : 7741 - 7750
  • [2] Learning Disentangled Representation for Multi-View 3D Object Recognition
    Huang, Jingjia
    Yan, Wei
    Li, Ge
    Li, Thomas
    Liu, Shan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (02) : 646 - 659
  • [3] Multi-view-based siamese convolutional neural network for 3D object retrieval
    Li, Haisheng
    Zheng, Yanping
    Cao, Jian
    Cai, Qiang
    COMPUTERS & ELECTRICAL ENGINEERING, 2019, 78 : 11 - 21
  • [4] Learning Relationships for Multi-View 3D Object Recognition
    Yang, Ze
    Wang, Liwei
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7504 - 7513
  • [5] CFMVOR: Federated Multi-view 3D Object Recognition Based on Compressed Learning
    Xiao, Di
    Zhang, Meng
    Zhang, Maolan
    Chen, Lvjun
    PATTERN RECOGNITION AND COMPUTER VISION, PT XIII, PRCV 2024, 2025, 15043 : 280 - 293
  • [6] Review of multi-view 3D object recognition methods based on deep learning
    Qi, Shaohua
    Ning, Xin
    Yang, Guowei
    Zhang, Liping
    Long, Peng
    Cai, Weiwei
    Li, Weijun
    DISPLAYS, 2021, 69
  • [7] Scalable representation and learning for 3D object recognition using shared feature-based view clustering
    Kim, S
    Kweon, IS
    COMPUTER VISION - ACCV 2006, PT II, 2006, 3852 : 561 - 570
  • [8] Multi-view representation and synthesis for 3D object movie
    Lie, WN
    Wei, BE
    2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, 2002, : 529 - 532
  • [9] Feature representation for 3D object retrieval based on unconstrained multi-view
    Zhou, Bin
    Wang, Xuanyin
    MULTIMEDIA SYSTEMS, 2022, 28 (05) : 1699 - 1711
  • [10] Feature representation for 3D object retrieval based on unconstrained multi-view
    Bin Zhou
    Xuanyin Wang
    Multimedia Systems, 2022, 28 : 1699 - 1711