View-relation constrained global representation learning for multi-view-based 3D object recognition

被引:4
|
作者
Xu, Ruchang [1 ]
Mi, Qing [1 ]
Ma, Wei [1 ]
Zha, Hongbin [2 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100020, Peoples R China
[2] Peking Univ, Sch Elect Engn & Comp Sci, Key Lab Machine Percept MOE, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
3D object recognition; Multi-views; View-relation constraints; 3D global representation;
D O I
10.1007/s10489-022-03949-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-view observations provide complementary clues for 3D object recognition, but also include redundant information that appears different across views due to view-dependent projection, light reflection and self-occlusions. This paper presents a view-relation constrained global representation network (VCGR-Net) for 3D object recognition that can mitigate the view interference problem at all phases, from view-level source feature generation to multi-view feature aggregation. Specifically, we determine inter-view relations via LSTM implicitly. Based on the relations, we construct a two-stage feature selection module to filter features at each view according to their importance to the global representation and their reliability as observations at specific views. The selected features are then aggregated by referring to intra- and inter-view spatial context to generate global representation for 3D object recognition. Experiments on the ModelNet40 and ModelNet10 datasets demonstrate that the proposed method can suppress view interference and therefore outperform state-of-the-art methods in 3D object recognition.
引用
收藏
页码:7741 / 7750
页数:10
相关论文
共 50 条
  • [21] Deep models for multi-view 3D object recognition: a review
    Alzahrani, Mona
    Usman, Muhammad
    Jarraya, Salma Kammoun
    Anwar, Saeed
    Helmy, Tarek
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (12)
  • [22] Multi-view Harmonized Bilinear Network for 3D Object Recognition
    Yu, Tan
    Meng, Jingjing
    Yuan, Junsong
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 186 - 194
  • [23] Multi-view ensemble manifold regularization for 3D object recognition
    Hong, Chaoqun
    Yu, Jun
    You, Jane
    Chen, Xuhui
    Tao, Dapeng
    INFORMATION SCIENCES, 2015, 320 : 395 - 405
  • [24] MVContrast: Unsupervised Pretraining for Multi-view 3D Object Recognition
    Luequan Wang
    Hongbin Xu
    Wenxiong Kang
    Machine Intelligence Research, 2023, 20 : 872 - 883
  • [25] MVContrast: Unsupervised Pretraining for Multi-view 3D Object Recognition
    Wang, Luequan
    Xu, Hongbin
    Kang, Wenxiong
    MACHINE INTELLIGENCE RESEARCH, 2023, 20 (06) : 872 - 883
  • [26] Multi-view dual attention network for 3D object recognition
    Wang, Wenju
    Cai, Yu
    Wang, Tao
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (04): : 3201 - 3212
  • [27] Scalable representation for 3D object recognition using feature sharing and view clustering
    Kim, Sungho
    Kweon, In So
    PATTERN RECOGNITION, 2008, 41 (02) : 754 - 773
  • [28] Performance evaluation of a 3D multi-view-based particle filter for visual object tracking using GPUs and multicore CPUs
    Concha, David
    Cabido, Raul
    Jose Pantrigo, Juan
    Montemayor, Antonio S.
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2018, 15 (02) : 309 - 327
  • [29] Learning Canonical View Representation for 3D Shape Recognition with Arbitrary Views
    Wei, Xin
    Gong, Yifei
    Wang, Fudong
    Sun, Xing
    Sun, Jian
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 397 - 406
  • [30] JLCRB: A unified multi-view-based joint representation learning for CircRNA binding sites prediction
    Du, Xiuquan
    Xue, Zhigang
    JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 136