View-relation constrained global representation learning for multi-view-based 3D object recognition

被引:4
|
作者
Xu, Ruchang [1 ]
Mi, Qing [1 ]
Ma, Wei [1 ]
Zha, Hongbin [2 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100020, Peoples R China
[2] Peking Univ, Sch Elect Engn & Comp Sci, Key Lab Machine Percept MOE, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
3D object recognition; Multi-views; View-relation constraints; 3D global representation;
D O I
10.1007/s10489-022-03949-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-view observations provide complementary clues for 3D object recognition, but also include redundant information that appears different across views due to view-dependent projection, light reflection and self-occlusions. This paper presents a view-relation constrained global representation network (VCGR-Net) for 3D object recognition that can mitigate the view interference problem at all phases, from view-level source feature generation to multi-view feature aggregation. Specifically, we determine inter-view relations via LSTM implicitly. Based on the relations, we construct a two-stage feature selection module to filter features at each view according to their importance to the global representation and their reliability as observations at specific views. The selected features are then aggregated by referring to intra- and inter-view spatial context to generate global representation for 3D object recognition. Experiments on the ModelNet40 and ModelNet10 datasets demonstrate that the proposed method can suppress view interference and therefore outperform state-of-the-art methods in 3D object recognition.
引用
收藏
页码:7741 / 7750
页数:10
相关论文
共 50 条
  • [31] View-based virtual learning and recognition of 3D object using view model obtained by motion-stereo
    Wang, CH
    Sakaue, K
    NEW GENERATION COMPUTING, 2000, 18 (02) : 127 - 135
  • [32] View-based virtual learning and recognition of 3D object using view model obtained by motion-stereo
    Caihua Wang
    Katsuhiko Sakaue
    New Generation Computing, 2000, 18 : 127 - 135
  • [33] View-based virtual learning and recognition of 3D object using view model obtained by motion-stereo
    Wang, Caihua
    Sakaue, Katsuhiko
    2000, Ohmsha Ltd, Tokyo, Japan (18)
  • [34] Multimodal learning for view-based 3D object classification
    Chen, Fuhai
    Ji, Rongrong
    Cao, Liujuan
    NEUROCOMPUTING, 2016, 195 : 23 - 29
  • [35] A multi-view-based noise correction algorithm for crowdsourcing learning
    Li, Xinyang
    Li, Chaoqun
    Jiang, Liangxiao
    INFORMATION FUSION, 2023, 91 : 529 - 541
  • [36] Best View Selection of 3D Object Based on Sample Learning
    Liu, Zhi
    Feng, Yipan
    Chen, Qihua
    Pan, Xiang
    FOUNDATIONS OF INTELLIGENT SYSTEMS (ISKE 2011), 2011, 122 : 557 - +
  • [37] Multi-view representation learning for multi-view action recognition
    Hao, Tong
    Wu, Dan
    Wang, Qian
    Sun, Jin-Sheng
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 48 : 453 - 460
  • [38] Object-based encoding for multi-view sequences of 3D object
    Yi, J
    Rhee, K
    Kim, S
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2002, 17 (03) : 293 - 304
  • [39] View planning for efficient contour-based 3D object recognition
    Urdiales, C.
    de Trazegnies, C.
    Pacheco, J.
    Sandoval, F.
    MELECON 2010: THE 15TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, 2010, : 206 - 211
  • [40] MORE: simultaneous multi-view 3D object recognition and pose estimation
    Parisotto, Tommaso
    Mukherjee, Subhaditya
    Kasaei, Hamidreza
    INTELLIGENT SERVICE ROBOTICS, 2023, 16 (04) : 497 - 508