View-relation constrained global representation learning for multi-view-based 3D object recognition

被引:4
|
作者
Xu, Ruchang [1 ]
Mi, Qing [1 ]
Ma, Wei [1 ]
Zha, Hongbin [2 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100020, Peoples R China
[2] Peking Univ, Sch Elect Engn & Comp Sci, Key Lab Machine Percept MOE, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
3D object recognition; Multi-views; View-relation constraints; 3D global representation;
D O I
10.1007/s10489-022-03949-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-view observations provide complementary clues for 3D object recognition, but also include redundant information that appears different across views due to view-dependent projection, light reflection and self-occlusions. This paper presents a view-relation constrained global representation network (VCGR-Net) for 3D object recognition that can mitigate the view interference problem at all phases, from view-level source feature generation to multi-view feature aggregation. Specifically, we determine inter-view relations via LSTM implicitly. Based on the relations, we construct a two-stage feature selection module to filter features at each view according to their importance to the global representation and their reliability as observations at specific views. The selected features are then aggregated by referring to intra- and inter-view spatial context to generate global representation for 3D object recognition. Experiments on the ModelNet40 and ModelNet10 datasets demonstrate that the proposed method can suppress view interference and therefore outperform state-of-the-art methods in 3D object recognition.
引用
收藏
页码:7741 / 7750
页数:10
相关论文
共 50 条
  • [41] Multi-View Token Clustering and Fusion for 3D Object Recognition and Retrieval
    Fan, Linlong
    Ge, Yanqi
    Li, Wen
    Duan, Lixin
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1145 - 1150
  • [42] 3D Object Recognition via Multi-View Inspection in Unknown Environments
    Westell, Jamie
    Saeedi, Parvaneh
    11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2010), 2010, : 2088 - 2095
  • [43] MORE: simultaneous multi-view 3D object recognition and pose estimation
    Tommaso Parisotto
    Subhaditya Mukherjee
    Hamidreza Kasaei
    Intelligent Service Robotics, 2023, 16 : 497 - 508
  • [44] Fast and Robust Multi-View 3D Object Recognition in Point Clouds
    Pang, Guan
    Neumann, Ulrich
    2015 INTERNATIONAL CONFERENCE ON 3D VISION, 2015, : 171 - 179
  • [45] Local feature view clustering for 3D object recognition
    Lowe, DG
    2001 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2001, : 682 - 688
  • [46] Multi-View Attentive Contextualization for Multi-View 3D Object Detection
    Liu, Xianpeng
    Zheng, Ce
    Qian, Ming
    Xue, Nan
    Chen, Chen
    Zhang, Zhebin
    Li, Chen
    Wu, Tianfu
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 16688 - 16698
  • [47] Dynamic View Aggregation for Multi-View 3D Shape Recognition
    Zhou, Yuan
    Sun, Zhongqi
    Huo, Shuwei
    Kung, Sun-Yuan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9163 - 9174
  • [48] Metasample based sparse representation classification for multi-view object recognition
    Sun, H. (clhaosun@gmail.com), 2013, Central South University of Technology (44):
  • [49] Cyclic Refiner: Object-Aware Temporal Representation Learning for Multi-view 3D Detection and Tracking
    Guo, Mingzhe
    Zhang, Zhipeng
    Jing, Liping
    He, Yuan
    Wang, Ke
    Fan, Heng
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (12) : 6184 - 6206
  • [50] View-based 3D object retrieval via multi-modal graph learning
    Zhao, Sicheng
    Yao, Hongxun
    Zhang, Yanhao
    Wang, Yasi
    Liu, Shaohui
    SIGNAL PROCESSING, 2015, 112 : 110 - 118