A Progressive Multi-View Learning Approach for Multi-Loss Optimization in 3D Object Recognition

被引:2
|
作者
Prasad, Shitala [1 ]
Li, Yiqun [1 ]
Lin, Dongyun [1 ]
Dong, Sheng [1 ]
Nwe, Ma Tin Lay [1 ]
机构
[1] ASTAR, Visual Intelligence Dept, Inst Infocomm Res I2R, Singapore 138632, Singapore
关键词
Object recognition; Training; Three-dimensional displays; Task analysis; Visualization; Prototypes; Solid modeling; 3D unseen learning; DCNN; progressive multi-view learning; object detection; self-supervised learning; NETWORKS;
D O I
10.1109/LSP.2021.3132794
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
3D object recognition is a well studied 2D multi-view object classification task that achieves high accuracy if the object textures are distinctive. However, if objects are texture-less and are only differentiable by their shapes but at certain viewpoints. Thus, the problem is still very challenging. Furthermore, the existing methods are mostly based on supervised learning with lots of images per object which are difficult to collect and label them for training. In this letter, we introduced a multi-loss view invariant stochastic prototype embedding to minimize and improve the recognition accuracy of novel objects at different viewpoints by using a progressive multi-view learning approach. An extensive experimental results show that the proposed method outperforms the state-of-the-art methods on different types datasets and also on different backbones.
引用
收藏
页码:707 / 711
页数:5
相关论文
共 50 条
  • [1] A Progressive Multi-View Learning Approach for Multi-Loss Optimization in 3D Object Recognition
    Prasad, Shitala
    Li, Yiqun
    Lin, Dongyun
    Dong, Sheng
    Nwe, Ma Tin Lay
    IEEE Signal Processing Letters, 2022, 29 : 707 - 711
  • [2] Learning Relationships for Multi-View 3D Object Recognition
    Yang, Ze
    Wang, Liwei
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7504 - 7513
  • [3] Learning Disentangled Representation for Multi-View 3D Object Recognition
    Huang, Jingjia
    Yan, Wei
    Li, Ge
    Li, Thomas
    Liu, Shan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (02) : 646 - 659
  • [4] CFMVOR: Federated Multi-view 3D Object Recognition Based on Compressed Learning
    Xiao, Di
    Zhang, Meng
    Zhang, Maolan
    Chen, Lvjun
    PATTERN RECOGNITION AND COMPUTER VISION, PT XIII, PRCV 2024, 2025, 15043 : 280 - 293
  • [5] Review of multi-view 3D object recognition methods based on deep learning
    Qi, Shaohua
    Ning, Xin
    Yang, Guowei
    Zhang, Liping
    Long, Peng
    Cai, Weiwei
    Li, Weijun
    DISPLAYS, 2021, 69
  • [6] Multi-View Attentive Contextualization for Multi-View 3D Object Detection
    Liu, Xianpeng
    Zheng, Ce
    Qian, Ming
    Xue, Nan
    Chen, Chen
    Zhang, Zhebin
    Li, Chen
    Wu, Tianfu
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 16688 - 16698
  • [7] 3D LayoutCRF for multi-view object class recognition and segmentation
    Hoiem, Derek
    Rother, Carsten
    Winn, John
    2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 580 - +
  • [8] Multi-view convolutional vision transformer for 3D object recognition
    Li, Jie
    Liu, Zhao
    Li, Li
    Lin, Junqin
    Yao, Jian
    Tu, Jingmin
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95
  • [9] Multi-view ensemble manifold regularization for 3D object recognition
    Hong, Chaoqun
    Yu, Jun
    You, Jane
    Chen, Xuhui
    Tao, Dapeng
    INFORMATION SCIENCES, 2015, 320 : 395 - 405
  • [10] MVContrast: Unsupervised Pretraining for Multi-view 3D Object Recognition
    Luequan Wang
    Hongbin Xu
    Wenxiong Kang
    Machine Intelligence Research, 2023, 20 : 872 - 883