Learning shared template representation with augmented feature for multi-object pose estimation

被引:0
|
作者
Luo, Qifeng [1 ]
Xu, Ting -Bing [1 ,2 ]
Liu, Fulin [1 ]
Li, Tianren [1 ]
Wei, Zhenzhong [1 ]
机构
[1] Beihang Univ, Sch Instrumentat & Optoelect Engn, Minist Educ, Key Lab Precis Optomechatron Technol, Beijing 100191, Peoples R China
[2] SenseTime Grp Ltd, Beijing 100191, Peoples R China
关键词
Pose estimation; Shared template matching; Representation learning; Occluded objects; Augmented semantic feature;
D O I
10.1016/j.neunet.2024.106352
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Template matching pose estimation methods based on deep learning have made significant advancements via metric learning or reconstruction learning. Existing approaches primarily build distinct template representation libraries (codebooks) from rendered images for each object, which complicate the training process and increase memory cost for multi -object tasks. Additionally, they struggle to effectively handle discrepancies between the distributions of training and test sets, particularly for occluded objects, resulting in suboptimal matching accuracy. In this study, we propose a shared template representation learning method with augmented semantic features to address these issues. Our method learns representations concurrently using metric and reconstruction learning as similarity constraints, and augments response of network to objects through semantic feature constraints for better generalization performance. Furthermore, rotation matrices serve as templates for codebook construction, leading to excellent matching accuracy compared to rendered images. Notably, it contributes to the effective decoupling of object categories and templates, necessitating the maintenance of only a shared codebook in multi -object pose estimation tasks. Extensive experiments on Linemod, LinemodOccluded and TLESS datasets demonstrate that the proposed method employing shared templates achieves superior matching accuracy. Moreover, proposed method exhibits robustness on a collected aircraft dataset, further validating its efficacy.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Cognitive Template-Clustering Improved LineMod for Efficient Multi-object Pose Estimation
    Zhang, Tielin
    Yang, Yang
    Zeng, Yi
    Zhao, Yuxuan
    [J]. COGNITIVE COMPUTATION, 2020, 12 (04) : 834 - 843
  • [2] Cognitive Template-Clustering Improved LineMod for Efficient Multi-object Pose Estimation
    Tielin Zhang
    Yang Yang
    Yi Zeng
    Yuxuan Zhao
    [J]. Cognitive Computation, 2020, 12 : 834 - 843
  • [3] Multi-Object Representation Learning via Feature Connectivity and Object-Centric Regularization
    Foo, Alex
    Hsu, Wynne
    Lee, Mong Li
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [4] Learning Discriminative Proposal Representation for Multi-object Tracking
    Huang, Yejia
    Liu, Xianqin
    Zhang, Yijun
    Hu, Jian-Fang
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2023, 14356 LNCS : 300 - 310
  • [5] Multi-Object Representation Learning with Iterative Variational Inference
    Greff, Klaus
    Kaufman, Raphael Lopez
    Kabra, Rishabh
    Watters, Nick
    Burgess, Chris
    Zoran, Daniel
    Matthey, Loic
    Botvinick, Matthew
    Lerchner, Alexander
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [6] Feature space trajectory distorted object representation for classification and pose estimation
    Casasent, D
    Neiberg, LM
    Sipe, MA
    [J]. OPTICAL ENGINEERING, 1998, 37 (03) : 914 - 923
  • [7] Active 6D Multi-Object Pose Estimation in Cluttered Scenarios with Deep Reinforcement Learning
    Sock, Juil
    Garcia-Hernando, Guillermo
    Kim, Tae-Kyun
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 10564 - 10571
  • [8] Online Multi-Object Tracking Based on Feature Representation and Bayesian Filtering Within a Deep Learning Architecture
    Xiang, Jun
    Zhang, Guoshuai
    Hou, Jianhua
    [J]. IEEE ACCESS, 2019, 7 : 27923 - 27935
  • [9] Coupled Iterative Refinement for 6D Multi-Object Pose Estimation
    Lipson, Lahav
    Teed, Zachary
    Goyal, Ankit
    Deng, Jia
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6718 - 6727
  • [10] Fast Algorithms of Multi-object Recognition and High Precision Localization for Pose Estimation
    Zhang, Yingjin
    Qin, Shiyin
    Hu, Xiaohui
    [J]. MEASUREMENT TECHNOLOGY AND ENGINEERING RESEARCHES IN INDUSTRY, PTS 1-3, 2013, 333-335 : 1192 - 1197