Learning shared template representation with augmented feature for multi-object pose estimation

被引:0
|
作者
Luo, Qifeng [1 ]
Xu, Ting -Bing [1 ,2 ]
Liu, Fulin [1 ]
Li, Tianren [1 ]
Wei, Zhenzhong [1 ]
机构
[1] Beihang Univ, Sch Instrumentat & Optoelect Engn, Minist Educ, Key Lab Precis Optomechatron Technol, Beijing 100191, Peoples R China
[2] SenseTime Grp Ltd, Beijing 100191, Peoples R China
关键词
Pose estimation; Shared template matching; Representation learning; Occluded objects; Augmented semantic feature;
D O I
10.1016/j.neunet.2024.106352
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Template matching pose estimation methods based on deep learning have made significant advancements via metric learning or reconstruction learning. Existing approaches primarily build distinct template representation libraries (codebooks) from rendered images for each object, which complicate the training process and increase memory cost for multi -object tasks. Additionally, they struggle to effectively handle discrepancies between the distributions of training and test sets, particularly for occluded objects, resulting in suboptimal matching accuracy. In this study, we propose a shared template representation learning method with augmented semantic features to address these issues. Our method learns representations concurrently using metric and reconstruction learning as similarity constraints, and augments response of network to objects through semantic feature constraints for better generalization performance. Furthermore, rotation matrices serve as templates for codebook construction, leading to excellent matching accuracy compared to rendered images. Notably, it contributes to the effective decoupling of object categories and templates, necessitating the maintenance of only a shared codebook in multi -object pose estimation tasks. Extensive experiments on Linemod, LinemodOccluded and TLESS datasets demonstrate that the proposed method employing shared templates achieves superior matching accuracy. Moreover, proposed method exhibits robustness on a collected aircraft dataset, further validating its efficacy.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Single-Shot and Multi-Shot Feature Learning for Multi-Object Tracking
    Li, Yizhe
    Zhou, Sanping
    Qin, Zheng
    Wang, Le
    Wang, Jinjun
    Zheng, Nanning
    [J]. IEEE Transactions on Multimedia, 2024, 26 : 9515 - 9526
  • [22] Tree representation and feature fusion based method for multi-object binary image retrieval
    Liu, Dong
    Wang, Shengsheng
    Liu, Yiting
    Zeng, Fantao
    Wu, Jimin
    Li, Wenyang
    [J]. Journal of Information and Computational Science, 2013, 10 (04): : 1055 - 1064
  • [23] Identity Variance for Multi-Object Estimation
    Crouse, David F.
    Willett, Peter
    [J]. SIGNAL AND DATA PROCESSING OF SMALL TARGETS 2011, 2011, 8137
  • [24] Label space: A multi-object shape representation
    Malcolm, James
    Rathi, Yogesh
    Tannenbaum, Allen
    [J]. COMBINATORIAL IMAGE ANALYSIS, 2008, 4958 : 185 - +
  • [25] MULTI-OBJECT TRACKING USING SPARSE REPRESENTATION
    Lu, Weizhi
    Bai, Cong
    Kpalma, Kidiyo
    Ronsin, Joseph
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 2312 - 2316
  • [26] Simultaneous Estimation of Feature Correspondence and Stereo Object Pose with Application to Ultrasound Augmented Robotic Laparoscopy
    Jayarathne, Uditha L.
    Luo, Xiongbiao
    Chen, Elvis C. S.
    Peters, Terry M.
    [J]. AUGMENTED ENVIRONMENTS FOR COMPUTER-ASSISTED INTERVENTIONS, AE-CAI 2015, 2015, 9365 : 134 - 144
  • [27] Design of deformable template: A case for multi-object tracking
    Institute of Space Medico-Engineering, Beijing 100094, China
    [J]. Xitong Fangzhen Xuebao, 2006, 4 (1073-1077):
  • [28] Online Multi-object Tracking Exploiting Pose Estimation and Global-Local Appearance Features
    Jiang, Na
    Bai, Sichen
    Xu, Yue
    Zhou, Zhong
    Wu, Wei
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT I, 2018, 11139 : 814 - 816
  • [29] Feature Compression for Multimodal Multi-Object Tracking
    Li, Xinlin
    Hanna, Osama A.
    Fragouli, Christina
    Diggavi, Suhas
    Verma, Gunjan
    Bhattacharyya, Joydeep
    [J]. MILCOM 2023 - 2023 IEEE MILITARY COMMUNICATIONS CONFERENCE, 2023,
  • [30] Human skeleton behavior recognition model based on multi-object pose estimation with spatiotemporal semantics
    Liu, Jiaji
    Mu, Xiaofang
    Liu, Zhenyu
    Li, Hao
    [J]. MACHINE VISION AND APPLICATIONS, 2023, 34 (03)