Learning shared template representation with augmented feature for multi-object pose estimation

被引:0
|
作者
Luo, Qifeng [1 ]
Xu, Ting -Bing [1 ,2 ]
Liu, Fulin [1 ]
Li, Tianren [1 ]
Wei, Zhenzhong [1 ]
机构
[1] Beihang Univ, Sch Instrumentat & Optoelect Engn, Minist Educ, Key Lab Precis Optomechatron Technol, Beijing 100191, Peoples R China
[2] SenseTime Grp Ltd, Beijing 100191, Peoples R China
关键词
Pose estimation; Shared template matching; Representation learning; Occluded objects; Augmented semantic feature;
D O I
10.1016/j.neunet.2024.106352
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Template matching pose estimation methods based on deep learning have made significant advancements via metric learning or reconstruction learning. Existing approaches primarily build distinct template representation libraries (codebooks) from rendered images for each object, which complicate the training process and increase memory cost for multi -object tasks. Additionally, they struggle to effectively handle discrepancies between the distributions of training and test sets, particularly for occluded objects, resulting in suboptimal matching accuracy. In this study, we propose a shared template representation learning method with augmented semantic features to address these issues. Our method learns representations concurrently using metric and reconstruction learning as similarity constraints, and augments response of network to objects through semantic feature constraints for better generalization performance. Furthermore, rotation matrices serve as templates for codebook construction, leading to excellent matching accuracy compared to rendered images. Notably, it contributes to the effective decoupling of object categories and templates, necessitating the maintenance of only a shared codebook in multi -object pose estimation tasks. Extensive experiments on Linemod, LinemodOccluded and TLESS datasets demonstrate that the proposed method employing shared templates achieves superior matching accuracy. Moreover, proposed method exhibits robustness on a collected aircraft dataset, further validating its efficacy.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] IEKF based object pose estimation for Augmented Reality
    Song, Jiaru
    Hu, Shiqiang
    Yang, Yongsheng
    [J]. PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON VIRTUAL REALITY (ICVR 2018), 2018, : 15 - 20
  • [32] Multi-Stage Feature Learning Based Object Recognition and 3D Pose Estimation with Kinect
    Zeng, Wei
    Liang, Guoyuan
    Wang, Can
    Wu, Xinyu
    [J]. 2016 SIXTH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2016, : 498 - 504
  • [33] Human skeleton behavior recognition model based on multi-object pose estimation with spatiotemporal semantics
    Jiaji Liu
    Xiaofang Mu
    Zhenyu Liu
    Hao Li
    [J]. Machine Vision and Applications, 2023, 34
  • [34] Discrimination analysis using multi-object statistics of shape and pose
    Gorczowski, Kevin
    Styner, Martin
    Jeong, Ja Yeon
    Marron, J. S.
    Piven, Joseph
    Hazlett, Heather Cody B.
    Pizer, Stephen M.
    Gerig, Guido
    [J]. MEDICAL IMAGING 2007: IMAGE PROCESSING, PTS 1-3, 2007, 6512
  • [35] Simple Multi-Resolution Representation Learning for Human Pose Estimation
    Tran, Trung Q.
    Nguyen, Giang, V
    Kim, Daeyoung
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 511 - 518
  • [36] Structured Feature Learning for Pose Estimation
    Chu, Xiao
    Ouyang, Wanli
    Li, Hongsheng
    Wang, Xiaogang
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4715 - 4723
  • [37] Multi-object Tracking Based on Nearest Optimal Template Library
    Tian, Ran
    Zhang, Xiang
    Chen, Donghang
    Hu, Yujie
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT I, 2021, 12891 : 331 - 342
  • [38] Multi-Object Grasping Detection With Hierarchical Feature Fusion
    Wu, Guangbin
    Chen, Weishan
    Cheng, Hui
    Zuo, Wangmeng
    Zhang, David
    You, Jane
    [J]. IEEE ACCESS, 2019, 7 : 43884 - 43894
  • [39] Joint Template Matching Algorithm for Associated Multi-object Detection
    Xie, Jianbin
    Liu, Tong
    Chen, Zhangyong
    Zhuang, Zhaowen
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2012, 6 (01): : 395 - 405
  • [40] Object Pose Estimation and Feature Extraction Based on PVNet
    Kao, Yi-Hsiang
    Chen, Ching-Kun
    Chen, Chih-Cheng
    Lan, Chen-Yen
    [J]. IEEE ACCESS, 2022, 10 : 122387 - 122398