Cognitive Template-Clustering Improved LineMod for Efficient Multi-object Pose Estimation

被引:10
|
作者
Zhang, Tielin [1 ]
Yang, Yang [5 ]
Zeng, Yi [1 ,2 ,3 ,4 ]
Zhao, Yuxuan [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Res Ctr Brain Inspired Intelligence, Beijing, Peoples R China
[2] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Shanghai, Peoples R China
[3] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China
[4] Univ Chinese Acad Sci, Beijing, Peoples R China
[5] Peking Univ, Sch Software & Microelect, Beijing, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Muller-Lyer illusion; Cognitive template-clustering; Brain-inspired computation; LineMod; 6D pose estimation;
D O I
10.1007/s12559-020-09717-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Various types of theoretical algorithms have been proposed for 6D pose estimation, e.g., the point pair method, template matching method, Hough forest method, and deep learning method. However, they are still far from the performance of our natural biological systems, which can undertake 6D pose estimation of multi-objects efficiently, especially with severe occlusion. With the inspiration of the Muller-Lyer illusion in the biological visual system, in this paper, we propose a cognitive template-clustering improved LineMod (CT-LineMod) model. The model uses a 7D cognitive feature vector to replace standard 3D spatial points in the clustering procedure of Patch-LineMod, in which the cognitive distance of different 3D spatial points will be further influenced by the additional 4D information related with direction and magnitude of features in the Muller-Lyer illusion. The 7D vector will be dimensionally reduced into the 3D vector by the gradient-descent method, and then further clustered by K-means to aggregately match templates and automatically eliminate superfluous clusters, which makes the template matching possible on both holistic and part-based scales. The model has been verified on the standard Doumanoglou dataset and demonstrates a state-of-the-art performance, which shows the accuracy and efficiency of the proposed model on cognitive feature distance measurement and template selection on multiple pose estimation under severe occlusion. The powerful feature representation in the biological visual system also includes characteristics of the Muller-Lyer illusion, which, to some extent, will provide guidance towards a biologically plausible algorithm for efficient 6D pose estimation under severe occlusion.
引用
收藏
页码:834 / 843
页数:10
相关论文
共 50 条
  • [1] Cognitive Template-Clustering Improved LineMod for Efficient Multi-object Pose Estimation
    Tielin Zhang
    Yang Yang
    Yi Zeng
    Yuxuan Zhao
    [J]. Cognitive Computation, 2020, 12 : 834 - 843
  • [2] Learning shared template representation with augmented feature for multi-object pose estimation
    Luo, Qifeng
    Xu, Ting -Bing
    Liu, Fulin
    Li, Tianren
    Wei, Zhenzhong
    [J]. NEURAL NETWORKS, 2024, 176
  • [3] Efficient Multi-Object Pose Estimation using Multi-Resolution Deformable Attention and Query Aggregation
    Periyasamy, Arul Selvam
    Tsaturyan, Vladimir
    Behnke, Sven
    [J]. 2023 SEVENTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING, IRC 2023, 2023, : 247 - 254
  • [4] Coupled Iterative Refinement for 6D Multi-Object Pose Estimation
    Lipson, Lahav
    Teed, Zachary
    Goyal, Ankit
    Deng, Jia
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6718 - 6727
  • [5] Fast Algorithms of Multi-object Recognition and High Precision Localization for Pose Estimation
    Zhang, Yingjin
    Qin, Shiyin
    Hu, Xiaohui
    [J]. MEASUREMENT TECHNOLOGY AND ENGINEERING RESEARCHES IN INDUSTRY, PTS 1-3, 2013, 333-335 : 1192 - 1197
  • [6] An Efficient Multi-Object Tracking Guided by Spatial Clustering on Vision Sensors
    Chen, Yulin
    Li, Zongtan
    Wu, Linhuang
    Chen, Pingping
    [J]. IEEE SENSORS JOURNAL, 2024, 24 (12) : 19344 - 19351
  • [7] Efficient Model-Based Object Pose Estimation Based on Multi-Template Tracking and PnP Algorithms
    Tsai, Chi-Yi
    Hsu, Kuang-Jui
    Nisar, Humaira
    [J]. ALGORITHMS, 2018, 11 (08)
  • [8] Multi-Object Recognition and 6-DoF Pose Estimation Based on Synthetic Datasets
    Hu G.
    Ou M.
    Li Z.
    [J]. Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2024, 52 (04): : 42 - 50
  • [9] A Framework for 3D Object Detection and Pose Estimation in Unstructured Environment Using Single Shot Detector and Refined LineMOD Template Matching
    Chen, Shili
    Hong, Jie
    Liu, Xineng
    Li, Jian
    Zhang, Tao
    Wang, Danwei
    Guan, Yisheng
    [J]. 2019 24TH IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2019, : 499 - 504
  • [10] Multi-object statistical pose plus shape models
    Bossa, M. N.
    Mos, S.
    [J]. 2007 4TH IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING : MACRO TO NANO, VOLS 1-3, 2007, : 1204 - 1207