YOLOPose V2: Understanding and improving transformer-based 6D pose estimation

被引:13
|
作者
Periyasamy, Arul Selvam [1 ]
Amini, Arash [1 ]
Tsaturyan, Vladimir [1 ]
Behnke, Sven [1 ]
机构
[1] Univ Bonn, Autonomous Intelligent Syst, Bonn, Germany
关键词
Vision transformers; Object pose estimation; Object detection; CALIBRATION;
D O I
10.1016/j.robot.2023.104490
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
6D object pose estimation is a crucial prerequisite for autonomous robot manipulation applications. The state-of-the-art models for pose estimation are convolutional neural network (CNN)-based. Lately, Transformers, an architecture originally proposed for natural language processing, is achieving state -of-the-art results in many computer vision tasks as well. Equipped with the multi-head self-attention mechanism, Transformers enable simple single-stage end-to-end architectures for learning object detection and 6D object pose estimation jointly. In this work, we propose YOLOPose (short form for You Only Look Once Pose estimation), a Transformer-based multi-object 6D pose estimation method based on keypoint regression and an improved variant of the YOLOPose model. In contrast to the standard heatmaps for predicting keypoints in an image, we directly regress the keypoints. Additionally, we employ a learnable orientation estimation module to predict the orientation from the keypoints. Along with a separate translation estimation module, our model is end-to-end differentiable. Our method is suitable for real-time applications and achieves results comparable to state-of-the-art methods. We analyze the role of object queries in our architecture and reveal that the object queries specialize in detecting objects in specific image regions. Furthermore, we quantify the accuracy trade-off of using datasets of smaller sizes to train our model. & COPY; 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] AiPE: A Novel Transformer-Based Pose Estimation Method
    Lu, Kai
    Min, Dugki
    ELECTRONICS, 2024, 13 (05)
  • [22] Improving Zero-Shot Template-Based 6D Pose Estimation with Geometric Features
    Poellabauer, Thomas
    Weyel, Johannes
    Knauthe, Volker
    Berkei, Sarah
    Kuijper, Arjan
    ADVANCES IN VISUAL COMPUTING, ISVC 2024, PT I, 2025, 15046 : 44 - 57
  • [23] Estimation of 6D Pose of Objects Based on a Variant Adversarial Autoencoder
    Huang, Dan
    Ahn, Hyemin
    Li, Shile
    Hu, Yueming
    Lee, Dongheui
    NEURAL PROCESSING LETTERS, 2023, 55 (07) : 9581 - 9596
  • [24] Reconstruction-based 6D pose estimation for robotic assembly
    Shi, Zhongchen
    Xu, Kai
    Li, Zhang
    Guan, Banglei
    Wang, Gang
    Shang, Yang
    APPLIED OPTICS, 2020, 59 (31) : 9824 - 9835
  • [25] Estimation of 6D Pose of Objects Based on a Variant Adversarial Autoencoder
    Dan Huang
    Hyemin Ahn
    Shile Li
    Yueming Hu
    Dongheui Lee
    Neural Processing Letters, 2023, 55 : 9581 - 9596
  • [26] MULTISTREAM VALIDNET: IMPROVING 6D OBJECT POSE ESTIMATION BY AUTOMATIC MULTISTREAM VALIDATION
    Mazumder, Joy
    Zand, Mohsen
    Greenspan, Michael
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3143 - 3147
  • [27] FormerPose: An efficient multi-scale fusion Transformer network based on RGB-D for 6D pose estimation
    Hou, Pihong
    Zhang, Yongfang
    Wu, Yi
    Yan, Pengyu
    Zhang, Fuqiang
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2025, 106
  • [28] Single Shot 6D Object Pose Estimation
    Kleeberger, Kilian
    Huber, Marco F.
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 6239 - 6245
  • [29] BOP: Benchmark for 6D Object Pose Estimation
    Hodan, Tomas
    Michel, Frank
    Brachmann, Eric
    Kehl, Wadim
    Buch, Anders Glent
    Kraft, Dirk
    Drost, Bertram
    Vidal, Joel
    Ihrke, Stephan
    Zabulis, Xenophon
    Sahin, Caner
    Manhardt, Fabian
    Tombari, Federico
    Kim, Tae-Kyun
    Matas, Jiri
    Rother, Carsten
    COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 19 - 35
  • [30] Transformer-based 3D Human pose estimation and action achievement evaluation
    Yang, Aolei
    Zhou, Yinghong
    Yang, Banghua
    Xu, Yulin
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2024, 45 (04): : 136 - 144