YOLO-6D-Pose: Enhancing YOLO for Single-Stage Monocular Multi-Object 6D Pose Estimation

Citations: 0
Authors
Maji, Debapriya [1]
Nagori, Soyeb [1]
Mathew, Manu [1]
Poddar, Deepak [1]
Affiliation
[1] Texas Instruments Inc, Bangalore, India
DOI
10.1109/3DV62453.2024.00160
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Directly regressing 6 degrees of freedom for all the objects from a single RGB image is not well explored. Even end-to-end pose estimation approaches for a single object are less accurate than state-of-the-art methods. Most 6D pose estimation frameworks are multi-stage, relying on off-the-shelf deep networks for object and keypoint detection to establish correspondences between 3D object keypoints and 2D image locations. This is followed by a variant of RANSAC-based Perspective-n-Point (PnP) and then a complex refinement operation. In this work, we propose a multi-object 6D pose estimation framework by enhancing the popular YOLOX object detector. The network is end-to-end trainable and detects each object along with its pose from a single RGB image without any additional post-processing. We show that by properly parameterizing the 6D pose and carefully designing the loss function, we can achieve state-of-the-art accuracy without further refinement or any intermediate representations. YOLO-6D-Pose achieves SOTA results on the YCBV and LMO datasets, surpassing all existing monocular approaches. We systematically analyze various 6D augmentations to verify their correctness and propose a new translation augmentation for this task. The network does not rely on any correspondences and is independent of the CAD model during inference. Code is available at https://github.com/TexasInstruments/edgeai-yolox.
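
The abstract does not say which pose parameterization the network regresses. As a rough illustration of what "directly regressing 6 degrees of freedom" can look like, the sketch below decodes a pose from a hypothetical network output. It assumes a continuous 6D rotation vector orthonormalized with Gram-Schmidt, and a translation recovered from a predicted 2D object center plus depth under a pinhole camera model. The function names, the output layout, and the intrinsics `fx, fy, px, py` are illustrative assumptions, not the authors' code.

```python
import math

# Illustrative sketch only: the output layout below is an assumption,
# not something stated in the abstract.

def normalize(v):
    """Scale a 3-vector to unit length."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v]

def cross(a, b):
    """Cross product of two 3-vectors."""
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]

def rotation_from_6d(r6):
    """Turn 6 raw numbers into a 3x3 rotation matrix via Gram-Schmidt.

    The first three values give the first column direction; the last
    three are made orthogonal to it to give the second column; the
    third column is their cross product.
    """
    a1, a2 = r6[:3], r6[3:]
    b1 = normalize(a1)
    d = sum(x * y for x, y in zip(b1, a2))
    b2 = normalize([y - d * x for x, y in zip(b1, a2)])
    b3 = cross(b1, b2)
    # Rows of the matrix whose columns are b1, b2, b3.
    return [[b1[i], b2[i], b3[i]] for i in range(3)]

def translation_from_center_depth(cx, cy, tz, fx, fy, px, py):
    """Back-project a predicted 2D object center (pixels) and depth tz
    to a 3D translation using pinhole intrinsics."""
    return [(cx - px) * tz / fx, (cy - py) * tz / fy, tz]

# Hypothetical per-object regression output and camera intrinsics.
R = rotation_from_6d([2.0, 0.0, 0.0, 1.0, 3.0, 0.0])
t = translation_from_center_depth(420.0, 240.0, 2.0,
                                  fx=500.0, fy=500.0, px=320.0, py=240.0)
```

Decoding of this kind needs no 2D-3D correspondences and no PnP solve at inference, which is the property the abstract emphasizes.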
Pages: 1616-1625
Page count: 10