Two-Phase Approach for Monocular Object Detection and 6-DoF Pose Estimation

被引:1
|
作者
Jang, Jae-hoon [1 ]
Lee, Jungyoon [2 ]
Kim, Seong-heum [1 ]
机构
[1] Soongsil Univ, Coll Informat Technol, Sch AI Convergence, Seoul, South Korea
[2] Soongsil Univ, Dept Intelligent Syst, Seoul, South Korea
关键词
Deep learning; Object detection; 6-DoF pose estimation; Perspective-n-point (PnP) algorithm;
D O I
10.1007/s42835-023-01640-7
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a two-phase algorithm that first identifies the categories and 2D proposal regions of 3D objects and then estimates the eight corners of cubes bounding the target objects. Given the predicted corners, the six-degrees-of-freedom (6-DoF) poses of the 3D objects are calculated using the conventional perspective-n-point (PnP) algorithm and evaluated with respect to manually annotated corners. In addition, several 3D models with high-quality shapes, texture information, 2D images, and annotations, such as 2D boxes, 3D cuboids, and segmentation masks, are collected. New objects are included while validating the proposed method. Our results are compared qualitatively and quantitatively with those of the baseline model using the publicly accessible LineMOD dataset, additional annotations in the OCCLUSION dataset, and our own custom dataset. While handling single and multiple objects in testing scenes, the proposed method is observed to exhibit clear improvements on both the aforementioned datasets and in real-world examples.
引用
收藏
页码:1817 / 1825
页数:9
相关论文
共 50 条
  • [21] Two-Steps Framework for Highly Accurate 6-DoF Pose Estimation
    Piriyatharawet, Teerawat
    Teo, Wei-De
    Chong, Shin-Horng
    2024 33RD INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS, ISIE 2024, 2024,
  • [22] Multi-modal 6-DoF object pose tracking: integrating spatial cues with monocular RGB imagery
    Mei, Yunpeng
    Wang, Shuze
    Li, Zhuo
    Sun, Jian
    Wang, Gang
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025, 16 (02) : 1327 - 1340
  • [23] Deep Learning-Based 6-DoF Object Pose Estimation Considering Synthetic Dataset
    Zheng, Tianyu
    Zhang, Chunyan
    Zhang, Shengwen
    Wang, Yanyan
    SENSORS, 2023, 23 (24)
  • [24] Cylinder object 6-DOF pose estimation via single perspective circle on cylindrical surface
    Yu, Aidi
    Wang, Yujia
    Guo, Bing
    Li, Haoyuan
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2025, 36 (01)
  • [25] 6-DoF grasp pose estimation based on instance reconstruction
    Han, Huiyan
    Wang, Wenjun
    Han, Xie
    Yang, Xiaowen
    INTELLIGENT SERVICE ROBOTICS, 2024, 17 (02) : 251 - 264
  • [26] 6-DoF grasp pose estimation based on instance reconstruction
    Huiyan Han
    Wenjun Wang
    Xie Han
    Xiaowen Yang
    Intelligent Service Robotics, 2024, 17 : 251 - 264
  • [27] Robust 6-DoF Pose Estimation under Hybrid Constraints
    Ren, Hong
    Lin, Lin
    Wang, Yanjie
    Dong, Xin
    SENSORS, 2022, 22 (22)
  • [28] NEMA: 6-DoF Pose Estimation Dataset for Deep Learning
    Roman, Philippe Perez de San
    Desbarats, Pascal
    Domenger, Jean-Philippe
    Buendia, Axel
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 4, 2022, : 682 - 690
  • [29] Map-aided 6-DOF Relative Pose Estimation for Monocular SLAM using Sparse Information Filters
    Wang, Zhan
    Dissanayake, Gamini
    11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2010), 2010, : 1006 - 1011
  • [30] Multi-Modal Pose Representations for 6-DOF Object Tracking
    Majcher, Mateusz
    Kwolek, Bogdan
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2024, 110 (04)