A Comprehensive Review on 3D Object Detection and 6D Pose Estimation With Deep Learning

被引:26
|
作者
Hoque, Sabera [1 ]
Arafat, Md. Yasir [1 ]
Xu, Shuxiang [1 ]
Maiti, Ananda [1 ]
Wei, Yuchen [1 ]
机构
[1] Univ Tasmania, Sch Informat & Commun Technol, Newnham, Tas 7248, Australia
关键词
Three-dimensional displays; Object detection; Pose estimation; Laser radar; Cameras; Visualization; Automobiles; Machine learning; deep neural network; computer vision; image processing; convolutional neural network; 3D object detection; 6D pose estimation; NEURAL-NETWORKS; IMAGE FEATURES; RECOGNITION; REPRESENTATION; LOCALIZATION; TRACKING; SEGMENTATION;
D O I
10.1109/ACCESS.2021.3114399
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, computer vision with 3D (dimension) object detection and 6D (degree of freedom) pose assumptions are widely discussed and studied in the field. In the 3D object detection process, classifications are centered on the object's size, position, and direction. And in 6D pose assumptions, networks emphasize 3D translation and rotation vectors. Successful application of these strategies can have a huge impact on various machine learning-based applications, including the autonomous vehicles, the robotics industry, and the augmented reality sector. Although extensive work has been done on 3D object detection with a pose assumption from RGB images, the challenges have not been fully resolved. Our analysis provides a comprehensive review of the proposed contemporary techniques for complete 3D object detection and the recovery of 6D pose assumptions of an object. In this review research paper, we have discussed several proposed sophisticated methods in 3D object detection and 6D pose estimation, including some popular data sets, evaluation matrix, and proposed method challenges. Most importantly, this study makes an effort to offer some possible future directions in 3D object detection and 6D pose estimation. We accept the autonomous vehicle as the sample case for this detailed review. Finally, this review provides a complete overview of the latest in-depth learning-based research studies related to 3D object detection and 6D pose estimation systems and points out a comparison between some popular frameworks. To be more concise, we propose a detailed summary of the state-of-the-art techniques of modern deep learning-based object detection and pose estimation models.
引用
收藏
页码:143746 / 143770
页数:25
相关论文
共 50 条
  • [11] Rigidity-Aware Detection for 6D Object Pose Estimation
    Hai, Yang
    Song, Rui
    Li, Jiaojiao
    Salzmann, Mathieu
    Hu, Yinlin
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 8927 - 8936
  • [12] Robust 6D Object Pose Estimation by Learning RGB-D Features
    Tian, Meng
    Pan, Liang
    Ang, Marcelo H., Jr.
    Lee, Gim Hee
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 6218 - 6224
  • [13] Joint Optimization of the 3D Model and 6D Pose for Monocular Pose Estimation
    Guo, Liangchao
    Chen, Lin
    Wang, Qiufu
    Zhang, Zhuo
    Sun, Xiaoliang
    Drones, 2024, 8 (11)
  • [14] Single Shot 6D Object Pose Estimation
    Kleeberger, Kilian
    Huber, Marco F.
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 6239 - 6245
  • [15] BOP: Benchmark for 6D Object Pose Estimation
    Hodan, Tomas
    Michel, Frank
    Brachmann, Eric
    Kehl, Wadim
    Buch, Anders Glent
    Kraft, Dirk
    Drost, Bertram
    Vidal, Joel
    Ihrke, Stephan
    Zabulis, Xenophon
    Sahin, Caner
    Manhardt, Fabian
    Tombari, Federico
    Kim, Tae-Kyun
    Matas, Jiri
    Rother, Carsten
    COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 19 - 35
  • [16] 3D Object Detection and 6D Pose Estimation Using RGB-D Images and Mask R-CNN
    Tran, Van Luan
    Lin, Huei-Yung
    2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,
  • [17] Survey on 6D Pose Estimation of Rigid Object
    Chen, Jiale
    Zhang, Lijun
    Liu, Yi
    Xu, Chi
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7440 - 7445
  • [18] Unsupervised Joint 3D Object Model Learning and 6D Pose Estimation for Depth-Based Instance Segmentation
    Wu, Yuanwei
    Marks, Tim K.
    Cherian, Anoop
    Chen, Siheng
    Feng, Chen
    Wang, Guanghui
    Sullivan, Alan
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2777 - 2786
  • [19] A review on object pose recovery: From 3D bounding box detectors to full 6D pose estimators
    Sahin, Caner
    Garcia-Hernando, Guillermo
    Sock, Juil
    Kim, Tae-Kyun
    IMAGE AND VISION COMPUTING, 2020, 96
  • [20] Augmented Autoencoders: Implicit 3D Orientation Learning for 6D Object Detection
    Martin Sundermeyer
    Zoltan-Csaba Marton
    Maximilian Durner
    Rudolph Triebel
    International Journal of Computer Vision, 2020, 128 : 714 - 729