Deep Learning on Monocular Object Pose Detection and Tracking: A Comprehensive Overview

Cited by: 0
Authors
Fan, Zhaoxin [1]
Zhu, Yazhi [2]
He, Yulin [1]
Sun, Qi [1]
Liu, Hongyan [3]
He, Jun [1]
Affiliations
[1] Renmin Univ China, Sch Informat, Key Lab Data Engn & Knowledge Engn MOE, 59 Zhongguancun St, Beijing 100872, Peoples R China
[2] Beijing Jiaotong Univ, Inst Informat Sci, 3 Shangyuancun, Beijing, Peoples R China
[3] Tsinghua Univ, Sch Econ & Management, Beijing 100084, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Object pose detection; object pose tracking; instance-level; category-level; monocular; augmented reality
DOI
10.1145/3524496
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Object pose detection and tracking has recently attracted increasing attention because of its wide applications in areas such as autonomous driving, robotics, and augmented reality. Among methods for object pose detection and tracking, deep learning is the most promising and has shown better performance than the alternatives. However, a survey of the latest developments in deep learning-based methods is lacking. This study therefore presents a comprehensive review of recent progress in deep learning-based object pose detection and tracking. To allow a more thorough treatment, the scope is limited to methods that take monocular RGB/RGBD data as input, covering three major tasks: instance-level monocular object pose detection, category-level monocular object pose detection, and monocular object pose tracking. Metrics, datasets, and methods for both detection and tracking are presented in detail. Comparative results of current state-of-the-art methods on several publicly available datasets are also presented, together with observations and future research directions.
Pages: 40