Deep Learning on Monocular Object Pose Detection and Tracking: A Comprehensive Overview

被引:0
|
作者
Fan, Zhaoxin [1 ]
Zhu, Yazhi [2 ]
He, Yulin [1 ]
Sun, Qi [1 ]
Liu, Hongyan [3 ]
He, Jun [1 ]
机构
[1] Renmin Univ China, Sch Informat, Key Lab Data Engn & Knowledge Engn MOE, 59 Zhongguancun St, Beijing 100872, Peoples R China
[2] Beijing Jiaotong Univ, Inst Informat Sci, 3 Shangyuancun, Beijing, Peoples R China
[3] Tsinghua Univ, Sch Econ & Management, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Object pose detection; object pose tracking; instance-level; category-level; monocular; AUGMENTED REALITY;
D O I
10.1145/3524496
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Object pose detection and tracking has recently attracted increasing attention due to its wide applications in many areas, such as autonomous driving, robotics, and augmented reality. Among methods for object pose detection and tracking, deep learning is the most promising one that has shown better performance than others. However, survey study about the latest development of deep learning-based methods is lacking. Therefore, this study presents a comprehensive review of recent progress in object pose detection and tracking that belongs to the deep learning technical route. To achieve a more thorough introduction, the scope of this study is limited to methods taking monocular RGB/RGBD data as input and covering three kinds of major tasks: instance-level monocular object pose detection, category-level monocular object pose detection, and monocular object pose tracking. In our work, metrics, datasets, and methods of both detection and tracking are presented in detail. Comparative results of current state-of-the-art methods on several publicly available datasets are also presented, together with insightful observations and inspiring future research directions.
引用
收藏
页数:40
相关论文
共 50 条
  • [21] Object Detection in Monocular Infrared Images Using Classification - Regresion Deep Learning Architectures
    Brehar, Raluca
    Vancea, Flaviu
    Marita, Tiberiu
    Vancea, Cristian Cosmin
    Nedevschi, Sergiu
    [J]. 2019 IEEE 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP 2019), 2019, : 207 - 212
  • [22] Object Detection from Video Sequences Using Deep Learning: An Overview
    Garg, Dweepna
    Kotecha, Ketan
    [J]. ADVANCED COMPUTING AND COMMUNICATION TECHNOLOGIES, 2018, 562 : 137 - 148
  • [23] Monocular depth estimation based on deep learning: An overview
    ZHAO ChaoQiang
    SUN Qi Yu
    ZHANG ChongZhen
    TANG Yang
    QIAN Feng
    [J]. Science China Technological Sciences, 2020, (09) : 1612 - 1627
  • [24] A hybrid optimisation enabled deep learning for object detection and multi-object tracking
    Thirumalai, J.
    Gomathi, M.
    Sindhu, T. S.
    Kumar, A. Senthil
    Puviarasi, R.
    [J]. INTERNATIONAL JOURNAL OF AD HOC AND UBIQUITOUS COMPUTING, 2024, 46 (03)
  • [25] Monocular depth estimation based on deep learning: An overview
    ZHAO ChaoQiang
    SUN Qi Yu
    ZHANG ChongZhen
    TANG Yang
    QIAN Feng
    [J]. Science China(Technological Sciences), 2020, 63 (09) - 1627
  • [26] Monocular depth estimation based on deep learning: An overview
    ChaoQiang Zhao
    QiYu Sun
    ChongZhen Zhang
    Yang Tang
    Feng Qian
    [J]. Science China Technological Sciences, 2020, 63 : 1612 - 1627
  • [27] Monocular depth estimation based on deep learning: An overview
    Zhao, ChaoQiang
    Sun, QiYu
    Zhang, ChongZhen
    Tang, Yang
    Qian, Feng
    [J]. SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2020, 63 (09) : 1612 - 1627
  • [28] Deep learning in multi-object detection and tracking: state of the art
    Sankar K. Pal
    Anima Pramanik
    J. Maiti
    Pabitra Mitra
    [J]. Applied Intelligence, 2021, 51 : 6400 - 6429
  • [29] Deep Learning based Object Detection and Tracking for Maritime Situational Awareness
    Lahouli, Rihab
    De Cubber, Geert
    Pairet, Benoit
    Hamesse, Charles
    Freville, Timothee
    Haelterman, Rob
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 4, 2022, : 643 - 650
  • [30] Deep learning in multi-object detection and tracking: state of the art
    Pal, Sankar K.
    Pramanik, Anima
    Maiti, J.
    Mitra, Pabitra
    [J]. APPLIED INTELLIGENCE, 2021, 51 (09) : 6400 - 6429