Realtime RGB-Based 3D Object Pose Detection Using Convolutional Neural Networks

被引:7
|
作者
Liu, Jin [1 ]
He, Sheng [1 ]
Tao, Yiting [1 ]
Liu, Daifei [2 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430072, Peoples R China
[2] Changsha Univ Sci &Technol, Sch Energy & Power Engn, Changsha 410004, Peoples R China
基金
中国国家自然科学基金;
关键词
Quaternions; Three-dimensional displays; Cameras; Training; Object detection; Two dimensional displays; Sensors; 3D pose detection; convolutional neural network;
D O I
10.1109/JSEN.2019.2946279
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a new approach to efficiently detect the 3D pose of objects in images. The approach uses a single neural network we call TQ-Net to predict the translation vector and the quaternion, after which the quaternion is converted to a rotation matrix, and through a projection algorithm, we can project eight points surrounding the object to the image and connect them to form a 3D bounding box. Considering there is a constraint existing in the quaternion, we add a normalization layer to get a more precise result. Experiments show that our approach performs well and can detect 3D poses in real time. Our approach processes images with a resolution of 1280*720 at 61 frames per second on a GTX 1080 GPU.
引用
收藏
页码:11812 / 11819
页数:8
相关论文
共 50 条
  • [1] RGB-D Salient Object Detection via 3D Convolutional Neural Networks
    Chen, Qian
    Liu, Ze
    Zhang, Yi
    Fu, Keren
    Zhao, Qijun
    Du, Hongwei
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1063 - 1071
  • [2] 3D Object Detection Based on Convolutional Neural Networks: A Survey
    Wang, Yadong
    Tian, Yonglin
    Li, Guoqiang
    Wang, Kunfeng
    Li, Dazi
    [J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2021, 34 (12): : 1103 - 1119
  • [3] 3D Pose Regression using Convolutional Neural Networks
    Mahendran, Siddharth
    Ali, Haider
    Vidal, Rene
    [J]. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 494 - 495
  • [4] 3D Pose Regression using Convolutional Neural Networks
    Mahendran, Siddharth
    Ali, Haider
    Vidal, Rene
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 2174 - 2182
  • [5] RGB-based Android Malware Detection and Classification Using Convolutional Neural Network
    Darwaish, Asim
    Nait-Abdesselam, Farid
    [J]. 2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [6] Cascaded Hierarchical CNN for RGB-Based 3D Hand Pose Estimation
    Dai, Shiming
    Liu, Wei
    Yang, Wenji
    Fan, Lili
    Zhang, Jihao
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [7] Pushing the Envelope for RGB-based Dense 3D Hand Pose Estimation via Neural Rendering
    Baek, Seungryul
    Kim, Kwang In
    Kim, Tae-Kyun
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1067 - 1076
  • [8] 3D Human Pose Estimation from RGB plus D Images with Convolutional Neural Networks
    Cai, Yiheng
    Wang, Xueyan
    Kong, Xinran
    [J]. 2018 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND BIOINFORMATICS (ICBEB 2018), 2018, : 64 - 69
  • [9] Pose prediction using 3D deep convolutional neural networks
    Wallach, Izhar
    Dzamba, Michael
    Schrodl, Stefan
    Rampasek, Ladislav
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 254
  • [10] 3-D Convolutional Neural Networks for RGB-D Salient Object Detection and Beyond
    Chen, Qian
    Zhang, Zhenxi
    Lu, Yanye
    Fu, Keren
    Zhao, Qijun
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 4309 - 4323