MagicVO: An End-to-End Hybrid CNN and Bi-LSTM Method for Monocular Visual Odometry

被引:19
|
作者
Jiao, Jichao [1 ]
Jiao, Jian [1 ]
Mo, Yaokai [1 ]
Liu, Weilun [1 ]
Deng, Zhongliang [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Elect Engn, Beijing 100876, Peoples R China
来源
IEEE ACCESS | 2019年 / 7卷
关键词
Unmanned vehicle; visual odometry; bi-directional LSTM; convolutional neural network; VERSATILE; FILTER;
D O I
10.1109/ACCESS.2019.2926350
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For the robotic positioning and navigation, visual odometry (VO) system is widely used. However, the errors of the traditional VO accumulate when the robot moves. Besides, this paper proposes a new framework to solve the problem of monocular VO, called MagicVO. Based on the convolutional neural network (CNN) and the bi-directional LSTM (Bi-LSTM), MagicVO outputs a 6-DoF absolute-scale pose at each position of the camera with a sequence of continuous monocular images as input. It does not only utilize the outstanding performance of CNN in extracting the rich features of image frames fully but also learns the geometric relationship from image sequences pre and post through Bi-LSTM to get a more accurate prediction. A pipeline of the MagicVO is shown in this paper. The MagicVO is an end-to-end system, and the results of the experiments on the KITTI and ETH datasets show that MagicVO has a better performance than the traditional VO systems in the accuracy of pose and the generalization ability.
引用
收藏
页码:94118 / 94127
页数:10
相关论文
共 50 条
  • [1] An End-to-end Network for Monocular Visual Odometry Based on Image Sequence
    Yao, Mingwei
    Quan, Hongyan
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [2] A Bi-LSTM memory network for end-to-end goal-oriented dialog learning
    Kim, Byoungjae
    Chung, KyungTae
    Lee, Jeongpil
    Seo, Jungyun
    Koo, Myoung-Wan
    [J]. COMPUTER SPEECH AND LANGUAGE, 2019, 53 : 217 - 230
  • [3] End-to-end Answer Selection via Attention-Based Bi-LSTM Network
    Ren, Yuqi
    Zhang, Tongxuan
    Liu, Xikai
    Lin, Hongfei
    [J]. PROCEEDINGS OF 2018 1ST IEEE INTERNATIONAL CONFERENCE ON HOT INFORMATION-CENTRIC NETWORKING (HOTICN 2018), 2018, : 264 - 265
  • [4] End-to-End Trajectory Transportation Mode Classification Using Bi-LSTM Recurrent Neural Network
    Liu, Hongbin
    Lee, Ickjai
    [J]. 2017 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (IEEE ISKE), 2017,
  • [5] An End-to-End Multi-Channel Convolutional Bi-LSTM Network for Automatic Sleep Stage Detection
    Toma, Tabassum Islam
    Choi, Sunwoong
    [J]. SENSORS, 2023, 23 (10)
  • [6] Towards End-to-end Learning of Visual Inertial Odometry with an EKF
    Li, Chunshang
    Waslander, Steven L.
    [J]. 2020 17TH CONFERENCE ON COMPUTER AND ROBOT VISION (CRV 2020), 2020, : 190 - 197
  • [7] Geometric Consistency for Self-Supervised End-to-End Visual Odometry
    Iyer, Ganesh
    Murthy, J. Krishna
    Gupta, Gunshi
    Krishna, K. Madhava
    Paull, Liam
    [J]. PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 380 - 388
  • [8] Smoothing CNN for end-to-end training in visual question answering
    Long, Yu
    Tang, Pengjie
    Wang, Hanli
    Li, Qinyu
    [J]. DEVELOPMENTS OF ARTIFICIAL INTELLIGENCE TECHNOLOGIES IN COMPUTATION AND ROBOTICS, 2020, 12 : 784 - 791
  • [9] End-to-end CNN + LSTM deep learning approach for bearing fault diagnosis
    Amin Khorram
    Mohammad Khalooei
    Mansoor Rezghi
    [J]. Applied Intelligence, 2021, 51 : 736 - 751
  • [10] An End-to-End model based on CNN-LSTM for Industrial Fault Diagnosis and Prognosis
    Yue, Gao
    Ping, Gong
    Li Lanxin
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018, : 274 - 278