SITPOSE: A SIAMESE CONVOLUTIONAL TRANSFORMER FOR RELATIVE CAMERA POSE ESTIMATION

被引:0
|
作者
Leng, Kai [1 ]
Yang, Cong [2 ]
Sui, Wei [3 ]
Liu, Jie [1 ]
Li, Zhijun [1 ,2 ]
机构
[1] Harbin Inst Technol Shenzhen, Sch Comp Sci & Technol, Shenzhen, Peoples R China
[2] Soochow Univ, Sch Future Sci & Engn, Suzhou, Peoples R China
[3] Horizon Robot, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Relative Pose Estimation; SLAM; Camera Pose Estimation; Cross Attention;
D O I
10.1109/ICME55011.2023.00321
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Relative Camera Pose Estimation (RCPE) aims to calculate the translation and rotation between two frames with overlapped regions, which is crucial to computer vision and robotics. This paper presents a novel siamese convolutional transformer model, SiTPose, to regress relative camera pose directly. SiTPose is distinguished in three aspects: (1) With a cross-attention feature extractor and a compact transformer encoder, extreme rotation errors (> 150 degrees) are significantly reduced: from 9.7% with the state-of-the-art 8-Points to 1. on the 7Scenes dataset. (2) SiTPose is also robust to narrow-baseline cases (slight rotation angle and large translation between neighboring frames), while existing RCPE methods mainly focus on wide-baseline cases. (3) SiTPose can be flexibly extended to geometry-based vSLAM (namely SiT-SLAM) in a multi-threaded way to prevent tracking lost and scale ambiguity problems. Results on multiple datasets show that SiT-SLAM yields a marked improvement in robustness and localization accuracy in complex scenarios, e.g., RMSE error is reduced from 26.36m with the classic ORBSLAM3 method to 6.94m on the KITTI-09.
引用
收藏
页码:1871 / 1876
页数:6
相关论文
共 50 条
  • [1] Relative Camera Pose Estimation Using Convolutional Neural Networks
    Melekhov, Iaroslav
    Ylioinas, Juha
    Kannala, Juho
    Rahtu, Esa
    [J]. ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS (ACIVS 2017), 2017, 10617 : 675 - 687
  • [2] Camera localization with Siamese neural networks using iterative relative pose estimation
    Kim, Daewoon
    Ko, Kwanghee
    [J]. JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2022, 9 (04) : 1482 - 1497
  • [3] Siamese Convolutional Neural Network for Sub-millimeter-accurate Camera Pose Estimation and Visual Servoing
    Yu, Cunjun
    Cai, Zhongang
    Hung Pham
    Quang-Cuong Pham
    [J]. 2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 935 - 941
  • [4] Camera calibration and relative pose estimation from gravity
    Sturm, PF
    Quan, L
    [J]. 15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000, : 72 - 75
  • [5] Globally Optimal Relative Pose Estimation for Camera on a Selfie Stick
    Joo, Kyungdon
    Li, Hongdong
    Oh, Tae-Hyun
    Bok, Yunsu
    Kweon, In So
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 4983 - 4989
  • [6] Approximated Relative Pose Solvers for Efficient Camera Motion Estimation
    Ventura, Jonathan
    Arth, Clemens
    Lepetit, Vincent
    [J]. COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 180 - 193
  • [7] RELATIVE CAMERA POSE ESTIMATION METHOD USING OPTIMIZATION ON THE MANIFOLD
    Cheng, Chuanqi
    Hao, Xiangyang
    Li, Jiansheng
    [J]. ISPRS HANNOVER WORKSHOP: HRIGI 17 - CMRT 17 - ISA 17 - EUROCOW 17, 2017, 42-1 (W1): : 41 - 46
  • [8] Camera relative pose estimation for visual servoing using quaternions
    Fathian, Kaveh
    Jin, Jingfu
    Wee, Sung-Gil
    Lee, Dong-Ha
    Kim, Yoon-Gu
    Gans, Nicholas R.
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 107 : 45 - 62
  • [9] Head Pose Estimation for an Omnidirectional Camera using a Convolutional Neural Network
    Yamaura, Yusuke
    Tsuboshita, Yukihiro
    Onishi, Takeshi
    [J]. PROCEEDINGS 2018 IEEE 13TH IMAGE, VIDEO, AND MULTIDIMENSIONAL SIGNAL PROCESSING WORKSHOP (IVMSP), 2018,
  • [10] CONVOLUTIONAL NEURAL NETWORK FOR CAMERA POSE ESTIMATION FROM OBJECT DETECTIONS
    Shalnov, E. V.
    Konushin, A. S.
    [J]. INTERNATIONAL WORKSHOP PHOTOGRAMMETRIC AND COMPUTER VISION TECHNIQUES FOR VIDEO SURVEILLANCE, BIOMETRICS AND BIOMEDICINE, 2017, 42-2 (W4): : 1 - 6