Siamese Convolutional Neural Network for Sub-millimeter-accurate Camera Pose Estimation and Visual Servoing

被引:0
|
作者
Yu, Cunjun [1 ]
Cai, Zhongang [1 ]
Hung Pham [2 ]
Quang-Cuong Pham [1 ]
机构
[1] Nanyang Technol Univ, Sch Mech & Aerosp Engn, Singapore, Singapore
[2] Eureka Robot, Singapore, Singapore
关键词
D O I
10.1109/iros40897.2019.8967925
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual Servoing (VS), where images taken from a camera typically attached to the robot end-effector are used to guide the robot motions, is an important technique to tackle robotic tasks that require a high level of accuracy. We propose a new neural network, based on a Siamese architecture, for highly accurate camera pose estimation. This, in turn, can be used as a final refinement step following a coarse VS or, if applied in an iterative manner, as a standalone VS on its own. The key feature of our neural network is that it outputs the relative pose between any pair of images, and does so with sub-millimeter accuracy. We show that our network can reduce pose estimation errors to 0.6 mm in translation and 0.4 degrees in rotation, from initial errors of 10 mm / 5 degrees if applied once, or of several cm / tens of degrees if applied iteratively. The network can generalize to similar objects, is robust against changing lighting conditions, and to partial occlusions (when used iteratively). The high accuracy achieved enables tackling low-tolerance assembly tasks downstream: using our network, an industrial robot can achieve 97.5% success rate on a VGA-connector insertion task without any force sensing mechanism.
引用
收藏
页码:935 / 941
页数:7
相关论文
共 50 条
  • [1] SITPOSE: A SIAMESE CONVOLUTIONAL TRANSFORMER FOR RELATIVE CAMERA POSE ESTIMATION
    Leng, Kai
    Yang, Cong
    Sui, Wei
    Liu, Jie
    Li, Zhijun
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1871 - 1876
  • [2] EA-CTFVS: An Environment-Agnostic Coarse-to-Fine Visual Servoing Method for Sub-Millimeter-Accurate Assembly
    Bai, Yuxuan
    Dong, Mingshuai
    Wei, Shimin
    Yu, Xiuli
    ACTUATORS, 2024, 13 (08)
  • [3] Head Pose Estimation for an Omnidirectional Camera using a Convolutional Neural Network
    Yamaura, Yusuke
    Tsuboshita, Yukihiro
    Onishi, Takeshi
    PROCEEDINGS 2018 IEEE 13TH IMAGE, VIDEO, AND MULTIDIMENSIONAL SIGNAL PROCESSING WORKSHOP (IVMSP), 2018,
  • [4] CONVOLUTIONAL NEURAL NETWORK FOR CAMERA POSE ESTIMATION FROM OBJECT DETECTIONS
    Shalnov, E. V.
    Konushin, A. S.
    INTERNATIONAL WORKSHOP PHOTOGRAMMETRIC AND COMPUTER VISION TECHNIQUES FOR VIDEO SURVEILLANCE, BIOMETRICS AND BIOMEDICINE, 2017, 42-2 (W4): : 1 - 6
  • [5] Camera relative pose estimation for visual servoing using quaternions
    Fathian, Kaveh
    Jin, Jingfu
    Wee, Sung-Gil
    Lee, Dong-Ha
    Kim, Yoon-Gu
    Gans, Nicholas R.
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 107 : 45 - 62
  • [6] Adaptive neural network control for visual servoing of underwater vehicles with pose estimation
    Jian Gao
    Puguo Wu
    Bo Yang
    Fei Xia
    Journal of Marine Science and Technology, 2017, 22 : 470 - 478
  • [7] Adaptive neural network control for visual servoing of underwater vehicles with pose estimation
    Gao, Jian
    Wu, Puguo
    Yang, Bo
    Xia, Fei
    JOURNAL OF MARINE SCIENCE AND TECHNOLOGY, 2017, 22 (03) : 470 - 478
  • [8] Robotic Visual Servoing Based on Convolutional Neural Network
    Liu, Jingshu
    Li, Yuan
    Yang, Renxing
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 2245 - 2250
  • [9] Camera pose estimation by an artificial neural network
    Benton, Ryan G.
    Chu, Chee-hung Henry
    NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2006, 4233 : 604 - 611
  • [10] CAMERA POSE ESTIMATION USING VISUAL SERVOING FOR AERIAL VIDEO CHANGE DETECTION
    Bourdis, Nicolas
    Marraud, Denis
    Sahbi, Hichem
    2012 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2012, : 3459 - 3462