Siamese Convolutional Neural Network for Sub-millimeter-accurate Camera Pose Estimation and Visual Servoing

Cited by: 0
Authors
Yu, Cunjun [1]
Cai, Zhongang [1]
Hung Pham [2]
Quang-Cuong Pham [1]
Affiliations
[1] Nanyang Technol Univ, Sch Mech & Aerosp Engn, Singapore, Singapore
[2] Eureka Robot, Singapore, Singapore
Keywords
DOI
10.1109/iros40897.2019.8967925
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Visual Servoing (VS), in which images from a camera, typically attached to the robot end-effector, are used to guide the robot's motions, is an important technique for robotic tasks that require a high level of accuracy. We propose a new neural network, based on a Siamese architecture, for highly accurate camera pose estimation. This, in turn, can be used as a final refinement step following a coarse VS or, if applied iteratively, as a standalone VS. The key feature of our neural network is that it outputs the relative pose between any pair of images, and does so with sub-millimeter accuracy. We show that our network can reduce pose estimation errors to 0.6 mm in translation and 0.4 degrees in rotation, from initial errors of 10 mm / 5 degrees if applied once, or of several cm / tens of degrees if applied iteratively. The network can generalize to similar objects and is robust to changing lighting conditions and, when used iteratively, to partial occlusions. The high accuracy achieved enables low-tolerance assembly tasks downstream: using our network, an industrial robot can achieve a 97.5% success rate on a VGA-connector insertion task without any force-sensing mechanism.
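The iterative use described in the abstract (estimate the relative pose, move the camera by that estimate, repeat) can be illustrated with a minimal sketch. This is not the authors' implementation: `estimate_relative_pose` is a hypothetical stand-in that reads ground-truth poses and returns a deliberately imperfect correction (controlled by an assumed `gain`), whereas the real network would take a pair of images. Rotation is restricted to yaw for brevity, and the 0.6 mm / 0.4 degree figures from the abstract are reused here only as assumed stopping thresholds.

```python
import numpy as np

def make_pose(translation_mm, yaw_deg):
    """Build a 4x4 homogeneous transform: translation plus rotation about z."""
    yaw = np.deg2rad(yaw_deg)
    c, s = np.cos(yaw), np.sin(yaw)
    T = np.eye(4)
    T[:3, :3] = [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]
    T[:3, 3] = translation_mm
    return T

def pose_error(current, target):
    """Return (translation error in mm, rotation error in degrees)."""
    delta = np.linalg.inv(current) @ target
    trans_err = np.linalg.norm(delta[:3, 3])
    cos_angle = np.clip((np.trace(delta[:3, :3]) - 1.0) / 2.0, -1.0, 1.0)
    return trans_err, np.rad2deg(np.arccos(cos_angle))

def estimate_relative_pose(current, target, gain=0.8):
    """Hypothetical stand-in for the network (assumption, not the paper's model).

    It computes the true relative pose and scales it by `gain`, so each
    estimate is imperfect and a single application leaves a residual error.
    """
    delta = np.linalg.inv(current) @ target
    yaw_deg = np.rad2deg(np.arctan2(delta[1, 0], delta[0, 0]))
    return make_pose(gain * delta[:3, 3], gain * yaw_deg)

def iterative_servo(start, target, tol_mm=0.6, tol_deg=0.4, max_iters=20):
    """Apply estimated corrections until the estimated step itself is small."""
    current = start.copy()
    for steps in range(max_iters):
        step = estimate_relative_pose(current, target)
        step_mm, step_deg = pose_error(np.eye(4), step)
        if step_mm < tol_mm and step_deg < tol_deg:
            return current, steps
        current = current @ step  # move the camera by the estimated pose
    return current, max_iters

start = np.eye(4)
target = make_pose([20.0, -15.0, 10.0], 20.0)  # a few cm and tens of degrees away
final, steps = iterative_servo(start, target)
trans_err, rot_err = pose_error(final, target)
print(f"steps={steps}, error={trans_err:.3f} mm, {rot_err:.3f} deg")
```

The loop stops on the size of the estimated correction rather than on the true error, since a real servo loop has no access to ground truth; the true error is computed at the end only to check the result.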
Pages: 935-941
Number of pages: 7