gvnn: Neural Network Library for Geometric Computer Vision

被引:48
|
作者
Handa, Ankur [1 ]
Bloesch, Michael [3 ]
Patraucean, Viorica [2 ]
Stent, Simon [2 ]
McCormac, John [1 ]
Davison, Andrew [1 ]
机构
[1] Imperial Coll London, Dept Comp, Dyson Robot Lab, London, England
[2] Univ Cambridge, Dept Engn, Cambridge, England
[3] ETH, Robot Syst Lab, Zurich, Switzerland
关键词
Spatial transformer networks; Geometric vision; Unsupervised learning;
D O I
10.1007/978-3-319-49409-8_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce gvnn, a neural network library in Torch aimed towards bridging the gap between classic geometric computer vision and deep learning. Inspired by the recent success of Spatial Transformer Networks, we propose several new layers which are often used as parametric transformations on the data in geometric computer vision. These layers can be inserted within a neural network much in the spirit of the original spatial transformers and allow backpropagation to enable end-to-end learning of a network involving any domain knowledge in geometric computer vision. This opens up applications in learning invariance to 3D geometric transformation for place recognition, end-to-end visual odometry, depth estimation and unsupervised learning through warping with a parametric transformation for image reconstruction error.
引用
收藏
页码:67 / 82
页数:16
相关论文
共 50 条
  • [41] The neural network for vision
    不详
    VISION RESEARCH, 1996, 36 : 366 - 366
  • [42] Neural network and vision
    Lelasseux, P.
    Caron, A.
    Mathematical Modelling and Scientific Computing, 1993, 2 (sectioA):
  • [43] New geometric techniques in computer vision - Preface
    Lasenby, J
    Zisserman, A
    Cipolla, R
    Longuet-Higgins, HC
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 1998, 356 (1740): : 1065 - 1065
  • [44] CellTypeGraph: A New Geometric Computer Vision Benchmark
    Cerrone, Lorenzo
    Vijayan, Athul
    Mody, Tejasvinee
    Schneitz, Kay
    Hamprecht, Fred A.
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 20865 - 20875
  • [45] SeLibCV: A Service Library for Computer Vision Researchers
    Tafti, Ahmad P.
    Hassannia, Hamid
    Piziak, Dee
    Yu, Zeyun
    ADVANCES IN VISUAL COMPUTING, PT II (ISVC 2015), 2015, 9475 : 542 - 553
  • [46] ChainerCV: a Library for Deep Learning in Computer Vision
    Niitani, Yusuke
    Ogawa, Toru
    Saito, Shunta
    Saito, Masaki
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1217 - 1220
  • [47] MinGPU: a minimum GPU library for computer vision
    Pavel Babenko
    Mubarak Shah
    Journal of Real-Time Image Processing, 2008, 3 : 255 - 268
  • [48] MinGPU: a minimum GPU library for computer vision
    Babenko, Pavel
    Shah, Mubarak
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2008, 3 (04) : 255 - 268
  • [49] Computer vision-based object-recognition by a Hopfield neural network
    Chen, Ching-Shun
    Chen, Mao-Lin
    Zhongguo Hangkong Taikong Xuehui Huikan/Transactions of the Aeronautical and Astronautical Society of the Republic of China, 2002, 34 (01): : 31 - 37
  • [50] A Fruit Quality Classification Algorithm Based on BP Neural Network and Computer Vision
    Qiang, Hequn
    Qian, Chunhua
    Ren, Yi
    PROCEEDINGS OF THE 2018 8TH INTERNATIONAL CONFERENCE ON MANAGEMENT, EDUCATION AND INFORMATION (MEICI 2018), 2018, 163 : 998 - 1001