gvnn: Neural Network Library for Geometric Computer Vision

Cited by: 48

Authors:

Handa, Ankur [1]

Bloesch, Michael [3]

Patraucean, Viorica [2]

Stent, Simon [2]

McCormac, John [1]

Davison, Andrew [1]

Affiliations:

[1] Imperial Coll London, Dept Comp, Dyson Robot Lab, London, England

[2] Univ Cambridge, Dept Engn, Cambridge, England

[3] ETH, Robot Syst Lab, Zurich, Switzerland

Source:

COMPUTER VISION - ECCV 2016 WORKSHOPS, PT III | 2016, Vol. 9915

Keywords:

Spatial transformer networks; Geometric vision; Unsupervised learning

DOI:

10.1007/978-3-319-49409-8_9

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

We introduce gvnn, a neural network library in Torch aimed towards bridging the gap between classic geometric computer vision and deep learning. Inspired by the recent success of Spatial Transformer Networks, we propose several new layers which are often used as parametric transformations on the data in geometric computer vision. These layers can be inserted within a neural network much in the spirit of the original spatial transformers and allow backpropagation to enable end-to-end learning of a network involving any domain knowledge in geometric computer vision. This opens up applications in learning invariance to 3D geometric transformation for place recognition, end-to-end visual odometry, depth estimation and unsupervised learning through warping with a parametric transformation for image reconstruction error.


Pages: 67-82

Number of pages: 16
