An end-to-end stereo matching algorithm based on improved convolutional neural network

被引：3

作者：

Liu, Yan ^{[1
]}

Lv, Bingxue ^{[1
]}

Wang, Yuheng ^{[1
]}

Huang, Wei ^{[1
]}

机构：

[1] Zhengzhou Univ Light Ind, Coll Comp & Commun Engn, Zhengzhou 45000, Peoples R China

来源：

MATHEMATICAL BIOSCIENCES AND ENGINEERING | 2020年 / 17卷 / 06期

基金：

中国国家自然科学基金;

关键词：

image sensor; stereo matching; binocular vision; convolutional neural network; SHAPE MEASUREMENT;

D O I：

10.3934/mbe.2020396

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Deep end-to-end learning based stereo matching methods have achieved great success as witnessed by the leaderboards across different benchmarking datasets. Depth information in stereo vision systems are obtained by a dense and accurate disparity map, which is computed by a robust stereo matching algorithm. However, previous works adopt network layer with the same size to train the feature parameters and get an unsatisfactory efficiency, which cannot be satisfied for the real scenarios by existing methods. In this paper, we present an end-to-end stereo matching algorithm based on "downsize" convolutional neural network (CNN) for autonomous driving scenarios. Firstly, the road images are feed into the designed CNN to get the depth information. And then the "downsize" full-connection layer combined with subsequent network optimization is employed to improve the accuracy of the algorithm. Finally, the improved loss function is utilized to approximate the similarity of positive and negative samples in a more relaxed constraint to improve the matching effect of the output. The loss function error of the proposed method for KITTI 2012 and KITTI 2015 datasets are reduced to 2.62 and 3.26% respectively, which also reduces the runtime of the proposed algorithm. Experimental results illustrate that the proposed end-to-end algorithm can obtain a dense disparity map and the corresponding depth information can be used for the binocular vision system in autonomous driving scenarios. In addition,our method also achieves better performance when the size of the network is compressed compared with previous methods.

引用

页码：7787 / 7803

页数：17

共 50 条

[31] End-to-End Speech Emotion Recognition Based on One-Dimensional Convolutional Neural Network
Gao, Mengna
Dong, Jing
Zhou, Dongsheng
Zhang, Qiang
Yang, Deyun
[J]. 3RD INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE (ICIAI 2019), 2019, : 78 - 82
[32] A novel end-to-end deep convolutional neural network based skin lesion classification framework
A, Razia Sulthana
Chamola, Vinay
Hussain, Zain
Albalwy, Faisal
Hussain, Amir
[J]. Expert Systems with Applications, 2024, 246
[33] End-to-End Pedestrian Collision Warning System Based on a Convolutional Neural Network with Semantic Segmentation
Jung, Heechul
Choi, Min-Kook
Soon, Kwon
Jung, Woo Young
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2018,
[34] End-to-end cubic phase signal recovery method based on deep convolutional neural network
Li, Kang
Jiu, Bo
Liu, Hongwei
Lu, Ruiying
[J]. IET RADAR SONAR AND NAVIGATION, 2020, 14 (01): : 110 - 117
[35] End-To-End Convolutional Neural Network Model for Gear Fault Diagnosis Based on Sound Signals
Yao, Yong
Wang, Honglei
Li, Shaobo
Liu, Zhonghao
Gui, Gui
Dan, Yabo
Hu, Jianjun
[J]. APPLIED SCIENCES-BASEL, 2018, 8 (09):
[36] A novel end-to-end deep convolutional neural network based skin lesion classification framework
Sulthana, Razia
Chamola, Vinay
Hussain, Zain
Albalwy, Faisal
Hussain, Amir
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 246
[37] Stereo Matching Algorithm Based on Three-Dimensional Convolutional Neural Network
Wang Yufeng
Wang Hongwei
Yu Guang
Yang Mingquan
Yuan Yuwei
Quan Jicheng
[J]. ACTA OPTICA SINICA, 2019, 39 (11)
[38] End-to-End Learning for Omnidirectional Stereo Matching With Uncertainty Prior
Won, Changhee
Ryu, Jongbin
Lim, Jongwoo
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (11) : 3850 - 3862
[39] Task-Adaptive End-to-End Networks for Stereo Matching
Li T.
Ma W.
Xu S.
Zhang X.
[J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (07): : 1531 - 1538
[40] WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-End Speech Enhancement
Hsieh, Tsun-An
Wang, Hsin-Min
Lu, Xugang
Tsao, Yu
[J]. IEEE SIGNAL PROCESSING LETTERS, 2020, 27 (27) : 2149 - 2153

← 1 2 3 4 5 →