An end-to-end stereo matching algorithm based on improved convolutional neural network

被引:3
|
作者
Liu, Yan [1 ]
Lv, Bingxue [1 ]
Wang, Yuheng [1 ]
Huang, Wei [1 ]
机构
[1] Zhengzhou Univ Light Ind, Coll Comp & Commun Engn, Zhengzhou 45000, Peoples R China
基金
中国国家自然科学基金;
关键词
image sensor; stereo matching; binocular vision; convolutional neural network; SHAPE MEASUREMENT;
D O I
10.3934/mbe.2020396
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Deep end-to-end learning based stereo matching methods have achieved great success as witnessed by the leaderboards across different benchmarking datasets. Depth information in stereo vision systems are obtained by a dense and accurate disparity map, which is computed by a robust stereo matching algorithm. However, previous works adopt network layer with the same size to train the feature parameters and get an unsatisfactory efficiency, which cannot be satisfied for the real scenarios by existing methods. In this paper, we present an end-to-end stereo matching algorithm based on "downsize" convolutional neural network (CNN) for autonomous driving scenarios. Firstly, the road images are feed into the designed CNN to get the depth information. And then the "downsize" full-connection layer combined with subsequent network optimization is employed to improve the accuracy of the algorithm. Finally, the improved loss function is utilized to approximate the similarity of positive and negative samples in a more relaxed constraint to improve the matching effect of the output. The loss function error of the proposed method for KITTI 2012 and KITTI 2015 datasets are reduced to 2.62 and 3.26% respectively, which also reduces the runtime of the proposed algorithm. Experimental results illustrate that the proposed end-to-end algorithm can obtain a dense disparity map and the corresponding depth information can be used for the binocular vision system in autonomous driving scenarios. In addition,our method also achieves better performance when the size of the network is compressed compared with previous methods.
引用
收藏
页码:7787 / 7803
页数:17
相关论文
共 50 条
  • [31] End-to-End Speech Emotion Recognition Based on One-Dimensional Convolutional Neural Network
    Gao, Mengna
    Dong, Jing
    Zhou, Dongsheng
    Zhang, Qiang
    Yang, Deyun
    [J]. 3RD INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE (ICIAI 2019), 2019, : 78 - 82
  • [32] A novel end-to-end deep convolutional neural network based skin lesion classification framework
    A, Razia Sulthana
    Chamola, Vinay
    Hussain, Zain
    Albalwy, Faisal
    Hussain, Amir
    [J]. Expert Systems with Applications, 2024, 246
  • [33] End-to-End Pedestrian Collision Warning System Based on a Convolutional Neural Network with Semantic Segmentation
    Jung, Heechul
    Choi, Min-Kook
    Soon, Kwon
    Jung, Woo Young
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2018,
  • [34] End-to-end cubic phase signal recovery method based on deep convolutional neural network
    Li, Kang
    Jiu, Bo
    Liu, Hongwei
    Lu, Ruiying
    [J]. IET RADAR SONAR AND NAVIGATION, 2020, 14 (01): : 110 - 117
  • [35] End-To-End Convolutional Neural Network Model for Gear Fault Diagnosis Based on Sound Signals
    Yao, Yong
    Wang, Honglei
    Li, Shaobo
    Liu, Zhonghao
    Gui, Gui
    Dan, Yabo
    Hu, Jianjun
    [J]. APPLIED SCIENCES-BASEL, 2018, 8 (09):
  • [36] A novel end-to-end deep convolutional neural network based skin lesion classification framework
    Sulthana, Razia
    Chamola, Vinay
    Hussain, Zain
    Albalwy, Faisal
    Hussain, Amir
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 246
  • [37] Stereo Matching Algorithm Based on Three-Dimensional Convolutional Neural Network
    Wang Yufeng
    Wang Hongwei
    Yu Guang
    Yang Mingquan
    Yuan Yuwei
    Quan Jicheng
    [J]. ACTA OPTICA SINICA, 2019, 39 (11)
  • [38] End-to-End Learning for Omnidirectional Stereo Matching With Uncertainty Prior
    Won, Changhee
    Ryu, Jongbin
    Lim, Jongwoo
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (11) : 3850 - 3862
  • [39] Task-Adaptive End-to-End Networks for Stereo Matching
    Li T.
    Ma W.
    Xu S.
    Zhang X.
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (07): : 1531 - 1538
  • [40] WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-End Speech Enhancement
    Hsieh, Tsun-An
    Wang, Hsin-Min
    Lu, Xugang
    Tsao, Yu
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2020, 27 (27) : 2149 - 2153