Joint Detection of RGB-D Images Based on Double Flow Convolutional Neural Network

被引:5
|
作者
Liu fan [1 ]
Liu Pengyuan [1 ]
Zhang Junning [1 ]
Xu Binbin [1 ]
机构
[1] Mech Engn Coll, Shijiazhuang 050003, Hebei, Peoples R China
关键词
machine vision; RGB-D; convolutional neural network; multimodal information; joint detection; depth learning;
D O I
10.3788/LOP55.021503
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The convolutional neural network structure fails to consider the independence and correlation between RGH images and depth images fully, so its detection is not high. A new double flow convolution network is proposed for the joint detection of RGH-D images. The RGH image and depth image are inputted to the two convolutional networks and the two networks have the same structure and weight sharing. After several convolutions, the independent features arc extracted. According to the optimal weights in the convolution layer, the two convolutional networks are fused. The fused features are extracted continuously using convolution kernels, and the output is obtained by full connection layer finally. When the detection time is similar, the detection accuracy and the success rate are increased by 4. 1% and 3. 5% respectively, compared with the previous early and late fusion methods.
引用
收藏
页数:9
相关论文
共 16 条
  • [1] Cai Qiang, 2016, Journal of System Simulation, V28, P2260
  • [2] Chen CH, 2011, CHIN CONT DECIS CONF, P4017, DOI 10.1109/CCDC.2011.5968926
  • [3] Coupric C, ARXIV13013572V2
  • [4] Eitel A, 2015, IEEE INT C INT ROBOT, P681, DOI 10.1109/IROS.2015.7353446
  • [5] Gupta S, 2011, EUR C COMP VIS, V8695, P345
  • [6] Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images
    Gupta, Saurabh
    Arbelaez, Pablo
    Malik, Jitendra
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 564 - 571
  • [7] Caffe: Convolutional Architecture for Fast Feature Embedding
    Jia, Yangqing
    Shelhamer, Evan
    Donahue, Jeff
    Karayev, Sergey
    Long, Jonathan
    Girshick, Ross
    Guadarrama, Sergio
    Darrell, Trevor
    [J]. PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 675 - 678
  • [8] Lai K, 2011, IEEE INT CONF ROBOT, P1817
  • [9] Lu H T, 2016, DATA ACQUISITION PRO, V31, P1
  • [10] Lu L. F., 2015, COMPUTER ENG, V42, P187