FCNet: Stereo 3D Object Detection with Feature Correlation Networks

被引:3
|
作者
Wu, Yingyu [1 ]
Liu, Ziyan [1 ,2 ,3 ]
Chen, Yunlei [1 ]
Zheng, Xuhui [1 ]
Zhang, Qian [1 ]
Yang, Mo [1 ]
Tang, Guangming [3 ]
机构
[1] Guizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Peoples R China
[2] Guizhou Univ, State Key Lab Publ Big Data, Guiyang 550025, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
关键词
3D object detection; deep learning; stereo matching; multi-scale cost-volume; channel similarity; parallel convolutional attention;
D O I
10.3390/e24081121
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Deep-learning techniques have significantly improved object detection performance, especially with binocular images in 3D scenarios. To supervise the depth information in stereo 3D object detection, reconstructing the 3D dense depth of LiDAR point clouds causes higher computational costs and lower inference speed. After exploring the intrinsic relationship between the implicit depth information and semantic texture features of the binocular images, we propose an efficient and accurate 3D object detection algorithm, FCNet, in stereo images. First, we construct a multi-scale cost-volume containing implicit depth information using the normalized dot-product by generating multi-scale feature maps from the input stereo images. Secondly, the variant attention model enhances its global and local description, and the sparse region monitors the depth loss deep regression. Thirdly, for balancing the channel information preservation of the re-fused left-right feature maps and computational burden, a reweighting strategy is employed to enhance the feature correlation in merging the last-layer features of binocular images. Extensive experiment results on the challenging KITTI benchmark demonstrate that the proposed algorithm achieves better performance, including a lower computational cost and higher inference speed in 3D object detection.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Multi-feature Fusion VoteNet for 3D Object Detection
    Wang, Zhoutao
    Xie, Qian
    Wei, Mingqiang
    Long, Kun
    Wang, Jun
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (01)
  • [32] Object Detection Using a Combination of Multiple 3D Feature Descriptors
    Kiforenko, Lilita
    Buch, Anders Glent
    Kruger, Norbert
    COMPUTER VISION SYSTEMS (ICVS 2015), 2015, 9163 : 343 - 353
  • [33] Strong-Weak Feature Alignment for 3D Object Detection
    Wang, Zhiyu
    Wang, Li
    Dai, Bin
    ELECTRONICS, 2021, 10 (10)
  • [34] Feature extraction for 3D object detection using integral imaging
    Aloni, Doron
    Yitzhaky, Yitzhak
    IMAGE RECONSTRUCTION FROM INCOMPLETE DATA VIII, 2015, 9600
  • [35] 3D Probabilistic feature point model for object detection and recognition
    Romdhani, Sami
    Vetter, Thomas
    2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 2407 - +
  • [36] Focal Sparse Convolutional Networks for 3D Object Detection
    Chen, Yukang
    Li, Yanwei
    Zhang, Xiangyu
    Sun, Jian
    Jia, Jiaya
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5418 - 5427
  • [37] PointGAT: Graph attention networks for 3D object detection
    Zhou H.
    Wang W.
    Liu G.
    Zhou Q.
    Intelligent and Converged Networks, 2022, 3 (02): : 204 - 216
  • [38] Silhouette and stereo fusion for 3D object modeling
    Esteban, CH
    Schmitt, F
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2004, 96 (03) : 367 - 392
  • [39] Evaluation of Stereo Algorithms for 3D Object Recognition
    Tombari, Federico
    Gori, Fabio
    Di Stefano, Luigi
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
  • [40] Multi-stereo 3D object reconstruction
    Esteban, CH
    Schmitt, F
    FIRST INTERNATIONAL SYMPOSIUM ON 3D DATA PROCESSING VISUALIZATION AND TRANSMISSION, 2002, : 159 - 166