FCNet: Stereo 3D Object Detection with Feature Correlation Networks

被引：3

作者：

Wu, Yingyu ^{[1
]}

Liu, Ziyan ^{[1
,2
,3
]}

Chen, Yunlei ^{[1
]}

Zheng, Xuhui ^{[1
]}

Zhang, Qian ^{[1
]}

Yang, Mo ^{[1
]}

Tang, Guangming ^{[3
]}

机构：

[1] Guizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Peoples R China

[2] Guizhou Univ, State Key Lab Publ Big Data, Guiyang 550025, Peoples R China

[3] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China

来源：

ENTROPY | 2022年 / 24卷 / 08期

关键词：

3D object detection; deep learning; stereo matching; multi-scale cost-volume; channel similarity; parallel convolutional attention;

D O I：

10.3390/e24081121

中图分类号：

O4 [物理学];

学科分类号：

0702 ;

摘要：

Deep-learning techniques have significantly improved object detection performance, especially with binocular images in 3D scenarios. To supervise the depth information in stereo 3D object detection, reconstructing the 3D dense depth of LiDAR point clouds causes higher computational costs and lower inference speed. After exploring the intrinsic relationship between the implicit depth information and semantic texture features of the binocular images, we propose an efficient and accurate 3D object detection algorithm, FCNet, in stereo images. First, we construct a multi-scale cost-volume containing implicit depth information using the normalized dot-product by generating multi-scale feature maps from the input stereo images. Secondly, the variant attention model enhances its global and local description, and the sparse region monitors the depth loss deep regression. Thirdly, for balancing the channel information preservation of the re-fused left-right feature maps and computational burden, a reweighting strategy is employed to enhance the feature correlation in merging the last-layer features of binocular images. Extensive experiment results on the challenging KITTI benchmark demonstrate that the proposed algorithm achieves better performance, including a lower computational cost and higher inference speed in 3D object detection.

引用

页数：17

共 50 条

[31] Multi-feature Fusion VoteNet for 3D Object Detection
Wang, Zhoutao
Xie, Qian
Wei, Mingqiang
Long, Kun
Wang, Jun
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (01)
[32] Object Detection Using a Combination of Multiple 3D Feature Descriptors
Kiforenko, Lilita
Buch, Anders Glent
Kruger, Norbert
COMPUTER VISION SYSTEMS (ICVS 2015), 2015, 9163 : 343 - 353
[33] Strong-Weak Feature Alignment for 3D Object Detection
Wang, Zhiyu
Wang, Li
Dai, Bin
ELECTRONICS, 2021, 10 (10)
[34] Feature extraction for 3D object detection using integral imaging
Aloni, Doron
Yitzhaky, Yitzhak
IMAGE RECONSTRUCTION FROM INCOMPLETE DATA VIII, 2015, 9600
[35] 3D Probabilistic feature point model for object detection and recognition
Romdhani, Sami
Vetter, Thomas
2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 2407 - +
[36] Focal Sparse Convolutional Networks for 3D Object Detection
Chen, Yukang
Li, Yanwei
Zhang, Xiangyu
Sun, Jian
Jia, Jiaya
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5418 - 5427
[37] PointGAT: Graph attention networks for 3D object detection
Zhou H.
Wang W.
Liu G.
Zhou Q.
Intelligent and Converged Networks, 2022, 3 (02): : 204 - 216
[38] Silhouette and stereo fusion for 3D object modeling
Esteban, CH
Schmitt, F
COMPUTER VISION AND IMAGE UNDERSTANDING, 2004, 96 (03) : 367 - 392
[39] Evaluation of Stereo Algorithms for 3D Object Recognition
Tombari, Federico
Gori, Fabio
Di Stefano, Luigi
2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
[40] Multi-stereo 3D object reconstruction
Esteban, CH
Schmitt, F
FIRST INTERNATIONAL SYMPOSIUM ON 3D DATA PROCESSING VISUALIZATION AND TRANSMISSION, 2002, : 159 - 166

← 1 2 3 4 5 →