A Fusion of RGB Features and Local Descriptors for Object Detection in Road Scene

被引：0

作者：

Dinh Nguyen, Vinh ^{[1
]}

机构：

[1] FPT Univ, Dept Informat Technol, Can Tho Campus, Can Tho City 94000, Vietnam

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Feature extraction; Object detection; Training; Noise measurement; Mathematical models; Laser radar; Detectors; Multimodal sensors; Multi-modal fusion; object detection; local pattern;

D O I：

10.1109/ACCESS.2024.3404248

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Many texture descriptors have been introduced in recent years to improve texture analysis and classification outcomes, which are important in many computer vision tasks including object recognition and detection, human detector, and especially in face recognition. Local pattern is a texture descriptor that can successfully extract distinctive texture features that possesses noise and illumination variance robustness. This paper focuses on making use of local pattern features in boosting object detection models in a multi-modal fusion paradigm to acquire reliable feature maps in forward propagation throughout the network regardless of variations in photo taking conditions. We propose an adaptive fusion architecture for RGB and Local Ternary Pattern information. This architecture leverage local pattern to enrich information of original feature maps and adapt to many object detection models. Our local pattern fusion network concentrates on backbone and neck modules with an simple and efficient operation. The notable accuracy advancement is 8.03% observed in Cascade R-CNN in KITTI Dataset. In difficult conditions, our fusion models significantly lift the original performance from 4.7% to 66.3% mAP score.

引用

页码：72957 / 72967

页数：11

共 50 条

[31] Local Background Enclosure for RGB-D Salient Object Detection
Feng, David
Barnes, Nick
You, Shaodi
McCarthy, Chris
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2343 - 2350
[32] Learning Local-Global Multi-Graph Descriptors for RGB-T Object Tracking
Li, Chenglong
Zhu, Chengli
Zhang, Jian
Luo, Bin
Wu, Xiaohao
Tang, Jin
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (10) : 2913 - 2926
[33] RGB-D Scene Recognition with Object-to-Object Relation
Song, Xinhang
Chen, Chengpeng
Jiang, Shuqiang
PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 600 - 608
[34] MFFENet: Multiscale Feature Fusion and Enhancement Network For RGB-Thermal Urban Road Scene Parsing
Zhou, Wujie
Lin, Xinyang
Lei, Jingsheng
Yu, Lu
Hwang, Jenq-Neng
IEEE Transactions on Multimedia, 2022, 24 : 2526 - 2538
[35] MFFENet: Multiscale Feature Fusion and Enhancement Network For RGB-Thermal Urban Road Scene Parsing
Zhou, Wujie
Lin, Xinyang
Lei, Jingsheng
Yu, Lu
Hwang, Jenq-Neng
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 2526 - 2538
[36] RGB-D Object Classification Using Covariance Descriptors
Fehr, Duc
Beksi, William J.
Zermas, Dimitris
Papanikolopoulos, Nikolaos
2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 5467 - 5472
[37] Modal complementary fusion network for RGB-T salient object detection
Shuai Ma
Kechen Song
Hongwen Dong
Hongkun Tian
Yunhui Yan
Applied Intelligence, 2023, 53 : 9038 - 9055
[38] An adaptive guidance fusion network for RGB-D salient object detection
Sun, Haodong
Wang, Yu
Ma, Xinpeng
SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (02) : 1683 - 1693
[39] Exploring RGB plus Depth Fusion for Real-Time Object Detection
Ophoff, Tanguy
Van Beeck, Kristof
Goedeme, Toon
SENSORS, 2019, 19 (04)
[40] Modal complementary fusion network for RGB-T salient object detection
Ma, Shuai
Song, Kechen
Dong, Hongwen
Tian, Hongkun
Yan, Yunhui
APPLIED INTELLIGENCE, 2023, 53 (08) : 9038 - 9055

← 1 2 3 4 5 →