A Fusion of RGB Features and Local Descriptors for Object Detection in Road Scene

被引:0
|
作者
Dinh Nguyen, Vinh [1 ]
机构
[1] FPT Univ, Dept Informat Technol, Can Tho Campus, Can Tho City 94000, Vietnam
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Feature extraction; Object detection; Training; Noise measurement; Mathematical models; Laser radar; Detectors; Multimodal sensors; Multi-modal fusion; object detection; local pattern;
D O I
10.1109/ACCESS.2024.3404248
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many texture descriptors have been introduced in recent years to improve texture analysis and classification outcomes, which are important in many computer vision tasks including object recognition and detection, human detector, and especially in face recognition. Local pattern is a texture descriptor that can successfully extract distinctive texture features that possesses noise and illumination variance robustness. This paper focuses on making use of local pattern features in boosting object detection models in a multi-modal fusion paradigm to acquire reliable feature maps in forward propagation throughout the network regardless of variations in photo taking conditions. We propose an adaptive fusion architecture for RGB and Local Ternary Pattern information. This architecture leverage local pattern to enrich information of original feature maps and adapt to many object detection models. Our local pattern fusion network concentrates on backbone and neck modules with an simple and efficient operation. The notable accuracy advancement is 8.03% observed in Cascade R-CNN in KITTI Dataset. In difficult conditions, our fusion models significantly lift the original performance from 4.7% to 66.3% mAP score.
引用
收藏
页码:72957 / 72967
页数:11
相关论文
共 50 条
  • [31] Local Background Enclosure for RGB-D Salient Object Detection
    Feng, David
    Barnes, Nick
    You, Shaodi
    McCarthy, Chris
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2343 - 2350
  • [32] Learning Local-Global Multi-Graph Descriptors for RGB-T Object Tracking
    Li, Chenglong
    Zhu, Chengli
    Zhang, Jian
    Luo, Bin
    Wu, Xiaohao
    Tang, Jin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (10) : 2913 - 2926
  • [33] RGB-D Scene Recognition with Object-to-Object Relation
    Song, Xinhang
    Chen, Chengpeng
    Jiang, Shuqiang
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 600 - 608
  • [34] MFFENet: Multiscale Feature Fusion and Enhancement Network For RGB-Thermal Urban Road Scene Parsing
    Zhou, Wujie
    Lin, Xinyang
    Lei, Jingsheng
    Yu, Lu
    Hwang, Jenq-Neng
    IEEE Transactions on Multimedia, 2022, 24 : 2526 - 2538
  • [35] MFFENet: Multiscale Feature Fusion and Enhancement Network For RGB-Thermal Urban Road Scene Parsing
    Zhou, Wujie
    Lin, Xinyang
    Lei, Jingsheng
    Yu, Lu
    Hwang, Jenq-Neng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 2526 - 2538
  • [36] RGB-D Object Classification Using Covariance Descriptors
    Fehr, Duc
    Beksi, William J.
    Zermas, Dimitris
    Papanikolopoulos, Nikolaos
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 5467 - 5472
  • [37] Modal complementary fusion network for RGB-T salient object detection
    Shuai Ma
    Kechen Song
    Hongwen Dong
    Hongkun Tian
    Yunhui Yan
    Applied Intelligence, 2023, 53 : 9038 - 9055
  • [38] An adaptive guidance fusion network for RGB-D salient object detection
    Sun, Haodong
    Wang, Yu
    Ma, Xinpeng
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (02) : 1683 - 1693
  • [39] Exploring RGB plus Depth Fusion for Real-Time Object Detection
    Ophoff, Tanguy
    Van Beeck, Kristof
    Goedeme, Toon
    SENSORS, 2019, 19 (04)
  • [40] Modal complementary fusion network for RGB-T salient object detection
    Ma, Shuai
    Song, Kechen
    Dong, Hongwen
    Tian, Hongkun
    Yan, Yunhui
    APPLIED INTELLIGENCE, 2023, 53 (08) : 9038 - 9055