MSI-RGB Dual-Source Multiscale Fusion-Based Solid Waste Object Detection

被引:0
|
作者
Liu, Haonan [1 ,2 ]
Yang, Jianhong [1 ,2 ]
Fang, Huaiying [1 ,2 ]
Ji, Tianchen [1 ,2 ]
Cai, Zhenxing [1 ,2 ]
机构
[1] The Key Laboratory of Process Monitoring and System Optimization for Mechanical and Electrical Equipment, Fujian Province University, China
[2] School of Electromechanical and Automation, Huaqiao University, Xiamen,361021, China
关键词
Effluent treatment - Image coding - Image enhancement - Image fusion - Multispectral scanners - Photomapping - Remote sensing - RGB color model - Waste paper;
D O I
10.1109/TIM.2024.3480227
中图分类号
学科分类号
摘要
This article focuses on the significant challenges faced by single-modal visual sensors in object detection under the fine-sorting process of solid waste. Object detection using a single sensor has inherent limitations due to the lack of comprehensive material and contour information about the object. Therefore, this article proposes a real-time detection network based on the MSI-RGB dual-source multiscale fusion network (DMFNet) and introduces a coarse-to-fine dual-stage band selection (dual-stage BS) method for reducing spectral redundant information. As a case study, we focus on plastic solid waste and build a dual-sensor experimental platform to collect color and hyperspectral images (HSIs) using a face-matrix color camera and a hyperspectral camera. Specifically, the dual-stage BS method is employed to analyze the hyper-spectral data and a small number of spectral bands were selected to generate a multispectral image (MSI) as the first input source for DMFNet. Meanwhile, the RGB image serves as the second input source for DMFNet. First, a spectral image feature extraction backbone (SIFEBackbone) and a multiscale stacking unit (MSU) are designed to extract multiscale features from spectral images fully. Second, a multibranch fusion unit (MBFU) is designed to realize the fusion of multiscale features with the spectral feature maps in the RGB-enhanced feature extraction stage. The experimental results show that dual-stage BS exhibits advantages over other methods, while DMF-YOLOv7, which integrates DMFNet and YOLOv7 detectors, improves the average precision and recall by 7.2% and 5.5%, respectively, compared with the original YOLOv7, and the mAP reaches 95.4% at IOU = 0.5. It effectively reduces the material misrecognition problem and reflects the advantages of the dual-sensor algorithm. The source code is available at: https://github.com/Caicai-D/DMFNet. © 1963-2012 IEEE.
引用
收藏
相关论文
共 50 条
  • [31] Transformer-based difference fusion network for RGB-D salient object detection
    Cui, Zhi-Qiang
    Wang, Feng
    Feng, Zheng-Yong
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
  • [32] Object-based change detection on multiscale fusion for VHR remote sensing images
    Zhang, Hansong
    Chen, Jianyu
    Liu, Xin
    MIPPR 2015: REMOTE SENSING IMAGE PROCESSING, GEOGRAPHIC INFORMATION SYSTEMS, AND OTHER APPLICATIONS, 2015, 9815
  • [33] RGB-D and Laser Data Fusion-based Human Detection and Tracking for Socially Aware Robot Navigation Framework
    Xuan-Tung Truong
    Voo Nyuk Yoong
    Trung-Dung Ngo
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2015, : 608 - 613
  • [34] Improving Radar-Camera Fusion-based 3D Object Detection for Autonomous Vehicles
    Kurniawan, Irfan Tito
    Trilaksono, Bambang Riyanto
    2022 12TH INTERNATIONAL CONFERENCE ON SYSTEM ENGINEERING AND TECHNOLOGY (ICSET 2022), 2022, : 42 - 47
  • [35] Multiscale Geometric Analysis Fusion-Based Unsupervised Change Detection in Remote Sensing Images via FLICM Model
    Li, Liangliang
    Ma, Hongbing
    Jia, Zhenhong
    ENTROPY, 2022, 24 (02)
  • [36] MSF-YOLO: A multi-scale features fusion-based method for small object detection
    Yang, Fengyu
    Zhou, Jiaqi
    Chen, Yuan
    Liao, Jie
    Yang, Mingxiang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (22) : 61239 - 61260
  • [37] The Dual-Band SAR Image Fusion-Based Foliage-Penetrating Target Detection Method
    Zhang, Haonan
    An, Daoxiang
    Li, Jianpeng
    Chen, Leping
    Feng, Dong
    Song, Yongping
    Zhou, Zhimin
    IEEE Transactions on Geoscience and Remote Sensing, 2024, 62
  • [38] RGB-THERMAL BASED PEDESTRIAN DETECTION WITH SINGLE-MODAL AUGMENTATION AND ROI POOLING MULTISCALE FUSION
    Xiang, Jiajun
    Gou, Shuiping
    Li, Ruimin
    Zheng, Zhihui
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 3532 - 3535
  • [39] Multispectral Object Detection Based on Multilevel Feature Fusion and Dual Feature Modulation
    Sun, Jin
    Yin, Mingfeng
    Wang, Zhiwei
    Xie, Tao
    Bei, Shaoyi
    ELECTRONICS, 2024, 13 (02)
  • [40] Multi-Scale Feature Fusion Saliency Object Detection Based on RGB-D Images
    Wang, Zhen
    Yu, Wanjun
    Chen, Ying
    Computer Engineering and Applications, 2024, 60 (11) : 242 - 250