Unified diffusion-based object detection in multi-modal and low-light remote sensing images

被引:0
|
作者
Sun, Xu [1 ]
Yu, Yinhui [1 ]
Cheng, Qing [1 ]
机构
[1] Jilin Univ, Sch Commun Engn, Changchun, Peoples R China
基金
中国国家自然科学基金;
关键词
computer vision; convolutional neural nets; image processing;
D O I
10.1049/ell2.70093
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Remote sensing object detection remains a challenge under complex conditions such as low light, adverse weather, modality attacks or losses. Previous approaches typically alleviate this problem by enhancing visible images or leveraging multi-modal fusion technologies. In view of this, the authors propose a unified framework based on YOLO-World that combines the advantages of both schemes, achieving more adaptable and robust remote sensing object detection in complex real-world scenarios. This framework introduces a unified modality modelling strategy, allowing the model to learn abundant object features from multiple remote sensing datasets. Additionally, a U-fusion neck based on the diffusion method is designed to effectively remove modality-specific noise and generate missing complementary features. Extensive experiments were conducted on four remote sensing image datasets: Multimodal VEDAI, DroneVehicle, unimodal VisDrone and UAVDT. This approach achieves average precision scores of 50.5%$\%$, 55.3%$\%$, 25.1%$\%$, and 20.7%$\%$, which outperforms advanced multimodal remote sensing object detection methods and low-light image enhancement techniques.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Gated Multi-Modal Edge Refinement Network for Light Field Salient Object Detection
    Li, Yefan
    Duan, Fuqing
    Lu, Ke
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (10)
  • [32] Multi-modal Remote Sensing Image Registration Based on Multi-scale Phase Congruency
    Cui, Song
    Zhong, Yanfei
    2018 10TH IAPR WORKSHOP ON PATTERN RECOGNITION IN REMOTE SENSING (PRRS), 2018,
  • [33] Accurate Pipeline Blockage Detection with Low-Cost Multi-Modal Sensing
    Zhang, Chengjie
    Heidemann, John
    2014 IEEE 11TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SENSOR SYSTEMS (MASS), 2014, : 172 - 180
  • [34] An Accurate Low-light Object Detection Method Based on Pyramid Networks
    Tao Qingyang
    Ren Kun
    Feng Bo
    Gao Xuejin
    OPTOELECTRONIC IMAGING AND MULTIMEDIA TECHNOLOGY VII, 2020, 11550
  • [35] Hierarchical Multi-Label Object Detection Framework for Remote Sensing Images
    Shin, Su-Jin
    Kim, Seyeob
    Kim, Youngjung
    Kim, Sungho
    REMOTE SENSING, 2020, 12 (17)
  • [36] MSNet: Multi-Scale Network for Object Detection in Remote Sensing Images
    Gao, Tao
    Xia, Shilin
    Liu, Mengkun
    Zhang, Jing
    Chen, Ting
    Li, Ziqi
    PATTERN RECOGNITION, 2025, 158
  • [37] DENSE CONTRASTIVE LEARNING BASED OBJECT DETECTION FOR REMOTE SENSING IMAGES
    Liu, Shuo
    Zou, Huanxin
    Li, Meilin
    Cao, Xu
    He, Shitian
    Wei, Juan
    Sun, Li
    Zhang, Yuqing
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 6458 - 6461
  • [38] Object detection in remote sensing images based on deep transfer learning
    Chen, Jinyong
    Sun, Jianguo
    Li, Yuqian
    Hou, Changbo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (09) : 12093 - 12109
  • [39] Feature Enhancement Based Oriented Object Detection in Remote Sensing Images
    Guo, Hongjian
    Zhou, Xianlin
    Yang, Peng
    NEURAL PROCESSING LETTERS, 2024, 56 (06)
  • [40] Object Detection in Optical Remote Sensing Images Based on Residual Network
    Li, Da
    Gong, Shaoxing
    Liu, Dong
    2020 4TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND INFORMATION TECHNOLOGY (CMVIT 2020), 2020, 1518