Unified diffusion-based object detection in multi-modal and low-light remote sensing images

Cited by: 0
Authors
Sun, Xu [1]
Yu, Yinhui [1]
Cheng, Qing [1]
Affiliations
[1] Jilin Univ, Sch Commun Engn, Changchun, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
computer vision; convolutional neural nets; image processing;
DOI
10.1049/ell2.70093
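Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract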
Remote sensing object detection remains challenging under complex conditions such as low light, adverse weather, and modality attacks or losses. Previous approaches typically alleviate this problem either by enhancing visible images or by leveraging multi-modal fusion technologies. In view of this, the authors propose a unified framework based on YOLO-World that combines the advantages of both schemes, achieving more adaptable and robust remote sensing object detection in complex real-world scenarios. The framework introduces a unified modality modelling strategy, allowing the model to learn abundant object features from multiple remote sensing datasets. Additionally, a U-fusion neck based on the diffusion method is designed to effectively remove modality-specific noise and generate missing complementary features. Extensive experiments were conducted on four remote sensing image datasets: the multimodal VEDAI and DroneVehicle datasets and the unimodal VisDrone and UAVDT datasets. The approach achieves average precision scores of 50.5%, 55.3%, 25.1%, and 20.7%, outperforming advanced multimodal remote sensing object detection methods and low-light image enhancement techniques.
Pages: 5
Related papers
50 in total
  • [21] Subtask Attention Based Object Detection in Remote Sensing Images
    Xiong, Shengzhou
    Tan, Yihua
    Li, Yansheng
    Wen, Cai
    Yan, Pei
    REMOTE SENSING, 2021, 13 (10)
  • [22] Object Detection Based on BING in Optical Remote Sensing Images
    Zheng, Jiangbin
    Xi, Yue
    Feng, Mingchen
    Lie, Xiuxiu
    Li, Na
    2016 9TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2016), 2016 : 504 - 509
  • [23] Multi-task Learning of Semantic Segmentation and Height Estimation for Multi-modal Remote Sensing Images
    Mengyu WANG
    Zhiyuan YAN
    Yingchao FENG
    Wenhui DIAO
    Xian SUN
    Journal of Geodesy and Geoinformation Science, 2023, 6 (04) : 27 - 39
  • [24] Multi-Scale Bushfire Detection From Multi-Modal Streams of Remote Sensing Data
    Thanh Cong Phan
    Thanh Tam Nguyen
    Thanh Dat Hoang
    Quoc Viet Hung Nguyen
    Jo, Jun
    IEEE ACCESS, 2020, 8 : 228496 - 228513
  • [25] Multi-Attention Object Detection Model in Remote Sensing Images Based on Multi-Scale
    Ying, Xiang
    Wang, Qiang
    Li, Xuewei
    Yu, Mei
    Jiang, Han
    Gao, Jie
    Liu, Zhiqiang
    Yu, Ruiguo
    IEEE ACCESS, 2019, 7 : 94508 - 94519
  • [26] Unified multimodal fusion transformer for few shot object detection for remote sensing images
    Azeem, Abdullah
    Li, Zhengzhou
    Siddique, Abubakar
    Zhang, Yuting
    Zhou, Shangbo
    INFORMATION FUSION, 2024, 111
  • [27] A Multi-modal Moving Object Detection Method Based on GrowCut Segmentation
    Zhang, Xiuwei
    Zhang, Yanning
    Maybank, Stephen John
    Liang, Jun
    2014 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE FOR MULTIMEDIA, SIGNAL AND VISION PROCESSING (CIMSIVP), 2014 : 213 - 218
  • [28] Robust registration of multi-modal remote sensing images based on multi-dimensional oriented self-similarity features
    Zhang, Yongjun
    Zhang, Wenfei
    Yao, Yongxiang
    Zheng, Zhi
    Wan, Yi
    Xiong, Mingtao
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 127
  • [29] EMNet: Edge-guided multi-level network for salient object detection in low-light images
    Jing, Lianghu
    Wang, Bo
    IMAGE AND VISION COMPUTING, 2024, 143
  • [30] Salient Object Detection via Multi-feature Diffusion-based Method
    Ye Feng
    Hong Siting
    Chen Jiazhen
    Zheng Zihua
    Liu Guanghai
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2018, 40 (05) : 1210 - 1218