Object Detection by Channel and Spatial Exchange for Multimodal Remote Sensing Imagery

被引:6
|
作者
Nan, Guozheng [1 ]
Zhao, Yue [1 ]
Fu, Liyong [2 ,3 ]
Ye, Qiaolin [1 ]
机构
[1] Nanjing Forestry Univ, Coll Informat Sci & Technol, Coll Artificial Intelligence, Nanjing 210037, Peoples R China
[2] Chinese Acad Forestry, Inst Forest Resource Informat Tech, Beijing 100091, Peoples R China
[3] Hebei Agr Univ, Coll Forestry, Baoding 071000, Peoples R China
关键词
Neck; Remote sensing; Feature extraction; Decoding; Iron; Head; Forestry; Multimodal feature fusion; remote sensing image (RSI); RGB-infrared object detection; super resolution (SR);
D O I
10.1109/JSTARS.2024.3388013
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Smart satellites and unmanned aerial vehicles (UAVs) are typically equipped with visible light and infrared (IR) spectrum sensors. However, achieving real-time object detection utilizing these multimodal data on such resource-limited devices is a challenging task. This article proposes HyperYOLO, a real-time lightweight object detection framework for multimodal remote sensing images. First, we propose a lightweight multimodal fusion module named channel and spatial exchange (CSE) to effectively extract complementary information from different modalities. The CSE module consists of two stages: channel exchange and spatial exchange. Channel exchange achieves global fusion by learning global weights to better utilize cross-channel information correlation, while spatial exchange captures details by considering spatial relationships to calibrate local fusion. Second, we propose an effective auxiliary branch module based on the feature pyramid network for super resolution (FPNSR) to enhance the framework's responsiveness to small objects by learning high-quality feature representations. Moreover, we embed a coordinate attention mechanism to assist our network in precisely localizing and attending to the objects of interest. The experimental results show that on the VEDAI remote sensing dataset, HyperYOLO achieves a 76.72% mAP(50), surpassing the SOTA SuperYOLO by 1.63%. Meanwhile, the parameter size and GFLOPs of HyperYOLO are about 1.34 million (28%) and 3.97 (22%) less than SuperYOLO, respectively. In addition, HyperYOLO has a file size of only 7.3 MB after the removal of the auxiliary FPNSR branch, which makes it easier to deploy on these resource-constrained devices.
引用
收藏
页码:8581 / 8593
页数:13
相关论文
共 50 条
  • [41] Remote Sensing Imagery Object Detection Model Compression via Tucker Decomposition
    Huyan, Lang
    Li, Ying
    Jiang, Dongmei
    Zhang, Yanning
    Zhou, Quan
    Li, Bo
    Wei, Jiayuan
    Liu, Juanni
    Zhang, Yi
    Wang, Peng
    Fang, Hai
    MATHEMATICS, 2023, 11 (04)
  • [42] Editorial for the Special Issue "Advances in Object and Activity Detection in Remote Sensing Imagery"
    Ulhaq, Anwaar
    Gomes, Douglas Pinto Sampaio
    REMOTE SENSING, 2022, 14 (08)
  • [43] Efficient Inductive Vision Transformer for Oriented Object Detection in Remote Sensing Imagery
    Zhang, Cong
    Su, Jingran
    Ju, Yakun
    Lam, Kin-Man
    Wang, Qi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [44] Absorption Pruning of Deep Neural Network for Object Detection in Remote Sensing Imagery
    Wang, Jielei
    Cui, Zongyong
    Zang, Zhipeng
    Meng, Xiangjie
    Cao, Zongjie
    REMOTE SENSING, 2022, 14 (24)
  • [45] Cross-Modal Adaptation for Object Detection in Infrared Remote Sensing Imagery
    Wang, Zeyu
    Li, Shuaiting
    Huang, Kejie
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2025, 22
  • [46] Stepwise Locating Bidirectional Pyramid Network for Object Detection in Remote Sensing Imagery
    Yu, Nanjing
    Ren, Haohao
    Deng, Tianmin
    Fan, Xiaobiao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [47] Progressive Context-Dependent Inference for Object Detection in Remote Sensing Imagery
    Liu, Binhui
    Xu, Chunyan
    Cui, Zhen
    Yang, Jian
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 580 - 590
  • [48] Progressive Symmetric Registration for Multimodal Remote Sensing Imagery
    Yan, Heng
    Ma, Ailong
    Zhong, Yanfei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [49] Parallel Space and Channel Attention for Stronger Remote Sensing Object Detection
    Zhao, Yuhui
    Yang, Ruifeng
    Guo, Chenxia
    Chen, Xiaole
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 2610 - 2621
  • [50] NOVELTY DETECTION IN REMOTE SENSING IMAGERY
    Du, Dawei
    Funk, Christopher
    Hoogs, Anthony
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 5325 - 5328