Object Detection by Channel and Spatial Exchange for Multimodal Remote Sensing Imagery

Cited by: 6
Authors
Nan, Guozheng [1]
Zhao, Yue [1]
Fu, Liyong [2, 3]
Ye, Qiaolin [1]
Affiliations
[1] Nanjing Forestry Univ, Coll Informat Sci & Technol, Coll Artificial Intelligence, Nanjing 210037, Peoples R China
[2] Chinese Acad Forestry, Inst Forest Resource Informat Tech, Beijing 100091, Peoples R China
[3] Hebei Agr Univ, Coll Forestry, Baoding 071000, Peoples R China
Keywords
Neck; Remote sensing; Feature extraction; Decoding; Iron; Head; Forestry; Multimodal feature fusion; remote sensing image (RSI); RGB-infrared object detection; super resolution (SR)
DOI
10.1109/JSTARS.2024.3388013
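Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract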
Smart satellites and unmanned aerial vehicles (UAVs) are typically equipped with visible light and infrared (IR) spectrum sensors. However, achieving real-time object detection using these multimodal data on such resource-limited devices is a challenging task. This article proposes HyperYOLO, a real-time lightweight object detection framework for multimodal remote sensing images. First, we propose a lightweight multimodal fusion module named channel and spatial exchange (CSE) to effectively extract complementary information from different modalities. The CSE module consists of two stages: channel exchange and spatial exchange. Channel exchange achieves global fusion by learning global weights to better utilize cross-channel information correlation, while spatial exchange captures details by considering spatial relationships to calibrate local fusion. Second, we propose an effective auxiliary branch module based on the feature pyramid network for super resolution (FPNSR) to enhance the framework's responsiveness to small objects by learning high-quality feature representations. Moreover, we embed a coordinate attention mechanism to help the network precisely localize and attend to the objects of interest. The experimental results show that on the VEDAI remote sensing dataset, HyperYOLO achieves 76.72% mAP50, surpassing the state-of-the-art SuperYOLO by 1.63%. Meanwhile, the parameter count and GFLOPs of HyperYOLO are about 1.34 million (28%) and 3.97 (22%) lower than those of SuperYOLO, respectively. In addition, HyperYOLO has a file size of only 7.3 MB after the removal of the auxiliary FPNSR branch, which makes it easier to deploy on these resource-constrained devices.
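The comparison figures in the abstract can be turned into rough absolute numbers with a little arithmetic. The sketch below is a back-of-the-envelope reading only: it assumes the 1.63% margin is an absolute difference in percentage points, and it treats the rounded percentages (28%, 22%) as exact, so the implied totals are approximate. The helper name `implied_baseline` is hypothetical and introduced here for illustration.

```python
# Figures quoted in the abstract.
hyper_map50 = 76.72        # HyperYOLO mAP50 on VEDAI, in percent
map50_gain = 1.63          # reported margin over SuperYOLO
param_saving_m = 1.34      # parameters saved, in millions
param_saving_frac = 0.28   # stated as about 28% of SuperYOLO
gflops_saving = 3.97       # GFLOPs saved
gflops_saving_frac = 0.22  # stated as about 22% of SuperYOLO


def implied_baseline(saving, fraction):
    """Return (baseline, reduced) implied by an absolute saving and its relative share.

    Hypothetical helper: assumes `saving` equals `fraction` times the baseline.
    """
    baseline = saving / fraction
    return baseline, baseline - saving


# Assumption: the margin is an absolute difference in percentage points.
super_map50 = hyper_map50 - map50_gain

super_params, hyper_params = implied_baseline(param_saving_m, param_saving_frac)
super_gflops, hyper_gflops = implied_baseline(gflops_saving, gflops_saving_frac)

print(f"SuperYOLO mAP50 (implied): {super_map50:.2f}%")
print(f"Parameters (M): SuperYOLO ~{super_params:.2f}, HyperYOLO ~{hyper_params:.2f}")
print(f"GFLOPs: SuperYOLO ~{super_gflops:.2f}, HyperYOLO ~{hyper_gflops:.2f}")
```

Under these assumptions the baseline works out to roughly 4.8 million parameters and 18 GFLOPs, with HyperYOLO at roughly 3.4 million parameters and 14 GFLOPs. Because the percentages in the abstract are rounded, the exact values should be taken from the paper itself.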
Pages: 8581-8593
Number of pages: 13