Object Detection by Channel and Spatial Exchange for Multimodal Remote Sensing Imagery

被引:6
|
作者
Nan, Guozheng [1 ]
Zhao, Yue [1 ]
Fu, Liyong [2 ,3 ]
Ye, Qiaolin [1 ]
机构
[1] Nanjing Forestry Univ, Coll Informat Sci & Technol, Coll Artificial Intelligence, Nanjing 210037, Peoples R China
[2] Chinese Acad Forestry, Inst Forest Resource Informat Tech, Beijing 100091, Peoples R China
[3] Hebei Agr Univ, Coll Forestry, Baoding 071000, Peoples R China
关键词
Neck; Remote sensing; Feature extraction; Decoding; Iron; Head; Forestry; Multimodal feature fusion; remote sensing image (RSI); RGB-infrared object detection; super resolution (SR);
D O I
10.1109/JSTARS.2024.3388013
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Smart satellites and unmanned aerial vehicles (UAVs) are typically equipped with visible light and infrared (IR) spectrum sensors. However, achieving real-time object detection utilizing these multimodal data on such resource-limited devices is a challenging task. This article proposes HyperYOLO, a real-time lightweight object detection framework for multimodal remote sensing images. First, we propose a lightweight multimodal fusion module named channel and spatial exchange (CSE) to effectively extract complementary information from different modalities. The CSE module consists of two stages: channel exchange and spatial exchange. Channel exchange achieves global fusion by learning global weights to better utilize cross-channel information correlation, while spatial exchange captures details by considering spatial relationships to calibrate local fusion. Second, we propose an effective auxiliary branch module based on the feature pyramid network for super resolution (FPNSR) to enhance the framework's responsiveness to small objects by learning high-quality feature representations. Moreover, we embed a coordinate attention mechanism to assist our network in precisely localizing and attending to the objects of interest. The experimental results show that on the VEDAI remote sensing dataset, HyperYOLO achieves a 76.72% mAP(50), surpassing the SOTA SuperYOLO by 1.63%. Meanwhile, the parameter size and GFLOPs of HyperYOLO are about 1.34 million (28%) and 3.97 (22%) less than SuperYOLO, respectively. In addition, HyperYOLO has a file size of only 7.3 MB after the removal of the auxiliary FPNSR branch, which makes it easier to deploy on these resource-constrained devices.
引用
收藏
页码:8581 / 8593
页数:13
相关论文
共 50 条
  • [1] YOLOrs: Object Detection in Multimodal Remote Sensing Imagery
    Sharma, Manish
    Dhanaraj, Mayur
    Karnam, Srivallabha
    Chachlakis, Dimitris G.
    Ptucha, Raymond
    Markopoulos, Panos P.
    Saber, Eli
    Markopoulos, Panos P. (pxmeee@rit.edu), 1600, Institute of Electrical and Electronics Engineers Inc. (14): : 1497 - 1508
  • [2] YOLOrs: Object Detection in Multimodal Remote Sensing Imagery
    Sharma, Manish
    Dhanaraj, Mayur
    Karnam, Srivallabha
    Chachlakis, Dimitris G.
    Ptucha, Raymond
    Markopoulos, Panos P.
    Saber, Eli
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 1497 - 1508
  • [3] MULTIMODAL OBJECT DETECTION IN REMOTE SENSING
    Belmouhcine, A.
    Burnel, J. C.
    Courtrai, L.
    Pham, M. T.
    Lefevre, S.
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 1245 - 1248
  • [4] SuperYOLO: Super Resolution Assisted Object Detection in Multimodal Remote Sensing Imagery
    Zhang, Jiaqing
    Lei, Jie
    Xie, Weiying
    Fang, Zhenman
    Li, Yunsong
    Du, Qian
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [5] NOVEL OBJECT DETECTION IN REMOTE SENSING IMAGERY
    Du, Dawei
    Funk, Christopher
    Doctor, Katarina
    Hoogs, Anthony
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 5798 - 5801
  • [6] Multi-Scale Spatial and Channel-wise Attention for Improving Object Detection in Remote Sensing Imagery
    Chen, Jie
    Wan, Li
    Zhu, Jingru
    Xu, Gang
    Deng, Min
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (04) : 681 - 685
  • [7] Cascaded Object Detection Algorithm in Remote Sensing Imagery
    Zhang X.
    Li C.
    Xu J.
    Xie J.
    Cui Z.
    Yang J.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2021, 33 (10): : 1524 - 1531
  • [8] ORSIm Detector: A Novel Object Detection Framework in Optical Remote Sensing Imagery Using Spatial-Frequency Channel Features
    Wu, Xin
    Hong, Danfeng
    Tian, Jiaojiao
    Chanussot, Jocelyn
    Li, Wei
    Tao, Ran
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (07): : 5146 - 5158
  • [9] Multimodal Object Detection by Channel Switching and Spatial Attention
    Cao, Yue
    Bin, Junchi
    Hamari, Jozsef
    Blasch, Erik
    Liu, Zheng
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2023, : 403 - 411
  • [10] FEATURE-ATTENTIONED OBJECT DETECTION IN REMOTE SENSING IMAGERY
    Li, Chengzheng
    Xu, Chunyan
    Cui, Zhen
    Wang, Dan
    Zhang, Tong
    Yang, Jian
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3886 - 3890