Object Detection by Channel and Spatial Exchange for Multimodal Remote Sensing Imagery

被引:6
|
作者
Nan, Guozheng [1 ]
Zhao, Yue [1 ]
Fu, Liyong [2 ,3 ]
Ye, Qiaolin [1 ]
机构
[1] Nanjing Forestry Univ, Coll Informat Sci & Technol, Coll Artificial Intelligence, Nanjing 210037, Peoples R China
[2] Chinese Acad Forestry, Inst Forest Resource Informat Tech, Beijing 100091, Peoples R China
[3] Hebei Agr Univ, Coll Forestry, Baoding 071000, Peoples R China
关键词
Neck; Remote sensing; Feature extraction; Decoding; Iron; Head; Forestry; Multimodal feature fusion; remote sensing image (RSI); RGB-infrared object detection; super resolution (SR);
D O I
10.1109/JSTARS.2024.3388013
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Smart satellites and unmanned aerial vehicles (UAVs) are typically equipped with visible light and infrared (IR) spectrum sensors. However, achieving real-time object detection utilizing these multimodal data on such resource-limited devices is a challenging task. This article proposes HyperYOLO, a real-time lightweight object detection framework for multimodal remote sensing images. First, we propose a lightweight multimodal fusion module named channel and spatial exchange (CSE) to effectively extract complementary information from different modalities. The CSE module consists of two stages: channel exchange and spatial exchange. Channel exchange achieves global fusion by learning global weights to better utilize cross-channel information correlation, while spatial exchange captures details by considering spatial relationships to calibrate local fusion. Second, we propose an effective auxiliary branch module based on the feature pyramid network for super resolution (FPNSR) to enhance the framework's responsiveness to small objects by learning high-quality feature representations. Moreover, we embed a coordinate attention mechanism to assist our network in precisely localizing and attending to the objects of interest. The experimental results show that on the VEDAI remote sensing dataset, HyperYOLO achieves a 76.72% mAP(50), surpassing the SOTA SuperYOLO by 1.63%. Meanwhile, the parameter size and GFLOPs of HyperYOLO are about 1.34 million (28%) and 3.97 (22%) less than SuperYOLO, respectively. In addition, HyperYOLO has a file size of only 7.3 MB after the removal of the auxiliary FPNSR branch, which makes it easier to deploy on these resource-constrained devices.
引用
收藏
页码:8581 / 8593
页数:13
相关论文
共 50 条
  • [31] A Method of Object Detection for Remote Sensing Imagery Based on Spectral Space Transformation
    Wu Gui-ping
    Xiao Peng-feng
    Feng Xue-zhi
    Wang Ke
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2013, 33 (03) : 741 - 745
  • [32] ABNet: Adaptive Balanced Network for Multiscale Object Detection in Remote Sensing Imagery
    Liu, Yanfeng
    Li, Qiang
    Yuan, Yuan
    Du, Qian
    Wang, Qi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [33] Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery
    Azimi, Seyed Majid
    Vig, Eleonora
    Bahmanyar, Reza
    Koerner, Marco
    Reinartz, Peter
    COMPUTER VISION - ACCV 2018, PT III, 2019, 11363 : 150 - 165
  • [34] Stepwise Locating Bidirectional Pyramid Network for Object Detection in Remote Sensing Imagery
    Yu, Nanjing
    Ren, Haohao
    Deng, Tianmin
    Fan, Xiaobiao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [35] Adaptive adjacent context negotiation network for object detection in remote sensing imagery
    School of Automation Engineering, University of Electronic Science and Technology of China, Chengdu, China
    不详
    PeerJ Comput. Sci.,
  • [36] Orientation guided anchoring for geospatial object detection from remote sensing imagery
    Yu, Yongtao
    Guan, Haiyan
    Li, Dilong
    Gu, Tiannan
    Tang, E.
    Li, Aixia
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 160 : 67 - 82
  • [37] Object detection in remote sensing imagery using a discriminatively trained mixture model
    Cheng, Gong
    Han, Junwei
    Guo, Lei
    Qian, Xiaoliang
    Zhou, Peicheng
    Yao, Xiwen
    Hu, Xintao
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2013, 85 : 32 - 43
  • [38] MDCNet: A Multiplatform Distributed Collaborative Network for Object Detection in Remote Sensing Imagery
    Duan, Shujing
    Cheng, Peirui
    Wang, Zhechao
    Wang, Zhirui
    Chen, Kaiqiang
    Sun, Xian
    Fu, Kun
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [39] Adaptive adjacent context negotiation network for object detection in remote sensing imagery
    Dong, Yan
    Liu, Yundong
    Cheng, Yuhua
    Gao, Guangshuai
    Chen, Kai
    Li, Chunlei
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [40] Lifting Based Object Detection Networks of Remote Sensing Imagery for FPGA Accelerator
    Zheng, Yujin
    Shi, Zishan
    He, Chu
    Zhang, Qinglin
    IEEE ACCESS, 2020, 8 (200430-200439) : 200430 - 200439