A Multi-Feature Fusion and Attention Network for Multi-Scale Object Detection in Remote Sensing Images

被引:6
|
作者
Cheng, Yong [1 ]
Wang, Wei [2 ]
Zhang, Wenjie [3 ]
Yang, Ling [1 ]
Wang, Jun [1 ]
Ni, Huan [4 ]
Guan, Tingzhao [1 ]
He, Jiaxin [2 ]
Gu, Yakang [1 ]
Tran, Ngoc Nguyen [5 ,6 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Software, Nanjing 210044, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Sch Automat, Nanjing 210044, Peoples R China
[3] Nanjing Univ Informat Sci & Technol, Sch Geog Sci, Nanjing 210044, Peoples R China
[4] Nanjing Univ Informat Sci & Technol, Sch Remote Sensing & Geomat Engn, Nanjing 210044, Peoples R China
[5] Hanoi Univ Sci & Technol, Sch Informat & Commun Technol, Hanoi 100803, Vietnam
[6] Univ Technol Sydney, Sch Life Sci, Ultimo 2007, Australia
基金
中国国家自然科学基金;
关键词
remote sensing images; multi-scale object detection; multi-feature fusion and attention network; multi-branch convolution; attention mechanism; loss function;
D O I
10.3390/rs15082096
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Accurate multi-scale object detection in remote sensing images poses a challenge due to the complexity of transferring deep features to shallow features among multi-scale objects. Therefore, this study developed a multi-feature fusion and attention network (MFANet) based on YOLOX. By reparameterizing the backbone, fusing multi-branch convolution and attention mechanisms, and optimizing the loss function, the MFANet strengthened the feature extraction of objects at different sizes and increased the detection accuracy. The ablation experiment was carried out on the NWPU VHR-10 dataset. Our results showed that the overall performance of the improved network was around 2.94% higher than the average performance of every single module. Based on the comparison experiments, the improved MFANet demonstrated a high mean average precision of 98.78% for 9 classes of objects in the NWPU VHR-10 10-class detection dataset and 94.91% for 11 classes in the DIOR 20-class detection dataset. Overall, MFANet achieved an mAP of 96.63% and 87.88% acting on the NWPU VHR-10 and DIOR datasets, respectively. This method can promote the development of multi-scale object detection in remote sensing images and has the potential to serve and expand intelligent system research in related fields such as object tracking, semantic segmentation, and scene understanding.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Object Detection in Remote Sensing Images via Multi-Feature Pyramid Network with Receptive Field Block
    Yuan, Zhichao
    Liu, Ziming
    Zhu, Chunbo
    Qi, Jing
    Zhao, Danpei
    [J]. REMOTE SENSING, 2021, 13 (05)
  • [32] OBJECT-ORIENTED CHANGE DETECTION FOR REMOTE SENSING IMAGES BASED ON MULTI-SCALE FUSION
    Feng, Wenqing
    Sui, Haigang
    Tu, Jihui
    [J]. XXIII ISPRS CONGRESS, COMMISSION VII, 2016, 41 (B7): : 483 - 491
  • [33] Small Object Detection in UAV Remote Sensing Images Based on Intra-Group Multi-Scale Fusion Attention and Adaptive Weighted Feature Fusion Mechanism
    Yuan, Zhe
    Gong, Jianglei
    Guo, Baolong
    Wang, Chao
    Liao, Nannan
    Song, Jiawei
    Wu, Qiming
    [J]. Remote Sensing, 2024, 16 (22)
  • [34] Attention-Based Multi-Level Feature Fusion for Object Detection in Remote Sensing Images
    Dong, Xiaohu
    Qin, Yao
    Gao, Yinghui
    Fu, Ruigang
    Liu, Songlin
    Ye, Yuanxin
    [J]. REMOTE SENSING, 2022, 14 (15)
  • [35] Scene Classification of High-Resolution Remote Sensing Image by Multi-scale and Multi-feature Fusion
    Huang H.
    Xu K.-J.
    Shi G.-Y.
    [J]. Huang, Hong (hhuang@cqu.edu.cn), 1824, Chinese Institute of Electronics (48): : 1824 - 1833
  • [36] Multi-Scale Feature Attention-DEtection TRansformer: Multi-Scale Feature Attention for security check object detection
    Sima, Haifeng
    Chen, Bailiang
    Tang, Chaosheng
    Zhang, Yudong
    Sun, Junding
    [J]. IET COMPUTER VISION, 2024, 18 (05) : 613 - 625
  • [37] MULTI-SCALE OBJECT DETECTION WITH FEATURE FUSION AND REGION OBJECTNESS NETWORK
    Guan, Wenjie
    Zou, YueXian
    Zhou, Xiaoqun
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2596 - 2600
  • [38] Multi-Scale Object Detection Using Feature Fusion Recalibration Network
    Guo, Ziyuan
    Zhang, Weimin
    Liang, Zhenshuo
    Shi, Yongliang
    Huang, Qiang
    [J]. IEEE ACCESS, 2020, 8 : 51664 - 51673
  • [39] Multi-scale and multi-feature high resolution remote sensing image segmentation
    Zhao, Qiang
    Zhang, Sheng
    Huang, Shuling
    [J]. International Journal of Applied Mathematics and Statistics, 2013, 51 (22): : 343 - 350
  • [40] Multi-scale Cross Dual Attention Network for Building Change Detection in Remote Sensing Images
    Zhang J.
    Yan Z.
    Ma S.
    [J]. Journal of Geo-Information Science, 2023, 25 (12) : 2487 - 2500