A Multi-Feature Fusion and Attention Network for Multi-Scale Object Detection in Remote Sensing Images

被引:6
|
作者
Cheng, Yong [1 ]
Wang, Wei [2 ]
Zhang, Wenjie [3 ]
Yang, Ling [1 ]
Wang, Jun [1 ]
Ni, Huan [4 ]
Guan, Tingzhao [1 ]
He, Jiaxin [2 ]
Gu, Yakang [1 ]
Tran, Ngoc Nguyen [5 ,6 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Software, Nanjing 210044, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Sch Automat, Nanjing 210044, Peoples R China
[3] Nanjing Univ Informat Sci & Technol, Sch Geog Sci, Nanjing 210044, Peoples R China
[4] Nanjing Univ Informat Sci & Technol, Sch Remote Sensing & Geomat Engn, Nanjing 210044, Peoples R China
[5] Hanoi Univ Sci & Technol, Sch Informat & Commun Technol, Hanoi 100803, Vietnam
[6] Univ Technol Sydney, Sch Life Sci, Ultimo 2007, Australia
基金
中国国家自然科学基金;
关键词
remote sensing images; multi-scale object detection; multi-feature fusion and attention network; multi-branch convolution; attention mechanism; loss function;
D O I
10.3390/rs15082096
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Accurate multi-scale object detection in remote sensing images poses a challenge due to the complexity of transferring deep features to shallow features among multi-scale objects. Therefore, this study developed a multi-feature fusion and attention network (MFANet) based on YOLOX. By reparameterizing the backbone, fusing multi-branch convolution and attention mechanisms, and optimizing the loss function, the MFANet strengthened the feature extraction of objects at different sizes and increased the detection accuracy. The ablation experiment was carried out on the NWPU VHR-10 dataset. Our results showed that the overall performance of the improved network was around 2.94% higher than the average performance of every single module. Based on the comparison experiments, the improved MFANet demonstrated a high mean average precision of 98.78% for 9 classes of objects in the NWPU VHR-10 10-class detection dataset and 94.91% for 11 classes in the DIOR 20-class detection dataset. Overall, MFANet achieved an mAP of 96.63% and 87.88% acting on the NWPU VHR-10 and DIOR datasets, respectively. This method can promote the development of multi-scale object detection in remote sensing images and has the potential to serve and expand intelligent system research in related fields such as object tracking, semantic segmentation, and scene understanding.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] MULTI-SCALE FEATURE FUSION NETWORK FOR OBJECT DETECTION IN VHR OPTICAL REMOTE SENSING IMAGES
    Zhang, Wenhua
    Jiao, Licheng
    Liu, Xu
    Liu, Jia
    [J]. 2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 330 - 333
  • [2] Small object detection in remote sensing images based on attention mechanism and multi-scale feature fusion
    Zhang, Li-guo
    Wang, Lei
    Jin, Mei
    Geng, Xing-shuo
    Shen, Qian
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (09) : 3280 - 3297
  • [3] Remote Sensing Small Object Detection Network Based on Attention Mechanism and Multi-Scale Feature Fusion
    Qu, Junsuo
    Tang, Zongbing
    Zhang, Le
    Zhang, Yanghai
    Zhang, Zhenguo
    [J]. REMOTE SENSING, 2023, 15 (11)
  • [4] Multi-Scale Feature Fusion Attention Network for Building Extraction in Remote Sensing Images
    Liu, Jia
    Gu, Hang
    Li, Zuhe
    Chen, Hongyang
    Chen, Hao
    [J]. ELECTRONICS, 2024, 13 (05)
  • [5] Object Detection of Remote Sensing Image Based on Multi-Scale Feature Fusion and Attention Mechanism
    Du, Zuoqiang
    Liang, Yuan
    [J]. IEEE ACCESS, 2024, 12 : 8619 - 8632
  • [6] Remote Sensing Object Detection Method Based on Attention Mechanism and Multi-scale Feature Fusion
    Liu, Yang
    Xiao, Yewei
    [J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7155 - 7160
  • [7] Object Detection in Remote Sensing Images Based on Adaptive Multi-Scale Feature Fusion Method
    Liu, Chun
    Zhang, Sixuan
    Hu, Mengjie
    Song, Qing
    [J]. REMOTE SENSING, 2024, 16 (05)
  • [8] Pyramid attention object detection network with multi-scale feature fusion
    Chen, Xiu
    Li, Yujie
    Nakatoh, Yoshihisa
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2022, 104
  • [9] Deep multi-feature fusion network for remote sensing images
    Xiong, Wei
    Xiong, Zhenyu
    Cui, Yaqi
    Lv, Yafei
    [J]. REMOTE SENSING LETTERS, 2020, 11 (06) : 563 - 571
  • [10] MERGENET: FEATURE-MERGED NETWORK FOR MULTI-SCALE OBJECT DETECTION IN REMOTE SENSING IMAGES
    Wang, Peijin
    Sun, Xian
    Diao, Wenhui
    Fu, Kun
    [J]. 2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 238 - 241