MSFANet: multi-scale fusion attention network for mangrove remote sensing lmage segmentation using pattern recognition

被引:1
|
作者
Fu, Lixiang [1 ]
Chen, Jinbiao [2 ]
Wang, Zhuoying [3 ]
Zang, Tao [1 ]
Chen, Huandong [1 ]
Wu, Shulei [1 ]
Zhao, Yuchen [1 ]
机构
[1] Hainan Normal Univ, Sch Informat Sci & Technol, Haikou 571158, Peoples R China
[2] Peoples Police Univ China, Smart Police Coll, Langfang 065000, Peoples R China
[3] Hainan Normal Univ, Fine Arts Acad, Haikou 571158, Peoples R China
关键词
44;
D O I
10.1186/s13677-023-00565-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mangroves are ecosystems that grow in the intertidal areas of coastal zones, playing crucial ecological roles and possessing unique economic and social values. They have garnered significant attention and research interest. Semantic segmentation of mangroves is a fundamental step for further investigations. However, mangrove remote sensing images often have large dimensions, with a substantial portion of the image containing mangrove features. Deep learning convolutional kernels may lead to inadequate receptive fields for accurate mangrove recognition. In mangrove remote sensing images, various challenges arise, including the presence of small and intricate details aside from the mangrove regions, which intensify the segmentation complexity. To address these issues, this paper primarily focuses on two key aspects: first, the exploration of methods to achieve a large receptive field, and second, the fusion of multi-scale information. To this end, we propose the Multi-Scale Fusion Attention Network (MSFANet), which incorporates a multi-scale network structure with a large receptive field for feature fusion. We emphasize preserving spatial information by integrating spatial data across different scales, employing separable convolutions to reduce computational complexity. Additionally, we introduce an Attention Fusion Module (AFM). This module helps mitigate the influence of irrelevant information and enhances segmentation quality. To retain more semantic information, this paper introduces a dual channel approach for information extraction through the deep structure of ResNet. We fuse features using the Feature Fusion Module (FFM) to combine both semantic and spatial information for the final output, further enhancing segmentation accuracy. In this study, a total of 230 images with dimensions of 768 pixels in width and height were selected for this experiment, with 184 images used for training and 46 images for validation. Experimental results demonstrate that our proposed method achieves excellent segmentation results on a small sample dataset of remote-sensing images, with significant practical value. This paper primarily focuses on three key aspects: the generation of mangrove datasets, the preprocessing of mangrove data, and the design and training of models. The primary contribution of this paper lies in the development of an effective approach for multi-scale information fusion and advanced feature preservation, providing a novel solution for mangrove remote sensing image segmentation tasks. The best Mean Intersection over Union (MIoU) achieved on the mangrove dataset is 86%, surpassing other existing models by a significant margin.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] The remote sensing image segmentation of land cover based on multi-scale attention features
    Hu, Haiyang
    Yang, Linnan
    Chen, Jiaojiao
    Luo, Shuang
    [J]. 2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 429 - 436
  • [32] MCNet: A Multi-scale and Cascade Network for Semantic Segmentation of Remote Sensing Images
    Zhou, Yin
    Li, Tianyi
    Li, Xianju
    Feng, Ruyi
    [J]. WEB AND BIG DATA, PT II, APWEB-WAIM 2023, 2024, 14332 : 162 - 176
  • [33] A multi-scale contextual attention network for remote sensing visual question answering
    Feng, Jiangfan
    Wang, Hui
    [J]. INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 126
  • [34] EMR-HRNet: A Multi-Scale Feature Fusion Network for Landslide Segmentation from Remote Sensing Images
    Jin, Yuanhang
    Liu, Xiaosheng
    Huang, Xiaobin
    [J]. SENSORS, 2024, 24 (11)
  • [35] Near-shore remote sensing target recognition based on multi-scale attention reconstructing convolutional network
    Zhao, Song
    Wang, Long
    Song, Lujie
    Ma, Pengge
    Liao, Liang
    Liu, Zhaoyu
    Zhao, Xiaobin
    [J]. FRONTIERS IN MARINE SCIENCE, 2024, 11
  • [36] Multi-scale Attentive Fusion Network for Remote Sensing Image Change Captioning
    Chen, Cai
    Wang, Yi
    Yap, Kim-Hui
    [J]. 2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [37] Multi-Modality and Multi-Scale Attention Fusion Network for Land Cover Classification from VHR Remote Sensing Images
    Lei, Tao
    Li, Linze
    Lv, Zhiyong
    Zhu, Mingzhe
    Du, Xiaogang
    Nandi, Asoke K.
    [J]. REMOTE SENSING, 2021, 13 (18)
  • [38] Collaborative Attention Guided Multi-Scale Feature Fusion Network for Medical Image Segmentation
    Xu, Zhenghua
    Tian, Biao
    Liu, Shijie
    Wang, Xiangtao
    Yuan, Di
    Gu, Junhua
    Chen, Junyang
    Lukasiewicz, Thomas
    Leung, Victor C. M.
    [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (02): : 1857 - 1871
  • [39] Attention-Guided Multi-Scale Fusion Network for Similar Objects Semantic Segmentation
    Fengqin Yao
    Shengke Wang
    Laihui Ding
    Guoqiang Zhong
    Shu Li
    Zhiwei Xu
    [J]. Cognitive Computation, 2024, 16 : 366 - 376
  • [40] A Multi-Scale Channel Attention Network for Prostate Segmentation
    Ding, Meiwen
    Lin, Zhiping
    Lee, Chau Hung
    Tan, Cher Heng
    Huang, Weimin
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (05) : 1754 - 1758