Attention-Guided Multi-Scale Fusion Network for Similar Objects Semantic Segmentation

被引:0
|
作者
Fengqin Yao
Shengke Wang
Laihui Ding
Guoqiang Zhong
Shu Li
Zhiwei Xu
机构
[1] Ocean University of China,
[2] Shandong Willand Intelligent Technology Co.,undefined
[3] Ltd,undefined
来源
Cognitive Computation | 2024年 / 16卷
关键词
Semantic segmentation; Attention-guided; Multi-scale fusion; High inter-class similarity;
D O I
暂无
中图分类号
学科分类号
摘要
Image segmentation accuracy is critical in marine ecological detection utilizing unmanned aerial vehicles (UAVs). By flying a drone around, we can swiftly determine the location of a variety of species. However, remote sensing photos, particularly those of inter-class items, are remarkably similar, and there are a significant number of little objects. The universal segmentation network is ineffective. This research constructs attentional networks that imitate the human cognitive system, inspired by camouflaged object detection and the management of human attentional mechanisms in the recognition of diverse things. This research proposes TriseNet, an attention-guided multi-scale fusion semantic segmentation network that solves the challenges of high item similarity and poor segmentation accuracy in UAV settings. To begin, we employ a bidirectional feature extraction network to extract low-level spatial and high-level semantic information. Second, we leverage the attention-induced cross-level fusion module (ACFM) to create a new multi-scale fusion branch that performs cross-level learning and enhances the representation of inter-class comparable objects. Finally, the receptive field block (RFB) module is used to increase the receptive field, resulting in richer characteristics in specific layers. The inter-class similarity increases the difficulty of segmentation accuracy greatly, whereas the three modules improve feature expression and segmentation results. Experiments are conducted using our UAV dataset, UAV-OUC-SEG (55.61% MIoU), and the public dataset, Cityscapes (76.10% MIoU), to demonstrate the efficacy of our strategy. In two datasets, the TriseNet delivers the best results when compared to other prominent segmentation algorithms.
引用
收藏
页码:366 / 376
页数:10
相关论文
共 50 条
  • [1] Attention-Guided Multi-Scale Fusion Network for Similar Objects Semantic Segmentation
    Yao, Fengqin
    Wang, Shengke
    Ding, Laihui
    Zhong, Guoqiang
    Li, Shu
    Xu, Zhiwei
    [J]. COGNITIVE COMPUTATION, 2024, 16 (01) : 366 - 376
  • [2] GLIMS: Attention-guided lightweight multi-scale hybrid network for volumetric semantic segmentation
    Yazici, Ziya Ata
    Oksuz, Ilkay
    Ekenel, Hazim Kemal
    [J]. IMAGE AND VISION COMPUTING, 2024, 146
  • [3] Lightweight multi-scale attention-guided network for real-time semantic segmentation
    Hu, Xuegang
    Liu, Yuanjing
    [J]. IMAGE AND VISION COMPUTING, 2023, 139
  • [4] Attention-Guided Deep Neural Network With Multi-Scale Feature Fusion for Liver Vessel Segmentation
    Yan, Qingsen
    Wang, Bo
    Zhang, Wei
    Luo, Chuan
    Xu, Wei
    Xu, Zhengqing
    Zhang, Yanning
    Shi, Qinfeng
    Zhang, Liang
    You, Zheng
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (07) : 2629 - 2642
  • [5] Multi-Scale Attention-Guided Network for mammograms classification
    Xu, Chunbo
    Lou, Meng
    Qi, Yunliang
    Wang, Yiming
    Pi, Jiande
    Ma, Yide
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 68
  • [6] Dense Dilated Multi-Scale Supervised Attention-Guided Network for histopathology image segmentation
    Das, Rangan
    Bose, Shirsha
    Chowdhury, Ritesh Sur
    Maulik, Ujjwal
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 163
  • [7] Attention-guided multi-scale learning network for automatic prostate and tumor segmentation on MRI
    Li, Yuchun
    Wu, Yuanyuan
    Huang, Mengxing
    Zhang, Yu
    Bai, Zhiming
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 165
  • [8] Attention-Guided Network for Semantic Video Segmentation
    Li, Jiangyun
    Zhao, Yikai
    Fu, Jun
    Wu, Jiajia
    Liu, Jing
    [J]. IEEE ACCESS, 2019, 7 : 140680 - 140689
  • [9] Multi-scale attention fusion network for semantic segmentation of remote sensing images
    Wen, Zhiqiang
    Huang, Hongxu
    Liu, Shuai
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (24) : 7909 - 7926
  • [10] Attention-guided multi-scale context aggregation network for multi-modal brain glioma segmentation
    Wu, Shaozhi
    Cao, Yunjian
    Li, Xinke
    Liu, Qiyu
    Ye, Yuyun
    Liu, Xingang
    Zeng, Liaoyuan
    Tian, Miao
    [J]. MEDICAL PHYSICS, 2023, 50 (12) : 7629 - 7640