Attention-Guided Multi-Scale Fusion Network for Similar Objects Semantic Segmentation

被引:0
|
作者
Yao, Fengqin [1 ]
Wang, Shengke [1 ]
Ding, Laihui [2 ]
Zhong, Guoqiang [1 ]
Li, Shu [1 ]
Xu, Zhiwei [2 ]
机构
[1] Ocean Univ China, Qingdao 266100, Peoples R China
[2] Shandong Willand Intelligent Technol Co Ltd, Qingdao 266100, Peoples R China
关键词
Semantic segmentation; Attention-guided; Multi-scale fusion; High inter-class similarity;
D O I
10.1007/s12559-023-10206-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image segmentation accuracy is critical in marine ecological detection utilizing unmanned aerial vehicles (UAVs). By flying a drone around, we can swiftly determine the location of a variety of species. However, remote sensing photos, particularly those of inter-class items, are remarkably similar, and there are a significant number of little objects. The universal segmentation network is ineffective. This research constructs attentional networks that imitate the human cognitive system, inspired by camouflaged object detection and the management of human attentional mechanisms in the recognition of diverse things. This research proposes TriseNet, an attention-guided multi-scale fusion semantic segmentation network that solves the challenges of high item similarity and poor segmentation accuracy in UAV settings. To begin, we employ a bidirectional feature extraction network to extract low-level spatial and high-level semantic information. Second, we leverage the attention-induced cross-level fusion module (ACFM) to create a new multi-scale fusion branch that performs cross-level learning and enhances the representation of inter-class comparable objects. Finally, the receptive field block (RFB) module is used to increase the receptive field, resulting in richer characteristics in specific layers. The inter-class similarity increases the difficulty of segmentation accuracy greatly, whereas the three modules improve feature expression and segmentation results. Experiments are conducted using our UAV dataset, UAV-OUC-SEG (55.61% MIoU), and the public dataset, Cityscapes (76.10% MIoU), to demonstrate the efficacy of our strategy. In two datasets, the TriseNet delivers the best results when compared to other prominent segmentation algorithms.
引用
下载
收藏
页码:366 / 376
页数:11
相关论文
共 50 条
  • [41] Noise Suppression of DAS Seismic Data by Attention-guided Multi-scale Generative Adversarial Network
    Wu N.
    Wang Y.
    Li Y.
    Geophysics, 2023, 88 (03)
  • [42] Attention-guided chained context aggregation for semantic segmentation*
    Tang, Quan
    Liu, Fagui
    Zhang, Tong
    Jiang, Jun
    Zhang, Yu
    IMAGE AND VISION COMPUTING, 2021, 115 (115)
  • [43] A Residual UNet Denoising Network Based on Multi-Scale Feature Extraction and Attention-Guided Filter
    Liu, Hualin
    Li, Zhe
    Lin, Shijie
    Cheng, Libo
    SENSORS, 2023, 23 (16)
  • [44] MSDRA-NET: A MULTI-SCALE ATTENTION-GUIDED NETWORK FOR MAGNETIC RESONANCE IMAGE RESTORATION
    You, Xuexiao
    Cao, Ning
    Wang, Wei
    JOURNAL OF MECHANICS IN MEDICINE AND BIOLOGY, 2024, 24 (02)
  • [45] Attention Guided Multi Scale Feature Fusion Network for Automatic Prostate Segmentation
    Li, Yuchun
    Huang, Mengxing
    Zhang, Yu
    Bai, Zhiming
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 78 (02): : 1649 - 1668
  • [46] An Asymmetric Semantic Segmentation Model via Lightweight Attention-Guided Feature Enhancement and Fusion
    Qingsong Tang
    Minghui Zhao
    Yalei Ren
    Xiaomeng Shi
    Wuming Jiang
    Cognitive Computation, 2025, 17 (1)
  • [47] A Stackable Attention-Guided Multi-scale CNN for Number Plate Detection
    Wang, Yixuan
    Zheng, Shangdong
    Xu, Wei
    Xu, Yang
    Zhan, Tianming
    Zheng, Peng
    Wei, Zhihui
    Wu, Zebin
    IMAGE AND GRAPHICS, ICIG 2019, PT I, 2019, 11901 : 199 - 209
  • [48] Attention-guided Unified Network for Panoptic Segmentation
    Li, Yanwei
    Chen, Xinze
    Zhu, Zheng
    Xie, Lingxi
    Huang, Guan
    Du, Dalong
    Wang, Xingang
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7019 - 7028
  • [49] Multiscale Attention-Guided Panoptic Segmentation Network
    Fu, Du
    Qu, Shaojun
    Fu, Ya
    Computer Engineering and Applications, 2023, 59 (22) : 223 - 232
  • [50] Multi-scale Spatial-Spectral Attention Guided Fusion Network for Pansharpening
    Yang, Yong
    Li, Mengzhen
    Huang, Shuying
    Lu, Hangyuan
    Tu, Wei
    Wan, Weiguo
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3346 - 3354