Scale channel attention network for image segmentation

被引:4
|
作者
Chen, Jianjun [1 ,2 ]
Tian, Youliang [3 ]
Ma, Wei [1 ]
Mao, Zhengdong [1 ]
Hu, Yue [1 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Natl Engn Lab Informat Secur Technol, Beijing 100093, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
[3] Guizhou Univ, Coll Comp Sci & Technol, Guizhou Prov Key Lab Publ Big Data, Guiyang 550025, Guizhou, Peoples R China
关键词
Image segmentation; Convolutional neural network; Attention mechanism; Spatial pyramid pooling; Multi-source and heterogeneous data;
D O I
10.1007/s11042-020-08921-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The object scale variation results in a negative effect on image segmentation performance. Spatial pyramid pooling module or the attention mechanism are two widely used components in deep neural networks to handle this problem. Applying the single component commonly achieves limited benefit. To push the limit, in this paper, we propose a scale channel attention network (SCA-Net), which enhances the fusion feature of multi-scale by using channel attention components. After the multiple-scale pooling step, the multi-scale spatial information distributes in different feature channels. Meanwhile, the channel attention block is employed to guide SCA-Net focus on the object-relevant scale channels. We further explore the channel attention block and find a simple yet effective structure to combine global average pooling and global maximum pooling, resulting in a robust global information encoder. The SCA-Net does not contain any time-consuming post-processing, which is an extra step after the neural network for the segmentation result optimization. The assessment results on PASCAL VOC 2012 and Cityscapes benchmarks achieve the test set performance of 75.5% and 77.0%.
引用
收藏
页码:16473 / 16489
页数:17
相关论文
共 50 条
  • [31] MCDALNet: Multi-scale Contextual Dual Attention Learning Network for Medical Image Segmentation
    Guo, Pengcheng
    Su, Xiangdong
    Zhang, Haoran
    Bao, Feilong
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [32] MSAR-Net: A multi-scale attention residual network for medical image segmentation
    Li, Xiaoheng
    Chen, Cheng
    Chen, Yunqing
    Yu, Ming-an
    Xiao, Ruoxiu
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 104
  • [33] Collaborative Attention Guided Multi-Scale Feature Fusion Network for Medical Image Segmentation
    Xu, Zhenghua
    Tian, Biao
    Liu, Shijie
    Wang, Xiangtao
    Yuan, Di
    Gu, Junhua
    Chen, Junyang
    Lukasiewicz, Thomas
    Leung, Victor C. M.
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (02): : 1857 - 1871
  • [34] Label-aware Attention Network with Multi-scale Boosting for Medical Image Segmentation
    Wang, Linbo
    Xu, Peng
    Cao, Xianfeng
    Nappi, Michele
    Wan, Shaohua
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255
  • [35] SMANet: Superpixel-guided multi-scale attention network for medical image segmentation
    Shen, Yiwei
    Guo, Junchen
    Liu, Yan
    Xu, Chang
    Li, Qingwu
    Qi, Fei
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 100
  • [36] CANet: cross attention network for food image segmentation
    Dong, Xiaoxiao
    Li, Haisheng
    Wang, Xiaochuan
    Wang, Wei
    Du, Junping
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (21) : 60987 - 61006
  • [37] LANet: A Ladder Attention Network for Image Semantic Segmentation
    Wang, Dongli
    Wang, Bo
    Zhou, Yan
    45TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY (IECON 2019), 2019, : 126 - 131
  • [38] TANet: Triple Attention Network for medical image segmentation
    Wei, Xin
    Ye, Fanghua
    Wan, Huan
    Xu, Jianfeng
    Min, Weidong
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 82
  • [39] Fully convolutional attention network for biomedical image segmentation
    Cheng, Junlong
    Tian, Shengwei
    Yu, Long
    Lu, Hongchun
    Lv, Xiaoyi
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2020, 107
  • [40] Multimodal parallel attention network for medical image segmentation
    Wang, Zhibing
    Wang, Wenmin
    Li, Nannan
    Zhang, Shenyong
    Chen, Qi
    Jiang, Zhe
    IMAGE AND VISION COMPUTING, 2024, 147