Self-attention neural architecture search for semantic image segmentation

被引:35
|
作者
Fan, Zhenkun [1 ]
Hu, Guosheng [2 ]
Sun, Xin [1 ]
Wang, Gaige [1 ]
Dong, Junyu [1 ]
Su, Chi [3 ]
机构
[1] Ocean Univ China, Dept Comp Sci & Technol, Qingdao 266100, Shandong, Peoples R China
[2] Anyvision, London, England
[3] Kingsoft Cloud, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Self-attention; Neural architecture search; Semantic segmentation;
D O I
10.1016/j.knosys.2021.107968
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Self-attention can capture long-distance dependencies and is widely used in semantic segmentation. Existing methods mainly use two kinds of self-attentions, i.e., spatial attention and channel attention, which can capture the relations in HW dimension (image plane, height and width) and C dimension (channels), respectively. Very little research investigates self-attention along other dimensions, which can potentially improve the segmentation performance. In this work, we investigate the self-attentions along all the possible dimensions {H, W, C, HW, HC, CW, HWC}. Then we explore the aggregation of all the possible self-attentions. We apply the neural architecture search (NAS) technique to achieve optimal aggregation. Specifically, we carefully design (1) the search space and (2) the optimization method. For (1), we introduce a building block, a basic self-attention search unit (BSU), which can model self-attentions along all the dimensions. And the search space contains within-BSU and crossBSU operations. In addition, we propose an attention-map splitting method, which can reduce the computations by 1/3. For (2), we apply an efficient differentiable optimization method to search the optimal aggregation. We conduct extensive experiments on Cityscapes and ADE20K datasets. The results show the effectiveness of the proposed method, and we achieve very competitive performance against state-of-the-art methods. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] DNAS: Decoupling Neural Architecture Search for High-Resolution Remote Sensing Image Semantic Segmentation
    Wang, Yu
    Li, Yansheng
    Chen, Wei
    Li, Yunzhou
    Dang, Bo
    [J]. REMOTE SENSING, 2022, 14 (16)
  • [42] Adaptive soft erasure with edge self-attention for weakly supervised semantic segmentation: Thyroid ultrasound image case study
    Yu, Mei
    Han, Ming
    Li, Xuewei
    Wei, Xi
    Jiang, Han
    Chen, Huiling
    Yu, Ruiguo
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 144
  • [43] Semantic segmentation of remote sensing images based on neural architecture search
    Zhou, Peng
    Yang, Jun
    [J]. Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2021, 48 (05): : 47 - 57
  • [44] Deep Semantic Ranking Hashing Based on Self-Attention for Medical Image Retrieval
    Tang, Yibo
    Chen, Yaxiong
    Xiong, Shengwu
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4960 - 4966
  • [45] Robust Semi-Supervised Semantic Segmentation Based on Self-Attention and Spectral Normalization
    Zhang, Jia
    Li, Zhixin
    Zhang, Canlong
    Ma, Huifang
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [46] Self-Attention Networks for Code Search
    Fang, Sen
    Tan, You-Shuai
    Zhang, Tao
    Liu, Yepang
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2021, 134
  • [47] Self-Attention Prediction Correction with Channel Suppression for Weakly-Supervised Semantic Segmentation
    Sun, Guoying
    Yang, Meng
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 846 - 851
  • [48] Saliency Guided Self-Attention Network for Weakly and Semi-Supervised Semantic Segmentation
    Yao, Qi
    Gong, Xiaojin
    [J]. IEEE ACCESS, 2020, 8 : 14413 - 14423
  • [49] Semantic segmentation using cross-stage feature reweighting and efficient self-attention
    Ma, Yingdong
    Lan, Xiaobin
    [J]. IMAGE AND VISION COMPUTING, 2024, 145
  • [50] EFFICIENT OCT IMAGE SEGMENTATION USING NEURAL ARCHITECTURE SEARCH
    Gheshlaghi, Saba Heidari
    Dehzangi, Omid
    Dahouei, Ali
    Amireskandari, Annahita
    Rezai, Ali
    Nasrabadi, Nasser M.
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 428 - 432