Self-attention neural architecture search for semantic image segmentation

Cited by: 35
Authors
Fan, Zhenkun [1]
Hu, Guosheng [2]
Sun, Xin [1]
Wang, Gaige [1]
Dong, Junyu [1]
Su, Chi [3]
Affiliations
[1] Ocean Univ China, Dept Comp Sci & Technol, Qingdao 266100, Shandong, Peoples R China
[2] Anyvision, London, England
[3] Kingsoft Cloud, Beijing, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Self-attention; Neural architecture search; Semantic segmentation;
DOI
10.1016/j.knosys.2021.107968
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Self-attention can capture long-distance dependencies and is widely used in semantic segmentation. Existing methods mainly use two kinds of self-attention, i.e., spatial attention and channel attention, which capture the relations in the HW dimension (image plane, height and width) and the C dimension (channels), respectively. Very little research investigates self-attention along other dimensions, which could potentially improve segmentation performance. In this work, we investigate self-attention along all the possible dimensions {H, W, C, HW, HC, CW, HWC}. We then explore the aggregation of all the possible self-attentions, applying the neural architecture search (NAS) technique to achieve optimal aggregation. Specifically, we carefully design (1) the search space and (2) the optimization method. For (1), we introduce a building block, a basic self-attention search unit (BSU), which can model self-attention along all the dimensions; the search space contains within-BSU and cross-BSU operations. In addition, we propose an attention-map splitting method, which can reduce the computation by 1/3. For (2), we apply an efficient differentiable optimization method to search for the optimal aggregation. We conduct extensive experiments on the Cityscapes and ADE20K datasets. The results show the effectiveness of the proposed method, and we achieve very competitive performance against state-of-the-art methods. (c) 2021 Elsevier B.V. All rights reserved.
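The seven dimensions named in the abstract are the non-empty subsets of the axes H, W and C. The sketch below is only an illustration, not the paper's BSU: it checks that correspondence and estimates how large the attention map would be for each choice. It assumes each position along the chosen axes is one token, with the remaining axes flattened into that token's features. The feature-map size and the helper names `all_axis_subsets` and `attention_map_shape` are hypothetical.

```python
from itertools import combinations

# The seven attention dimensions listed in the abstract.
ATTENTION_DIMS = ["H", "W", "C", "HW", "HC", "CW", "HWC"]


def all_axis_subsets(axes="HWC"):
    """Return every non-empty subset of the axes as a frozenset."""
    return {
        frozenset(combo)
        for size in range(1, len(axes) + 1)
        for combo in combinations(axes, size)
    }


def attention_map_shape(dims, sizes):
    """Shape of the attention map when tokens are indexed by `dims`.

    Assumption (not from the paper): each position along the chosen
    axes is one token, and the remaining axes are flattened into that
    token's feature vector, so the map is N x N with N = prod(chosen).
    """
    n_tokens = 1
    for axis in dims:
        n_tokens *= sizes[axis]
    return (n_tokens, n_tokens)


# Hypothetical feature-map size, chosen only for illustration.
sizes = {"H": 64, "W": 128, "C": 256}

for dims in ATTENTION_DIMS:
    rows, cols = attention_map_shape(dims, sizes)
    print(f"{dims:>3}: attention map {rows} x {cols}")
```

Under this assumption the map side length is the product of the chosen axis lengths, so the joint HWC choice yields by far the largest map. That growth may be one reason a cost-reduction step such as the attention-map splitting mentioned in the abstract is useful.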
Pages: 9