MEDA: Multi-output Encoder-Decoder for Spatial Attention in Convolutional Neural Networks

被引:0
|
作者
Li, Huayu [1 ]
Razi, Abolfazl [1 ]
机构
[1] No Arizona Univ, Sch Informat Comp & Cyber Syst, Flagstaff, AZ 86011 USA
关键词
Attention Mechanism; Deep Learning; Encoder-Decoder Architecture; Convolutional Networks;
D O I
10.1109/ieeeconf44664.2019.9048981
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Utilizing channel-wise spatial attention mechanisms to emphasize special parts of an input image is an effective method to improve the performance of convolutional neural networks (CNNs). There are multiple effective implementations of attention mechanism. One is adding squeeze-and-excitation (SE) blocks to the CNN structure that selectively emphasize the most informative channels and suppress the relatively less informative channels by taking advantage of channel dependence. Another method is adding convolutional block attention module (CBAM) to implement both channel-wise and spatial attention mechanisms to select important pixels of the feature maps while emphasizing informative channels. In this paper, we propose an encoder-decoder architecture based on the idea of letting the channel-wise and spatial attention blocks share the same latent space representation. Instead of separating the channel-wise and spatial attention modules into two independent parts in CBAM, we combine them into one encoder-decoder architecture with two outputs. To evaluate the performance of the proposed algorithm, we apply it to different CNN architectures and test it on image classification and semantic segmentation. Through comparing the resulting structure equipped with MEDA blocks against other attention module, we show that the proposed method achieves better performance across different test scenarios.
引用
收藏
页码:2087 / 2091
页数:5
相关论文
共 50 条
  • [31] Attention-based Encoder-Decoder Recurrent Neural Networks for HTTP Payload Anomaly Detection
    Wu, Shang
    Wang, Yijie
    19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 1452 - 1459
  • [32] Deep Encoder-Decoder Neural Network Architectures for Graph Output Signals
    Rey, Samuel
    Tenorio, Victor
    Rozada, Sergio
    Martino, Luca
    Marques, Antonio G.
    CONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2019, : 225 - 229
  • [33] CT IMAGE DENOISING WITH ENCODER-DECODER BASED GRAPH CONVOLUTIONAL NETWORKS
    Chen, Yu-Jen
    Tsai, Cheng-Yen
    Xu, Xiaowei
    Shi, Yiyu
    Ho, Tsung-Yi
    Huang, Meiping
    Yuan, Haiyun
    Zhuang, Jian
    2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 400 - 404
  • [34] Fully Convolutional Encoder-Decoder With an Attention Mechanism for Practical Pedestrian Trajectory Prediction
    Chen, Kai
    Song, Xiao
    Yuan, Haitao
    Ren, Xiaoxiang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (11) : 20046 - 20060
  • [35] Convolutional Encoder-Decoder Networks for Robust Image-to-Motion Prediction
    Ridge, Barry
    Pahic, Rok
    Ude, Ales
    Morimoto, Jun
    ADVANCES IN SERVICE AND INDUSTRIAL ROBOTICS, 2020, 980 : 514 - 523
  • [36] Eyenet: Attention based Convolutional Encoder-Decoder Network for Eye Region Segmentation
    Kansal, Priya
    Nathan, Sabari
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3688 - 3693
  • [37] PottsMGNet: A Mathematical Explanation of Encoder-Decoder Based Neural Networks
    Tai, Xue-Cheng
    Liu, Hao
    Chan, Raymond
    SIAM JOURNAL ON IMAGING SCIENCES, 2024, 17 (01): : 540 - 594
  • [38] Convolutional LSTM-Attention Based Encoder-Decoder Neural Network for Prediction of Chaotic Vibrations of Multi-Dimensional Dynamic Systems
    Wang, Luyao
    Dai, Liming
    Zhao, Haixing
    Fang, Pan
    INTERNATIONAL JOURNAL OF STRUCTURAL STABILITY AND DYNAMICS, 2024, 24 (20)
  • [39] AttentionHTR: Handwritten Text Recognition Based on Attention Encoder-Decoder Networks
    Kass, Dmitrijs
    Vats, Ekta
    DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 507 - 522
  • [40] Street-view Change Detection via Siamese Encoder-decoder Structured Convolutional Neural Networks
    Zhao, Xinwei
    Li, Haichang
    Wang, Rui
    Zheng, Changwen
    Shi, Song
    PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 525 - 532