Combining convolutional neural networks and self-attention for fundus diseases identification

被引:16
|
作者
Wang, Keya [1 ]
Xu, Chuanyun [1 ,2 ]
Li, Gang [1 ]
Zhang, Yang [2 ]
Zheng, Yu [1 ]
Sun, Chengjie [1 ]
机构
[1] Chongqing Univ Technol, Sch Artificial Intelligence, Chongqing 401135, Peoples R China
[2] Chongqing Normal Univ, Coll Comp & Informat Sci, Chongqing 401331, Peoples R China
关键词
DIABETIC-RETINOPATHY; GLAUCOMA; NUMBER; PEOPLE;
D O I
10.1038/s41598-022-27358-6
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Early detection of lesions is of great significance for treating fundus diseases. Fundus photography is an effective and convenient screening technique by which common fundus diseases can be detected. In this study, we use color fundus images to distinguish among multiple fundus diseases. Existing research on fundus disease classification has achieved some success through deep learning techniques, but there is still much room for improvement in model evaluation metrics using only deep convolutional neural network (CNN) architectures with limited global modeling ability; the simultaneous diagnosis of multiple fundus diseases still faces great challenges. Therefore, given that the self-attention (SA) model with a global receptive field may have robust global-level feature modeling ability, we propose a multistage fundus image classification model MBSaNet which combines CNN and SA mechanism. The convolution block extracts the local information of the fundus image, and the SA module further captures the complex relationships between different spatial positions, thereby directly detecting one or more fundus diseases in retinal fundus image. In the initial stage of feature extraction, we propose a multiscale feature fusion stem, which uses convolutional kernels of different scales to extract low-level features of the input image and fuse them to improve recognition accuracy. The training and testing were performed based on the ODIR-5k dataset. The experimental results show that MBSaNet achieves state-of-the-art performance with fewer parameters. The wide range of diseases and different fundus image collection conditions confirmed the applicability of MBSaNet.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Combining convolutional neural networks and self-attention for fundus diseases identification
    Keya Wang
    Chuanyun Xu
    Gang Li
    Yang Zhang
    Yu Zheng
    Chengjie Sun
    [J]. Scientific Reports, 13
  • [2] Combining Contextual Information by Self-attention Mechanism in Convolutional Neural Networks for Text Classification
    Wu, Xin
    Cai, Yi
    Li, Qing
    Xu, Jingyun
    Leung, Ho-fung
    [J]. WEB INFORMATION SYSTEMS ENGINEERING, WISE 2018, PT I, 2018, 11233 : 453 - 467
  • [3] Convolutional Self-Attention Networks
    Yang, Baosong
    Wang, Longyue
    Wong, Derek F.
    Chao, Lidia S.
    Tu, Zhaopeng
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 4040 - 4045
  • [4] Gland and Colonoscopy Segmentation Method Combining Self-Attention and Convolutional Neural Network
    Zhang Jiabao
    Xiao Zhiyong
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (02)
  • [5] Global Convolutional Neural Networks With Self-Attention for Fisheye Image Rectification
    Kim, Byunghyun
    Lee, Dohyun
    Min, Kyeongyuk
    Chong, Jongwha
    Joe, Inwhee
    [J]. IEEE ACCESS, 2022, 10 : 129580 - 129587
  • [6] Combining Gated Convolutional Networks and Self-Attention Mechanism for Speech Emotion Recognition
    Li, Chao
    Jiao, Jinlong
    Zhao, Yiqin
    Zhao, Ziping
    [J]. 2019 8TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS (ACIIW), 2019, : 105 - 109
  • [7] Convolutional Recurrent Neural Networks with a Self-Attention Mechanism for Personnel Performance Prediction
    Xue, Xia
    Feng, Jun
    Gao, Yi
    Liu, Meng
    Zhang, Wenyu
    Sun, Xia
    Zhao, Aiqi
    Guo, Shouxi
    [J]. ENTROPY, 2019, 21 (12)
  • [8] Leukocyte subtypes identification using bilinear self-attention convolutional neural network
    Yang, Dongxu
    Zhao, Hongdong
    Han, Tiecheng
    Kang, Qing
    Ma, Juncheng
    Lu, Haiyan
    [J]. MEASUREMENT, 2021, 173
  • [9] Generating self-attention activation maps for visual interpretations of convolutional neural networks
    Liang, Yu
    Li, Maozhen
    Jiang, Changjun
    [J]. NEUROCOMPUTING, 2022, 490 : 206 - 216
  • [10] Automatic Lyrics Transcription using Dilated Convolutional Neural Networks with Self-Attention
    Demirel, Emir
    Ahlback, Sven
    Dixon, Simon
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,