Multi-scale multi-attention network for diabetic retinopathy grading

Cited by: 1
Authors
Xia, Haiying [1]
Long, Jie [1]
Song, Shuxiang [1]
Tan, Yumei [2]
Affiliations
[1] Guangxi Normal Univ, Sch Elect & Informat Engn, Guilin 541004, Peoples R China
[2] Guangxi Normal Univ, Sch Comp Sci & Engn, Guilin 541004, Peoples R China
Source
PHYSICS IN MEDICINE AND BIOLOGY | 2024 / Vol. 69 / No. 01
Funding
National Natural Science Foundation of China
Keywords
diabetic retinopathy grading; lesions attention module; multi-scale feature fusion module; CONVOLUTIONAL NEURAL-NETWORK; SYSTEM; DIAGNOSIS
DOI
10.1088/1361-6560/ad111d
Chinese Library Classification number
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
Objective. Diabetic retinopathy (DR) grading plays an important role in clinical diagnosis. However, automatic DR grading is challenging because of intra-class variation and small lesions. On the one hand, deep features learned by convolutional neural networks often lose valid information about these small lesions. On the other hand, lesion features vary greatly in type and quantity, and can diverge considerably even among fundus images of the same grade. To address these issues, we propose a novel multi-scale multi-attention network (MMNet). Approach. Firstly, to focus on different lesion features of fundus images, we propose a lesion attention module, which encodes multiple different lesion attention feature maps by combining channel attention and spatial attention, thus extracting global feature information and preserving diverse lesion features. Secondly, we propose a multi-scale feature fusion module to learn more feature information for small lesion regions; it combines complementary relationships between different convolutional layers to capture more detailed feature information. Furthermore, we introduce a Cross-layer Consistency Constraint Loss to overcome semantic differences between multi-scale features. Main results. The proposed MMNet obtains an accuracy of 86.4% and a kappa score of 88.4% for multi-class DR grading on the EyePACS dataset, and achieves 98.6% AUC, 95.3% accuracy, 92.7% recall, 95.0% precision, and 93.3% F1-score for referral versus non-referral classification on the Messidor-1 dataset. Extensive experiments on these two challenging benchmarks demonstrate that MMNet achieves significant improvements and outperforms other state-of-the-art DR grading methods. Significance. MMNet improves the efficiency and accuracy of diabetic retinopathy diagnosis and promotes the application of computer-aided medical diagnosis in DR screening.
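The abstract describes combining channel attention and spatial attention but gives no implementation details. The following is a minimal NumPy sketch of the general idea only, in a generic CBAM-like style. It is not the authors' lesion attention module. The function names, the two-layer bottleneck weights `w1` and `w2`, the reduction ratio `r`, and the fixed (unlearned) spatial mixing are all assumptions made for illustration.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def channel_attention(feat, w1, w2):
    """Reweight channels of a (C, H, W) feature map (hypothetical helper)."""
    avg = feat.mean(axis=(1, 2))   # global average pooling -> (C,)
    mx = feat.max(axis=(1, 2))     # global max pooling     -> (C,)

    def mlp(v):
        # Shared bottleneck MLP with ReLU; w1 and w2 are assumed weights.
        return w2 @ np.maximum(w1 @ v, 0.0)

    weights = sigmoid(mlp(avg) + mlp(mx))   # per-channel gate in (0, 1)
    return feat * weights[:, None, None]

def spatial_attention(feat):
    """Reweight spatial positions of a (C, H, W) feature map (hypothetical helper)."""
    avg = feat.mean(axis=0)        # channel-wise average -> (H, W)
    mx = feat.max(axis=0)          # channel-wise max     -> (H, W)
    # A trained module would typically apply a learned convolution here;
    # this sketch uses a fixed equal-weight mix instead (assumption).
    mask = sigmoid(0.5 * (avg + mx))
    return feat * mask[None, :, :]

def attention_sketch(feat, w1, w2):
    """Channel attention followed by spatial attention."""
    return spatial_attention(channel_attention(feat, w1, w2))

# Toy usage with random data; sizes are arbitrary.
rng = np.random.default_rng(0)
C, H, W, r = 8, 4, 4, 2
feat = rng.standard_normal((C, H, W))
w1 = rng.standard_normal((C // r, C)) * 0.1
w2 = rng.standard_normal((C, C // r)) * 0.1
out = attention_sketch(feat, w1, w2)
print(out.shape)
```

Because both gates lie strictly between 0 and 1, the output keeps the input's shape and never exceeds it in magnitude; the gates only decide how much of each channel and each position is passed on.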
Pages: 14