Attention-based multi-scale feature fusion network for myopia grading using optical coherence tomography images

Cited by: 4
Authors
Huang, Gengyou [1]
Wen, Yang [2]
Qian, Bo [1]
Bi, Lei [3]
Chen, Tingli [4]
Sheng, Bin [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Computer Sci & Engn, Shanghai, Peoples R China
[2] Shenzhen Univ, Sch Elect & Informat Engn, Shenzhen, Peoples R China
[3] Shanghai Jiao Tong Univ, Inst Translat Med, Shanghai, Peoples R China
[4] Huadong Sanat, Wuxi, Jiangsu, Peoples R China
Source
VISUAL COMPUTER | 2024 / Volume 40 / Issue 09
Funding
US National Science Foundation;
Keywords
Optical coherence tomography (OCT); Myopia grading; Deep Learning; Vision Transformer; Attention fusion;
DOI
10.1007/s00371-023-03189-y
Chinese Library Classification number
TP31 [Computer software];
Subject classification code
081202; 0835;
Abstract
Myopia is a serious threat to eye health and can even cause blindness. It is important to grade myopia and carry out targeted intervention. Many recent studies use deep learning models based on optical coherence tomography (OCT) images to screen for high myopia. However, since the regions of interest (ROIs) of pre-myopia and low myopia on OCT images are relatively small, it is difficult to use OCT images for detailed myopia grading, and few studies have attempted it. To address these problems, we propose a novel attention-based multi-scale feature fusion network named AMFF for myopia grading using OCT images. The proposed AMFF mainly consists of five modules: a pre-trained vision transformer (ViT) module, a multi-scale convolutional module, an attention feature fusion module, an Avg-TopK pooling module and a fully connected (FC) classifier. First, the ViT is pre-trained without supervision on the training set so that it extracts better feature maps. Second, multi-scale convolutional layers extract feature maps at multiple scales to cover more receptive fields and capture scale-invariant features. Third, feature maps of different scales are fused through channel attention and spatial attention to obtain more meaningful features. Lastly, the most prominent features are obtained as the weighted average of the highest activation values of each channel, and they are then used to classify myopia through a fully connected layer. Extensive experiments show that the proposed model outperforms other state-of-the-art myopia grading models.
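The Avg-TopK pooling step described in the abstract (a weighted average of the highest activation values of each channel) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function name `avg_topk_pool`, the parameter `k`, and the choice of weights are hypothetical, since the abstract does not specify how many activations are kept or how they are weighted.

```python
from typing import List, Optional, Sequence


def avg_topk_pool(
    channels: Sequence[Sequence[Sequence[float]]],
    k: int,
    weights: Optional[Sequence[float]] = None,
) -> List[float]:
    """Pool each channel to one value: the weighted average of its k largest activations.

    channels: one 2D feature map (list of rows) per channel.
    k:        number of top activations kept per channel (assumed hyperparameter).
    weights:  one weight per kept activation, largest activation first;
              defaults to uniform weights (assumption, not taken from the paper).
    """
    if k <= 0:
        raise ValueError("k must be positive")
    if weights is None:
        weights = [1.0] * k
    if len(weights) != k:
        raise ValueError("weights must have exactly k entries")
    total_weight = sum(weights)
    if total_weight == 0:
        raise ValueError("weights must not sum to zero")

    pooled = []
    for channel in channels:
        # Flatten the 2D map, then keep the k highest activations.
        flat = [value for row in channel for value in row]
        if len(flat) < k:
            raise ValueError("k exceeds the number of activations in a channel")
        top = sorted(flat, reverse=True)[:k]
        pooled.append(sum(w * v for w, v in zip(weights, top)) / total_weight)
    return pooled


# Two toy 2x2 channels (made-up values, for illustration only).
feature_maps = [
    [[0.25, 1.0], [0.5, 0.75]],
    [[2.0, 0.0], [1.0, 3.0]],
]
uniform = avg_topk_pool(feature_maps, k=2)                    # [0.875, 2.5]
weighted = avg_topk_pool(feature_maps, k=2, weights=[3, 1])   # [0.9375, 2.75]
```

With `k` equal to 1 this reduces to global max pooling, and with `k` equal to the number of activations and uniform weights it reduces to global average pooling, which is one way to see why a top-K average sits between the two. The resulting per-channel vector is what a fully connected classifier would consume.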
Pages: 6627-6638
Number of pages: 12