Enhancing intra-aural disease classification with attention-based deep learning models

Authors
Furkancan Demircan [1]
Murat Ekinci [2]
Zafer Cömert [1]
Affiliations
[1] Samsun University, Software Engineering, Faculty of Engineering and Natural Sciences
[2] Karadeniz Technical University, Computer Engineering, Faculty of Engineering
[3] Department of Technical Sciences of the Western Caspian University
Keywords
Classification; Ear diseases; Deep learning; Transformers; Machine learning
DOI
10.1007/s00521-025-10990-4
Abstract
Ear diseases are pathological conditions that impair the function of the ear, the organ of the auditory system that regulates hearing and balance. They usually affect the internal components of the ear and present with symptoms such as hearing loss, ear pain, balance problems, and fluid accumulation in the ear. Diagnostic accuracy currently depends on expert knowledge and subjective opinion, which makes the process prone to human error. This study presents a novel computer-aided diagnosis system for otoscope images of ear diseases that combines a state-of-the-art vision transformer feature extractor with machine learning classifiers to provide accurate second opinions for ear, nose, and throat (ENT) specialists. In the experimental study, a dataset of 880 eardrum images in four classes (CSOM, earwax, myringosclerosis, and normal) was split into training (70%), validation (10%), and testing (20%) subsets. Each image was preprocessed to 420 × 380 pixels to fit the input dimensions of the models. The vision transformer architecture was used for feature extraction, followed by classification with several machine learning algorithms, including k-nearest neighbors (kNN), support vector machine (SVM), and random forest. The model combining the vision transformer feature extractor with the kNN classifier achieved 99.00% accuracy. The resulting deep learning-based, computer-aided diagnosis system was developed as an alternative to the current error-prone diagnostic method used by ENT specialists. Its main purpose is to support diagnosis where expert knowledge is difficult to access and to provide an alternative opinion alongside the expert diagnosis.
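The evaluation protocol described in the abstract (a 70/10/20 split of 880 images followed by kNN classification of extracted features) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the authors' implementation: the feature vectors are synthetic Gaussian clusters standing in for vision transformer embeddings, the classes are assumed to be balanced (220 images each, which the abstract does not state), and the helper names `stratified_split` and `knn_predict` are hypothetical.

```python
import math
import random
from collections import Counter, defaultdict

CLASSES = ["CSOM", "earwax", "myringosclerosis", "normal"]


def stratified_split(labels, fractions=(0.7, 0.1, 0.2), seed=0):
    """Split sample indices into train/val/test lists, class by class."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    train, val, test = [], [], []
    for indices in by_class.values():
        rng.shuffle(indices)
        n_train = round(fractions[0] * len(indices))
        n_val = round(fractions[1] * len(indices))
        train += indices[:n_train]
        val += indices[n_train:n_train + n_val]
        test += indices[n_train + n_val:]  # remainder goes to the test set
    return train, val, test


def knn_predict(train_x, train_y, query, k):
    """Majority vote among the k training vectors closest to `query`."""
    order = sorted(range(len(train_x)), key=lambda i: math.dist(train_x[i], query))
    votes = Counter(train_y[i] for i in order[:k])
    return votes.most_common(1)[0][0]


def accuracy(train_x, train_y, xs, ys, k):
    """Fraction of (xs, ys) samples that kNN labels correctly."""
    hits = sum(knn_predict(train_x, train_y, x, k) == y for x, y in zip(xs, ys))
    return hits / len(ys)


# ASSUMPTION: balanced classes and synthetic 4-dimensional features.
# In the real pipeline these vectors would come from a vision transformer.
rng = random.Random(42)
labels = [CLASSES[i % 4] for i in range(880)]
features = []
for label in labels:
    center = [5.0 if d == CLASSES.index(label) else 0.0 for d in range(4)]
    features.append([c + rng.gauss(0.0, 0.5) for c in center])

train_idx, val_idx, test_idx = stratified_split(labels)
train_x = [features[i] for i in train_idx]
train_y = [labels[i] for i in train_idx]
val_x = [features[i] for i in val_idx]
val_y = [labels[i] for i in val_idx]
test_x = [features[i] for i in test_idx]
test_y = [labels[i] for i in test_idx]

# Pick k on the validation subset, then report accuracy on the held-out test subset.
best_k = max((1, 3, 5), key=lambda k: accuracy(train_x, train_y, val_x, val_y, k))
test_accuracy = accuracy(train_x, train_y, test_x, test_y, best_k)
print(f"k={best_k}, test accuracy={test_accuracy:.4f}")
```

Under the balanced-class assumption the split yields 616 training, 88 validation, and 176 test samples. The accuracy printed here reflects only how well separated the synthetic clusters are and says nothing about the reported 99.00% result.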
Pages: 6601-6616
Page count: 15