Enhancing intra-aural disease classification with attention-based deep learning models

被引:0
|
作者
Furkancan Demircan [1 ]
Murat Ekinci [2 ]
Zafer Cömert [1 ]
机构
[1] Samsun University,Software Engineering, Faculty of Engineering and Natural Sciences
[2] Karadeniz Technical University,Computer Engineering, Faculty of Engineering
[3] Department of Technical Sciences of the Western Caspian University,undefined
关键词
Classification; Ear diseases; Deep learning; Transformers; Machine learning;
D O I
10.1007/s00521-025-10990-4
中图分类号
学科分类号
摘要
Ear diseases are defined as pathological conditions that indicate dysfunction or abnormal function of the ear organ, which is part of the auditory system of living organisms that regulates hearing and balance functions. These diseases usually manifest as conditions that affect the internal components of the ear structure and can manifest themselves with symptoms such as hearing loss, ear pain, balance problems, and fluid accumulation in the ear. The accuracy of the diagnosis depends on expert knowledge and subjective opinion. This method is prone to human error. This study presents a novel computer-aided diagnosis system for otoscope images of ear diseases, utilizing a vision transformer-based feature extractor combined with machine learning classifiers to provide accurate second opinions for ENT specialists. For this purpose, a new model based on state-of-the-art vision transformer feature extractor and machine learning models is proposed. In the experimental study, the dataset, comprising 880 eardrum images categorized into four classes (CSOM, earwax, myringosclerosis, and normal), was split into training (70%), validation (10%), and testing (20%) subsets. Each image was preprocessed to 420 × 380 pixels to fit the input dimensions of the models. The vision transformer architecture was utilized for feature extraction, followed by classification using various machine learning algorithms including kNN, SVM, and random forest. As a result, the model using vision transformer feature extractor and k-nearest neighbors (kNN) algorithm achieved 99.00% accuracy. In this study, a deep learning-based and computer-aided diagnosis system, in other words, a computational model, was developed instead of the current human error-prone disease diagnosis method used by ear nose throat (ENT) specialists. The main purpose of the deep learning-based decision support system is to support the diagnosis process where expert knowledge is difficult to access and to provide an alternative opinion to the expert diagnosis.
引用
收藏
页码:6601 / 6616
页数:15
相关论文
共 50 条
  • [1] Enhancing Time Series Product Demand Forecasting With Hybrid Attention-Based Deep Learning Models
    Zhang, Xuguang
    Li, Pan
    Han, Xu
    Yang, Yongbin
    Cui, Yiwen
    IEEE Access, 2024, 12 : 190079 - 190091
  • [2] An attention-based deep learning for acute lymphoblastic leukemia classification
    Jawahar, Malathy
    Anbarasi, L. Jani
    Narayanan, Sathiya
    Gandomi, Amir H.
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [3] Federated deep active learning for attention-based transaction classification
    Usman Ahmed
    Jerry Chun-Wei Lin
    Philippe Fournier-Viger
    Applied Intelligence, 2023, 53 : 8631 - 8643
  • [4] Federated deep active learning for attention-based transaction classification
    Ahmed, Usman
    Lin, Jerry Chun-Wei
    Fournier-Viger, Philippe
    APPLIED INTELLIGENCE, 2023, 53 (08) : 8631 - 8643
  • [5] Multimodal attention-based deep learning for automatic modulation classification
    Han, Jia
    Yu, Zhiyong
    Yang, Jian
    FRONTIERS IN ENERGY RESEARCH, 2023, 10
  • [6] Company Industry Classification with Neural and Attention-Based Learning Models
    Slavov, Stanislav
    Tagarev, Andrey
    Tulechki, Nikola
    Boytcheva, Svetla
    2019 BIG DATA, KNOWLEDGE AND CONTROL SYSTEMS ENGINEERING (BDKCSE), 2019,
  • [7] Attention-Based Light Weight Deep Learning Models for Early Potato Disease Detection
    Kasana, Singara Singh
    Rathore, Ajayraj Singh
    APPLIED SCIENCES-BASEL, 2024, 14 (17):
  • [8] Global Semantic Classification of Fluvial Landscapes with Attention-Based Deep Learning
    Carbonneau, Patrice E.
    Remote Sensing, 2024, 16 (24)
  • [9] An Attention-Based Mechanism to Combine Images and Metadata in Deep Learning Models Applied to Skin Cancer Classification
    Pacheco, Andre G. C.
    Krohling, Renato A.
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (09) : 3554 - 3563
  • [10] Enhancing ASD classification through hybrid attention-based learning of facial features
    Shahzad, Inzamam
    Khan, Saif Ur Rehman
    Waseem, Abbas
    Abideen, Zain U. I.
    Liu, Jin
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (SUPPL 1) : 475 - 488