The Effect of Noise on Deep Learning for Classification of Pathological Voice

被引:1
|
作者
Hasebe, Koki [1 ]
Kojima, Tsuyoshi [1 ,2 ]
Fujimura, Shintaro [1 ]
Tamura, Keiichi [1 ]
Kawai, Yoshitaka [1 ]
Kishimoto, Yo [1 ]
Omori, Koichi [1 ]
机构
[1] Kyoto Univ, Grad Sch Med, Dept Otolaryngol Head & Neck Surg, Kyoto, Japan
[2] Kyoto Univ, Grad Sch Med, Dept Otolaryngol Head & Neck Surg, 54 Shogoin Kawahara Cho,Sakyo Ku, Kyoto 6068507, Japan
来源
LARYNGOSCOPE | 2024年 / 134卷 / 08期
基金
日本学术振兴会;
关键词
1D-CNN; GRBAS scale; machine learning; noise resilience; voice disorders;
D O I
10.1002/lary.31303
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
ObjectiveThis study aimed to evaluate the significance of background noise in machine learning models assessing the GRBAS scale for voice disorders.MethodsA dataset of 1406 voice samples was collected from retrospective data, and a 5-layer 1D convolutional neural network (CNN) model was constructed using TensorFlow. The dataset was divided into training, validation, and test data. Gaussian noise was added to test samples at various intensities to assess the model's noise resilience. The model's performance was evaluated using accuracy, F1 score, and quadratic weighted Cohen's kappa score.ResultsThe model's performance on the GRBAS scale generally declined with increasing noise intensities. For the G scale, accuracy dropped from 70.9% (original) to 8.5% (at the highest noise), F1 score from 69.2% to 1.3%, and Cohen's kappa from 0.679 to 0.0. Similar declines were observed for the remaining RBAS components.ConclusionThe model's performance was affected by background noise, with substantial decreases in evaluation metrics as noise levels intensified. Future research should explore noise-tolerant techniques, such as data augmentation, to improve the model's noise resilience in real-world settings.Level of EvidenceThis study evaluates a machine learning model using a single dataset without comparative controls. Given its non-comparative design and specific focus, it aligns with Level 4 evidence (Case-series) under the 2011 OCEBM guidelines Laryngoscope, 2024
引用
收藏
页码:3537 / 3541
页数:5
相关论文
共 50 条
  • [21] Pathological Image Classification of Cancer Cell Subtypes Based on Deep Learning
    Chen, Kai
    Tang, Shiyi
    Qin, Quansheng
    Hu, Lianjiang
    Ye, Jing
    2019 2ND INTERNATIONAL CONFERENCE ON MECHANICAL ENGINEERING, INDUSTRIAL MATERIALS AND INDUSTRIAL ELECTRONICS (MEIMIE 2019), 2019, : 478 - 483
  • [22] ECG noise classification using deep learning with feature extraction
    Vibinkumar Vijayakumar
    Shaik Ummar
    Thomas J. Varghese
    Anu Elizabeth Shibu
    Signal, Image and Video Processing, 2022, 16 : 2287 - 2293
  • [23] Robustness of Deep Learning models in electrocardiogram noise detection and classification
    Rahman, Saifur
    Pal, Shantanu
    Yearwood, John
    Karmakar, Chandan
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 253
  • [24] Deep Learning Based Noise Level Classification of Medical Images
    Zhang, Yifei
    Wu, Chengdong
    Chi, Jianning
    Yu, Xiaosheng
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2019, PT I, 2019, 11740 : 535 - 546
  • [25] Classification of Imbalanced Data Using Deep Learning with Adding Noise
    Fan, Wan-Wei
    Lee, Ching-Hung
    JOURNAL OF SENSORS, 2021, 2021 (2021)
  • [26] ECG noise classification using deep learning with feature extraction
    Vijayakumar, Vibinkumar
    Ummar, Shaik
    Varghese, Thomas J.
    Shibu, Anu Elizabeth
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (08) : 2287 - 2293
  • [27] Pathological Voice Recognition by Deep Neural Network
    Zhang, Xiaojun
    Tao, Zhi
    Zhao, Heming
    Xu, Tianqi
    2017 4TH INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2017, : 464 - 468
  • [28] A NEW INDEX FOR EVALUATION OF THE TURBULENT NOISE IN PATHOLOGICAL VOICE
    FUKAZAWA, T
    ELASSUOOTY, A
    HONJO, I
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1988, 83 (03): : 1189 - 1193
  • [29] Optimized Deep Learning for the Classification of Parkinson’s Disease Based on Voice Features
    Sharanyaa S.
    Sambath M.
    Renjith P.N.
    Critical Reviews in Biomedical Engineering, 2022, 50 (05) : 1 - 28
  • [30] Healthy vs pathological classification of corneal nerves images using deep learning
    Scarpa, Fabio
    Colonna, Alessia
    Ruggeri, Alfredo
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2019, 60 (09)