Exploring Spectrogram-Based Audio Classification for Parkinson's Disease: A Study on Speech Classification and Qualitative Reliability Verification

被引:0
|
作者
Jeong, Seung-Min [1 ]
Kim, Seunghyun [1 ]
Lee, Eui Chul [2 ]
Kim, Han Joon [3 ]
机构
[1] Sangmyung Univ, Grad Sch, Dept AI & Informat, Hongjimun 2 Gil 20, Seoul 03016, South Korea
[2] Sangmyung Univ, Dept Human Centered Artificial Intelligence, Hongjimun 2 Gil 20, Seoul 03016, South Korea
[3] Seoul Natl Univ, Coll Med, Seoul Natl Univ Hosp, Dept Neurol, Daehak Ro 101, Seoul 03080, South Korea
基金
新加坡国家研究基金会;
关键词
PSLA; AST; explainable AI; Parkinson's disease; speech classification;
D O I
10.3390/s24144625
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Patients suffering from Parkinson's disease suffer from voice impairment. In this study, we introduce models to classify normal and Parkinson's patients using their speech. We used an AST (audio spectrogram transformer), a transformer-based speech classification model that has recently outperformed CNN-based models in many fields, and a CNN-based PSLA (pretraining, sampling, labeling, and aggregation), a high-performance model in the existing speech classification field, for the study. This study compares and analyzes the models from both quantitative and qualitative perspectives. First, qualitatively, PSLA outperformed AST by more than 4% in accuracy, and the AUC was also higher, with 94.16% for AST and 97.43% for PSLA. Furthermore, we qualitatively evaluated the ability of the models to capture the acoustic features of Parkinson's through various CAM (class activation map)-based XAI (eXplainable AI) models such as GradCAM and EigenCAM. Based on PSLA, we found that the model focuses well on the muffled frequency band of Parkinson's speech, and the heatmap analysis of false positives and false negatives shows that the speech features are also visually represented when the model actually makes incorrect predictions. The contribution of this paper is that we not only found a suitable model for diagnosing Parkinson's through speech using two different types of models but also validated the predictions of the model in practice.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Empirical Wavelet Transform Based Features for Classification of Parkinson's Disease Severity
    Oung, Qi Wei
    Muthusamy, Hariharan
    Basah, Shafriza Nisha
    Lee, Hoileong
    Vijean, Vikneswaran
    JOURNAL OF MEDICAL SYSTEMS, 2018, 42 (02)
  • [42] Empirical Wavelet Transform Based Features for Classification of Parkinson’s Disease Severity
    Qi Wei Oung
    Hariharan Muthusamy
    Shafriza Nisha Basah
    Hoileong Lee
    Vikneswaran Vijean
    Journal of Medical Systems, 2018, 42
  • [43] The characterization of Parkinson's disease related nociplastic pain based on the classification system
    Tezuka, T.
    Nukariya, T.
    Okusa, S.
    Nihei, Y.
    Nakahara, J.
    Seki, M.
    MOVEMENT DISORDERS, 2023, 38 : S221 - S222
  • [44] Classification of Parkinson's Disease in Patch-Based MRI of Substantia Nigra
    Hussain, Sayyed Shahid
    Degang, Xu
    Shah, Pir Masoom
    Ul Islam, Saif
    Alam, Mahmood
    Khan, Izaz Ahmad
    Awwad, Fuad A.
    Ismail, Emad A. A.
    DIAGNOSTICS, 2023, 13 (17)
  • [45] SVM-Based Gait Analysis and Classification for Patients with Parkinson's Disease
    Zheng, Yuncheng
    Weng, Yanhong
    Yang, Xiaoli
    Cai, Guofa
    Cai, Guoen
    Song, Yang
    2021 15TH INTERNATIONAL SYMPOSIUM ON MEDICAL INFORMATION AND COMMUNICATION TECHNOLOGY (ISMICT), 2021, : 53 - 58
  • [46] Feature Selection for Classification Based on Fine Motor Signs of Parkinson's Disease
    Brewer, B. R.
    Pradhan, S.
    Carvell, G.
    Delitto, A.
    2009 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-20, 2009, : 214 - +
  • [47] Optimized Deep Learning for the Classification of Parkinson’s Disease Based on Voice Features
    Sharanyaa S.
    Sambath M.
    Renjith P.N.
    Critical Reviews in Biomedical Engineering, 2022, 50 (05) : 1 - 28
  • [48] Diagnosis of Parkinson's disease on the basis of clinical and genetic classification: a population-based modelling study
    Nalls, Mike A.
    McLean, Cory Y.
    Rick, Jacqueline
    Eberly, Shirley
    Hutten, Samantha J.
    Gwinn, Katrina
    Sutherland, Margaret
    Martinez, Maria
    Heutink, Peter
    Williams, Nigel M.
    Hardy, John
    Gasser, Thomas
    Brice, Alexis
    Price, T. Ryan
    Nicolas, Aude
    Keller, Margaux F.
    Molony, Cliona
    Gibbs, J. Raphael
    Chen-Plotkin, Alice
    Suh, Eunran
    Letson, Christopher
    Fiandaca, Massimo S.
    Mapstone, Mark
    Federoff, Howard J.
    Noyce, Alastair J.
    Morris, Huw
    Van Deerlin, Vivianna M.
    Weintraub, Daniel
    Zabetian, Cyrus
    Hernandez, Dena G.
    Lesage, Suzanne
    Mullins, Meghan
    Conley, Emily Drabant
    Northover, Carrie A. M.
    Frasier, Mark
    Marek, Ken
    Day-Williams, Aaron G.
    Stone, David J.
    Ioannidis, John P. A.
    Singleton, Andrew B.
    LANCET NEUROLOGY, 2015, 14 (10): : 1002 - 1009
  • [49] A qualitative study exploring the clinical phenomenology and impact of hypersexuality in patients with Parkinson’s Disease
    Natalie Tayim
    Jalesh N. Panicker
    Jennifer Foley
    Caroline Selai
    Walaa G. El Sheikh
    Scientific Reports, 14 (1)
  • [50] Raw speech waveform based classification of patients with ALS, Parkinson's Disease and healthy controls using CNN-BLSTM
    Mallela, Jhansi
    Illa, Aravind
    Belur, Yamini
    Atchayaram, Nalini
    Yadav, Ravi
    Reddy, Pradeep
    Gope, Dipanjan
    Ghosh, Prasanta Kumar
    INTERSPEECH 2020, 2020, : 4586 - 4590