Exploring Spectrogram-Based Audio Classification for Parkinson's Disease: A Study on Speech Classification and Qualitative Reliability Verification

被引:0
|
作者
Jeong, Seung-Min [1 ]
Kim, Seunghyun [1 ]
Lee, Eui Chul [2 ]
Kim, Han Joon [3 ]
机构
[1] Sangmyung Univ, Grad Sch, Dept AI & Informat, Hongjimun 2 Gil 20, Seoul 03016, South Korea
[2] Sangmyung Univ, Dept Human Centered Artificial Intelligence, Hongjimun 2 Gil 20, Seoul 03016, South Korea
[3] Seoul Natl Univ, Coll Med, Seoul Natl Univ Hosp, Dept Neurol, Daehak Ro 101, Seoul 03080, South Korea
基金
新加坡国家研究基金会;
关键词
PSLA; AST; explainable AI; Parkinson's disease; speech classification;
D O I
10.3390/s24144625
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Patients suffering from Parkinson's disease suffer from voice impairment. In this study, we introduce models to classify normal and Parkinson's patients using their speech. We used an AST (audio spectrogram transformer), a transformer-based speech classification model that has recently outperformed CNN-based models in many fields, and a CNN-based PSLA (pretraining, sampling, labeling, and aggregation), a high-performance model in the existing speech classification field, for the study. This study compares and analyzes the models from both quantitative and qualitative perspectives. First, qualitatively, PSLA outperformed AST by more than 4% in accuracy, and the AUC was also higher, with 94.16% for AST and 97.43% for PSLA. Furthermore, we qualitatively evaluated the ability of the models to capture the acoustic features of Parkinson's through various CAM (class activation map)-based XAI (eXplainable AI) models such as GradCAM and EigenCAM. Based on PSLA, we found that the model focuses well on the muffled frequency band of Parkinson's speech, and the heatmap analysis of false positives and false negatives shows that the speech features are also visually represented when the model actually makes incorrect predictions. The contribution of this paper is that we not only found a suitable model for diagnosing Parkinson's through speech using two different types of models but also validated the predictions of the model in practice.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Combining speech sample and feature bilateral selection algorithm for classification of Parkinson's disease
    混合语音段特征双边式优选算法用于帕金森病分类研究
    Li, Yongming (yongmingli@cqu.edu.cn), 1600, West China Hospital, Sichuan Institute of Biomedical Engineering (34):
  • [22] Parkinson's Disease Classification using Pitch Synchronous Speech Segments and Fine Gaussian Kernels based SVM
    Appakaya, Sai Bharadwaj
    Sankar, Ravi
    42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 236 - 239
  • [23] Source and Vocal Tract Cues for Speech-based Classification of Patients with Parkinson's Disease and Healthy Subjects
    Bhattacharjee, Tanuka
    Mallela, Jhansi
    Belur, Yamini
    Atchayaram, Nalini
    Yadav, Ravi
    Reddy, Pradeep
    Gope, Dipanjan
    Ghosh, Prasanta Kumar
    INTERSPEECH 2021, 2021, : 2961 - 2965
  • [24] Exploring Traditional Medicine Diagnostic Classification for Parkinson's Disease Using Hierarchical Clustering
    Zhao, Huiyan
    Kwon, Ojin
    Cha, Jiyun
    Jung, In Chul
    Jun, Purumea
    Jang, Jae Young
    Jang, Jung-Hee
    COMPLEMENTARY MEDICINE RESEARCH, 2024, 31 (02) : 160 - 174
  • [25] Exploring Federated Learning for Speech-based Parkinson's Disease Detection
    Sarlas, Athanasios
    Kalafatelis, Alexandros S.
    Alexandridis, Georgios
    Kourtis, Michail-Alexandros
    Trakadas, Panagiotis
    18TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY & SECURITY, ARES 2023, 2023,
  • [26] Fatigue in Parkinson's Disease: A Qualitative Descriptive Study Exploring the Individual's Perspective
    Bruno, Amy
    NURSING RESEARCH, 2017, 66 (02) : E75 - E76
  • [27] Reliability of a Crohn's disease clinical classification scheme based on disease behavior
    Steinhart, AH
    Girgrah, N
    McLeod, RS
    INFLAMMATORY BOWEL DISEASES, 1998, 4 (03) : 228 - 234
  • [28] Classification of Parkinson's disease and Essential Tremor Based on Structural MRI
    Zhang, Li
    Liu, Chang
    Zhang, Xiujun
    PROCEEDINGS OF 2016 10TH INTERNATIONAL CONFERENCE ON SOFTWARE, KNOWLEDGE, INFORMATION MANAGEMENT & APPLICATIONS (SKIMA), 2016, : 410 - 412
  • [29] HMM for Classification of Parkinson’s Disease Based on the Raw Gait Data
    Abed Khorasani
    Mohammad Reza Daliri
    Journal of Medical Systems, 2014, 38
  • [30] HMM for Classification of Parkinson's Disease Based on the Raw Gait Data
    Khorasani, Abed
    Daliri, Mohammad Reza
    JOURNAL OF MEDICAL SYSTEMS, 2014, 38 (12)