Deep Learning and Artificial Intelligence Applied to Model Speech and Language in Parkinson's Disease

被引:9
|
作者
Escobar-Grisales, Daniel [1 ]
Rios-Urrego, Cristian David [1 ]
Orozco-Arroyave, Juan Rafael [1 ,2 ]
机构
[1] Univ Antioquia, Fac Engn, GITA Lab, Medellin 050010, Colombia
[2] Univ Erlangen Nurnberg, LME Lab, D-91054 Erlangen, Germany
关键词
Parkinson's disease; natural language processing; speech processing; convolutional neural networks; Wav2Vec; word embeddings; DISCOURSE;
D O I
10.3390/diagnostics13132163
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Parkinson's disease (PD) is the second most prevalent neurodegenerative disorder in the world, and it is characterized by the production of different motor and non-motor symptoms which negatively affect speech and language production. For decades, the research community has been working on methodologies to automatically model these biomarkers to detect and monitor the disease; however, although speech impairments have been widely explored, language remains underexplored despite being a valuable source of information, especially to assess cognitive impairments associated with non-motor symptoms. This study proposes the automatic assessment of PD patients using different methodologies to model speech and language biomarkers. One-dimensional and two-dimensional convolutional neural networks (CNNs), along with pre-trained models such as Wav2Vec 2.0, BERT, and BETO, were considered to classify PD patients vs. Healthy Control (HC) subjects. The first approach consisted of modeling speech and language independently. Then, the best representations from each modality were combined following early, joint, and late fusion strategies. The results show that the speech modality yielded an accuracy of up to 88%, thus outperforming all language representations, including the multi-modal approach. These results suggest that speech representations better discriminate PD patients and HC subjects than language representations. When analyzing the fusion strategies, we observed that changes in the time span of the multi-modal representation could produce a significant loss of information in the speech modality, which was likely linked to a decrease in accuracy in the multi-modal experiments. Further experiments are necessary to validate this claim with other fusion methods using different time spans.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Deep Learning Applied to Deep Brain Stimulation in Parkinson's Disease
    Guillen, Pablo
    HIGH PERFORMANCE COMPUTING CARLA 2016, 2017, 697 : 269 - 278
  • [2] Machine Learning Applied to Speech Recordings for Parkinson's Disease Recognition
    Aversano, Lerina
    Bernardi, Mario L.
    Cimitile, Marta
    Iammarino, Martina
    Madau, Antonella
    Verdone, Chiara
    DEEP LEARNING THEORY AND APPLICATIONS, DELTA 2023, 2023, 1875 : 101 - 114
  • [3] A cross-language speech model for detection of Parkinson's disease
    Lim, Wee Shin
    Chiu, Shu-, I
    Peng, Pei-Ling
    Jang, Jyh-Shing Roger
    Lee, Sol-Hee
    Lin, Chin-Hsien
    Kim, Han-Joon
    JOURNAL OF NEURAL TRANSMISSION, 2025, 132 (04) : 579 - 590
  • [4] Artificial Intelligence Model for Parkinson Disease Detection Using Machine Learning Algorithms
    Sunil Yadav
    Munindra Kumar Singh
    Saurabh Pal
    Biomedical Materials & Devices, 2023, 1 (2): : 899 - 911
  • [5] Potentialities of Applied Translation for Language Learning in the Era of Artificial Intelligence
    Munoz-Basols, Javier
    Neville, Craig
    Lafford, Barbara A.
    Godev, Concepcion
    HISPANIA-A JOURNAL DEVOTED TO THE TEACHING OF SPANISH AND PORTUGUESE, 2023, 106 (02): : 171 - 194
  • [6] MAKEDONKA: Applied Deep Learning Model for Text-to-Speech Synthesis in Macedonian Language
    Mishev, Kostadin
    Karovska Ristovska, Aleksandra
    Trajanov, Dimitar
    Eftimov, Tome
    Simjanoska, Monika
    APPLIED SCIENCES-BASEL, 2020, 10 (19): : 1 - 14
  • [7] Speech parameter and deep learning based approach for the detection of parkinson’s disease
    Krishna A.
    Sahu S.P.
    Janghel R.R.
    Singh B.K.
    Lecture Notes on Data Engineering and Communications Technologies, 2021, 66 : 507 - 517
  • [8] Deep brain stimulation and speech: A new model of speech function and dysfunction in Parkinson's disease
    Montgomery, Erwin B., Jr.
    JOURNAL OF MEDICAL SPEECH-LANGUAGE PATHOLOGY, 2007, 15 (03) : IX - XXV
  • [9] Federated Learning of Explainable Artificial Intelligence Models for Predicting Parkinson's Disease Progression
    Barcena, Jose Luis Corcuera
    Ducange, Pietro
    Marcelloni, Francesco
    Renda, Alessandro
    Ruffini, Fabrizio
    EXPLAINABLE ARTIFICIAL INTELLIGENCE, XAI 2023, PT I, 2023, 1901 : 630 - 648
  • [10] Language, speech, and communication disorders in Parkinson's disease
    Krysiak, Adrian Piotr
    NEUROPSYCHIATRIA I NEUROPSYCHOLOGIA, 2011, 6 (01): : 36 - 42