Deep Learning and Artificial Intelligence Applied to Model Speech and Language in Parkinson's Disease

被引：9

作者：

Escobar-Grisales, Daniel ^{[1
]}

Rios-Urrego, Cristian David ^{[1
]}

Orozco-Arroyave, Juan Rafael ^{[1
,2
]}

机构：

[1] Univ Antioquia, Fac Engn, GITA Lab, Medellin 050010, Colombia

[2] Univ Erlangen Nurnberg, LME Lab, D-91054 Erlangen, Germany

来源：

DIAGNOSTICS | 2023年 / 13卷 / 13期

关键词：

Parkinson's disease; natural language processing; speech processing; convolutional neural networks; Wav2Vec; word embeddings; DISCOURSE;

D O I：

10.3390/diagnostics13132163

中图分类号：

R5 [内科学];

学科分类号：

1002 ; 100201 ;

摘要：

Parkinson's disease (PD) is the second most prevalent neurodegenerative disorder in the world, and it is characterized by the production of different motor and non-motor symptoms which negatively affect speech and language production. For decades, the research community has been working on methodologies to automatically model these biomarkers to detect and monitor the disease; however, although speech impairments have been widely explored, language remains underexplored despite being a valuable source of information, especially to assess cognitive impairments associated with non-motor symptoms. This study proposes the automatic assessment of PD patients using different methodologies to model speech and language biomarkers. One-dimensional and two-dimensional convolutional neural networks (CNNs), along with pre-trained models such as Wav2Vec 2.0, BERT, and BETO, were considered to classify PD patients vs. Healthy Control (HC) subjects. The first approach consisted of modeling speech and language independently. Then, the best representations from each modality were combined following early, joint, and late fusion strategies. The results show that the speech modality yielded an accuracy of up to 88%, thus outperforming all language representations, including the multi-modal approach. These results suggest that speech representations better discriminate PD patients and HC subjects than language representations. When analyzing the fusion strategies, we observed that changes in the time span of the multi-modal representation could produce a significant loss of information in the speech modality, which was likely linked to a decrease in accuracy in the multi-modal experiments. Further experiments are necessary to validate this claim with other fusion methods using different time spans.

引用

页数：16

共 50 条

[1] Deep Learning Applied to Deep Brain Stimulation in Parkinson's Disease
Guillen, Pablo
HIGH PERFORMANCE COMPUTING CARLA 2016, 2017, 697 : 269 - 278
[2] Machine Learning Applied to Speech Recordings for Parkinson's Disease Recognition
Aversano, Lerina
Bernardi, Mario L.
Cimitile, Marta
Iammarino, Martina
Madau, Antonella
Verdone, Chiara
DEEP LEARNING THEORY AND APPLICATIONS, DELTA 2023, 2023, 1875 : 101 - 114
[3] A cross-language speech model for detection of Parkinson's disease
Lim, Wee Shin
Chiu, Shu-, I
Peng, Pei-Ling
Jang, Jyh-Shing Roger
Lee, Sol-Hee
Lin, Chin-Hsien
Kim, Han-Joon
JOURNAL OF NEURAL TRANSMISSION, 2025, 132 (04) : 579 - 590
[4] Artificial Intelligence Model for Parkinson Disease Detection Using Machine Learning Algorithms
Sunil Yadav
Munindra Kumar Singh
Saurabh Pal
Biomedical Materials & Devices, 2023, 1 (2): : 899 - 911
[5] Potentialities of Applied Translation for Language Learning in the Era of Artificial Intelligence
Munoz-Basols, Javier
Neville, Craig
Lafford, Barbara A.
Godev, Concepcion
HISPANIA-A JOURNAL DEVOTED TO THE TEACHING OF SPANISH AND PORTUGUESE, 2023, 106 (02): : 171 - 194
[6] MAKEDONKA: Applied Deep Learning Model for Text-to-Speech Synthesis in Macedonian Language
Mishev, Kostadin
Karovska Ristovska, Aleksandra
Trajanov, Dimitar
Eftimov, Tome
Simjanoska, Monika
APPLIED SCIENCES-BASEL, 2020, 10 (19): : 1 - 14
[7] Speech parameter and deep learning based approach for the detection of parkinson’s disease
Krishna A.
Sahu S.P.
Janghel R.R.
Singh B.K.
Lecture Notes on Data Engineering and Communications Technologies, 2021, 66 : 507 - 517
[8] Deep brain stimulation and speech: A new model of speech function and dysfunction in Parkinson's disease
Montgomery, Erwin B., Jr.
JOURNAL OF MEDICAL SPEECH-LANGUAGE PATHOLOGY, 2007, 15 (03) : IX - XXV
[9] Federated Learning of Explainable Artificial Intelligence Models for Predicting Parkinson's Disease Progression
Barcena, Jose Luis Corcuera
Ducange, Pietro
Marcelloni, Francesco
Renda, Alessandro
Ruffini, Fabrizio
EXPLAINABLE ARTIFICIAL INTELLIGENCE, XAI 2023, PT I, 2023, 1901 : 630 - 648
[10] Language, speech, and communication disorders in Parkinson's disease
Krysiak, Adrian Piotr
NEUROPSYCHIATRIA I NEUROPSYCHOLOGIA, 2011, 6 (01): : 36 - 42

← 1 2 3 4 5 →