Enhancing Speech Recognition for Parkinson’s Disease Patient Using Transfer Learning Technique

被引:7
|
作者
Yu Q. [1 ,2 ]
Ma Y. [1 ,2 ]
Li Y. [1 ,2 ]
机构
[1] Department of Micro-Nano Electronics, Shanghai Jiao Tong University, Shanghai
[2] MoE Key Lab of Artificial Intelligence, Shanghai Jiao Tong University, Shanghai
来源
关键词
A; data augmentation; parkinson’s disease; R; 857.3; scarce data; speech recognition; transfer learning technique;
D O I
10.1007/s12204-021-2376-3
中图分类号
学科分类号
摘要
Parkinson’s disease patients suffer from disorders of speech. The most frequently reported speech problems are weak, hoarse, nasal or monotonous voice, imprecise articulation, slow or fast speech, difficulty starting speech, impaired stress or rhythm, stuttering, and tremor. To improve the speech quality and assist the patient with speech rehabilitation therapy, we have proposed the speech recognition model for Parkinson’s disease patients using transfer learning technique (PSTL), where we have pre-trained the long short-term memory (LSTM) neural network model with our developed publicly available dataset that has been obtained from healthy people through the social media platform. Then, we applied the transfer learning technique to improve the performance of the PSTL framework. The frequency spectrogram masking data augmentation method has been used to alleviate the over-fitting problem so that the word error rate (WER) is further reduced. Even with a limited dataset, our proposed model has effectively reduced the WER from 58% to 44.5% on the original speech dataset and 53.1% to 43% on the denoised speech dataset, which demonstrated the feasibility of our framework. © 2021, Shanghai Jiao Tong University and Springer-Verlag GmbH Germany, part of Springer Nature.
引用
收藏
页码:90 / 98
页数:8
相关论文
共 50 条
  • [11] Parkinson's Disease Recognition by Speech Acoustic Parameters Classification
    Meghraoui, D.
    Boudraa, B.
    Merazi-Meksen, T.
    Boudraa, M.
    MODELLING AND IMPLEMENTATION OF COMPLEX SYSTEMS, MISC 2016, 2016, : 165 - 173
  • [12] Improving speech intelligibility in patients with Parkinson's disease and dysarthria using SpeechEasy®, a speech fluency-enhancing device
    Wang, EQ
    Metman, LV
    de Vries, MH
    MOVEMENT DISORDERS, 2005, 20 : S138 - S138
  • [13] Identification of Parkinson’s disease from speech signal using machine learning approach
    Nayak S.S.
    Darji A.D.
    Shah P.K.
    International Journal of Speech Technology, 2023, 26 (04) : 981 - 990
  • [14] Machine learning for the diagnosis of Parkinson’s disease using speech analysis: a systematic review
    Bang C.
    Bogdanovic N.
    Deutsch G.
    Marques O.
    International Journal of Speech Technology, 2023, 26 (04) : 991 - 998
  • [15] Parkinson's Disease Identification from Speech Signals Using Machine Learning Models
    Saxena, Rahul
    Andrew, J.
    ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 2, AITA 2023, 2024, 844 : 201 - 213
  • [16] Parkinson’s Disease Identification from Speech Signals Using Machine Learning Models
    Saxena, Rahul
    Andrew, J.
    Lecture Notes in Networks and Systems, 2367, (201-213):
  • [17] Ensemble Machine Learning Approach for Parkinson's Disease Detection Using Speech Signals
    Bukhari, Syed Nisar Hussain
    Ogudo, Kingsley A.
    MATHEMATICS, 2024, 12 (10)
  • [18] Different Machine Learning Algorithms for Parkinson's Disease Detection Using Speech Signals
    Raje, Chaitali Shamrao
    Kulkarni, Pramodkumar H.
    Deshmukh, Rupali
    COMMUNICATION AND INTELLIGENT SYSTEMS, VOL 1, ICCIS 2023, 2024, 967 : 169 - 181
  • [19] An ensemble technique to predict Parkinson's disease using machine learning algorithms
    Singh, Nutan
    Tripathi, Priyanka
    SPEECH COMMUNICATION, 2024, 159
  • [20] Enhancing Parkinson's Disease Prediction Using Machine Learning and Feature Selection Methods
    Saeed, Faisal
    Al-Sarem, Mohammad
    Al-Mohaimeed, Muhannad
    Emara, Abdelhamid
    Boulila, Wadii
    Alasli, Mohammed
    Ghabban, Fahad
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (03): : 5639 - 5657