Enhancing Speech Recognition for Parkinson’s Disease Patient Using Transfer Learning Technique

被引:7
|
作者
Yu Q. [1 ,2 ]
Ma Y. [1 ,2 ]
Li Y. [1 ,2 ]
机构
[1] Department of Micro-Nano Electronics, Shanghai Jiao Tong University, Shanghai
[2] MoE Key Lab of Artificial Intelligence, Shanghai Jiao Tong University, Shanghai
来源
关键词
A; data augmentation; parkinson’s disease; R; 857.3; scarce data; speech recognition; transfer learning technique;
D O I
10.1007/s12204-021-2376-3
中图分类号
学科分类号
摘要
Parkinson’s disease patients suffer from disorders of speech. The most frequently reported speech problems are weak, hoarse, nasal or monotonous voice, imprecise articulation, slow or fast speech, difficulty starting speech, impaired stress or rhythm, stuttering, and tremor. To improve the speech quality and assist the patient with speech rehabilitation therapy, we have proposed the speech recognition model for Parkinson’s disease patients using transfer learning technique (PSTL), where we have pre-trained the long short-term memory (LSTM) neural network model with our developed publicly available dataset that has been obtained from healthy people through the social media platform. Then, we applied the transfer learning technique to improve the performance of the PSTL framework. The frequency spectrogram masking data augmentation method has been used to alleviate the over-fitting problem so that the word error rate (WER) is further reduced. Even with a limited dataset, our proposed model has effectively reduced the WER from 58% to 44.5% on the original speech dataset and 53.1% to 43% on the denoised speech dataset, which demonstrated the feasibility of our framework. © 2021, Shanghai Jiao Tong University and Springer-Verlag GmbH Germany, part of Springer Nature.
引用
收藏
页码:90 / 98
页数:8
相关论文
共 50 条
  • [41] Multiclass recognition of Alzheimer's and Parkinson's disease using various machine learning techniques: A study
    Balaji, Chetan
    Suresh, D. S.
    INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING, 2022, 13 (01)
  • [42] Building a Data Base for Automatic Speech Recognition in Parkinson's disease (PD)
    Ramig, L.
    MacDonald, R.
    Jiang, P.
    Hodges, H.
    Spielman, J.
    Reed, O.
    Nauman, E.
    Bergey, C.
    Cattiau, J.
    MOVEMENT DISORDERS, 2021, 36 : S568 - S568
  • [43] Study of the performance of automatic speech recognition systems in speakers with Parkinson's Disease
    Moro-Velazquez, Laureano
    Cho, JaeJin
    Watanabe, Shinji
    Hasegawa-Johnson, Mark A.
    Scharenborg, Odette
    Kim, Heejin
    Dehak, Najim
    INTERSPEECH 2019, 2019, : 3875 - 3879
  • [44] An Unsupervised Two-Step Convolution Sparse Transfer Learning Algorithm for Parkinson's Disease Speech Diagnosis
    Zhang X.-H.
    Zhang X.-Y.
    Li Y.-M.
    Wang P.
    Liu Y.-C.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2022, 50 (01): : 177 - 184
  • [45] Insight into an unsupervised two-step sparse transfer learning algorithm for speech diagnosis of Parkinson’s disease
    Yongming Li
    Xinyue Zhang
    Pin Wang
    Xiaoheng Zhang
    Yuchuan Liu
    Neural Computing and Applications, 2021, 33 : 9733 - 9750
  • [46] Cross-language Transfer Speech Recognition using Deep Learning
    Zhao, Yue
    Xu, Yan M.
    Sun, Mei J.
    Xu, Xiao N.
    Wang, Hui
    Yang, Guo S.
    Ji, Qiang
    11TH IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2014, : 1422 - 1426
  • [47] Summary feedback schedules and speech motor learning in Parkinson's disease
    Adams, SG
    Page, AD
    Jog, M
    JOURNAL OF MEDICAL SPEECH-LANGUAGE PATHOLOGY, 2002, 10 (04) : 215 - 220
  • [48] Insight into an unsupervised two-step sparse transfer learning algorithm for speech diagnosis of Parkinson's disease
    Li, Yongming
    Zhang, Xinyue
    Wang, Pin
    Zhang, Xiaoheng
    Liu, Yuchuan
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (15): : 9733 - 9750
  • [49] Classification of Parkinson Disease Based on Patient's Voice Signal Using Machine Learning
    Ahmed, Imran
    Aljahdali, Sultan
    Khan, Muhammad Shakeel
    Kaddoura, Sanaa
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 32 (02): : 705 - 722
  • [50] Using Transfer Learning Technique for SAR Automatic Target Recognition
    Al Mufti, Maha
    Al Hadhrami, Esra
    Taha, Bilal
    Werghi, Naoufel
    SPIE FUTURE SENSING TECHNOLOGIES, 2019, 11197