Optimization of dysarthric speech recognition

被引:0
|
作者
Chen, FX [1 ]
Kostov, A [1 ]
机构
[1] Univ Alberta, Dept Linguist, Edmonton, AB T6G 2M7, Canada
关键词
communication disorders; dysarthria; speech recognition; hidden Markov model;
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
In this study we explored residual vocal ability of people who have severe motor impairments accompanied with severe dysarthria and developed methods for improving performance of automatic speech recognition (ASR) of dysarthric speech. The target applications for this technology are in the development of communication and control devices for these persons. In our speech recognition system we developed an adaptive word detection algorithm to detect words in highly-irregular dysarthric speech. We also implemented perceptually-based mel frequency cepstrum coefficients (MFCC) for parametric representation of the speech signal and we adopted the left-to-right discrete hidden Markov model (DHMM) for speech pattern recognition. The system was tested with one person who has cerebral palsy and dysarthria reducing intelligibility of her speech to less than 15%. Our initial results on a word set consisting of ten digits demonstrated that recognition rates above 90% fan bt; achieved if more than ten repetitions were used for training.
引用
收藏
页码:1436 / 1439
页数:4
相关论文
共 50 条
  • [1] A Survey of Automatic Speech Recognition for Dysarthric Speech
    Qian, Zhaopeng
    Xiao, Kejing
    [J]. ELECTRONICS, 2023, 12 (20)
  • [2] Dysarthric Speech Transformer: A Sequence-to-Sequence Dysarthric Speech Recognition System
    Shahamiri, Seyed Reza
    Lal, Vanshika
    Shah, Dhvani
    [J]. IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2023, 31 : 3407 - 3416
  • [3] The Effectiveness of Time Stretching for Enhancing Dysarthric Speech for Improved Dysarthric Speech Recognition
    Prananta, Luke
    Halpern, Bence Mark
    Feng, Siyuan
    Scharenborg, Odette
    [J]. INTERSPEECH 2022, 2022, : 36 - 40
  • [4] SYNTHESIZING DYSARTHRIC SPEECH USING MULTI-SPEAKER TTS FOR DYSARTHRIC SPEECH RECOGNITION
    Soleymanpour, Mohammad
    Johnson, Michael T.
    Soleymanpour, Rahim
    Berry, Jeffrey
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7382 - 7386
  • [5] Articulatory Knowledge in the Recognition of Dysarthric Speech
    Rudzicz, Frank
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04): : 947 - 960
  • [6] An approach for speech enhancement with dysarthric speech recognition using optimization based machine learning frameworks
    Jolad B.
    Khanai R.
    [J]. Int J Speech Technol, 2023, 2 (287-305): : 287 - 305
  • [7] PHONETIC ANALYSIS OF DYSARTHRIC SPEECH TEMPO AND APPLICATIONS TO ROBUST PERSONALISED DYSARTHRIC SPEECH RECOGNITION
    Xiong, Feifei
    Barker, Jon
    Christensen, Heidi
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5836 - 5840
  • [8] Evaluation of an Automatic Speech Recognition Platform for Dysarthric Speech
    Calvo, Irene
    Tropea, Peppino
    Vigano, Mauro
    Scialla, Maria
    Cavalcante, Agnieszka B.
    Grajzer, Monika
    Gilardone, Marco
    Corbo, Massimo
    [J]. FOLIA PHONIATRICA ET LOGOPAEDICA, 2021, 73 (05) : 432 - 441
  • [9] Optimizing Vocabulary Modeling for Dysarthric Speech Recognition
    Na, Minsoo
    Chung, Minhwa
    [J]. COMPUTERS HELPING PEOPLE WITH SPECIAL NEEDS, PT II (ICCHP 2016), 2016, 9759 : 507 - 510
  • [10] A survey of technologies for automatic Dysarthric speech recognition
    Qian, Zhaopeng
    Xiao, Kejing
    Yu, Chongchong
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)