A Discriminative Training Method Incorporating Pronunciation Variations for Dysarthric Automatic Speech Recognition

被引:0
|
作者
Seong, Woo Kyeong [1 ]
Kim, Nam Kyun [1 ]
Ha, Hun Kyu [1 ]
Kim, Hong Kook [1 ]
机构
[1] Gwangju Inst Sci & Technol, Sch Elect Engn & Comp Sci, Gwangju 61005, South Korea
基金
新加坡国家研究基金会;
关键词
SPEAKERS; DATABASE; MODEL;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
While dysarthric speech recognition can be a convenient interface for dysarthric speakers, it is hard to collect enough speech data to overcome the underestimation problem of acoustic models. In addition, there are lots of pronunciation variations in the collected database due to the paralysis of the articulator of dysarthric speakers. Thus, a discriminative training method is proposed for improving the performance of such resource-limited dysarthric speech recognition. The proposed method is applied to subspace Gaussian mixture modeling by incorporating pronunciation variations into a conventional minimum phone error discriminative training method.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Automatic Speech Recognition and Pronunciation Training
    Xiao, Wenqi
    [J]. PROCEEDINGS OF THE 2018 2ND INTERNATIONAL CONFERENCE ON EDUCATION, ECONOMICS AND MANAGEMENT RESEARCH (ICEEMR 2018), 2018, 182 : 466 - 468
  • [2] Discriminative Training for Automatic Speech Recognition
    Heigold, Georg
    Ney, Hermann
    Schlueter, Ralf
    Wiesler, Simon
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 58 - 69
  • [3] A Survey of Automatic Speech Recognition for Dysarthric Speech
    Qian, Zhaopeng
    Xiao, Kejing
    [J]. ELECTRONICS, 2023, 12 (20)
  • [4] Discriminative training of HMMs for automatic speech recognition: A survey
    Jiang, Hui
    [J]. COMPUTER SPEECH AND LANGUAGE, 2010, 24 (04): : 589 - 608
  • [5] Multi-Stage DNN Training for Automatic Recognition of Dysarthric Speech
    Yilmaz, Emre
    Ganzeboom, Mario
    Cucchiarini, Catia
    Strik, Helmer
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2685 - 2689
  • [6] Automatic recognition of Arabic dysarthric speech
    Tolba, Hesham M.
    El-Torgoman, Ahmed S.
    [J]. AEJ - Alexandria Engineering Journal, 2010, 49 (02): : 131 - 138
  • [7] Discriminative pronunciation modeling for dialectal speech recognition
    Lehr, Maider
    Gorman, Kyle
    Shafran, Izhak
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1458 - 1462
  • [8] Evaluation of an Automatic Speech Recognition Platform for Dysarthric Speech
    Calvo, Irene
    Tropea, Peppino
    Vigano, Mauro
    Scialla, Maria
    Cavalcante, Agnieszka B.
    Grajzer, Monika
    Gilardone, Marco
    Corbo, Massimo
    [J]. FOLIA PHONIATRICA ET LOGOPAEDICA, 2021, 73 (05) : 432 - 441
  • [9] A survey of technologies for automatic Dysarthric speech recognition
    Qian, Zhaopeng
    Xiao, Kejing
    Yu, Chongchong
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)
  • [10] A survey of technologies for automatic Dysarthric speech recognition
    Zhaopeng Qian
    Kejing Xiao
    Chongchong Yu
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2023