A Discriminative Training Method Incorporating Pronunciation Variations for Dysarthric Automatic Speech Recognition

被引：0

作者：

Seong, Woo Kyeong ^{[1
]}

Kim, Nam Kyun ^{[1
]}

Ha, Hun Kyu ^{[1
]}

Kim, Hong Kook ^{[1
]}

机构：

[1] Gwangju Inst Sci & Technol, Sch Elect Engn & Comp Sci, Gwangju 61005, South Korea

来源：

2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA) | 2016年

基金：

新加坡国家研究基金会;

关键词：

SPEAKERS; DATABASE; MODEL;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

While dysarthric speech recognition can be a convenient interface for dysarthric speakers, it is hard to collect enough speech data to overcome the underestimation problem of acoustic models. In addition, there are lots of pronunciation variations in the collected database due to the paralysis of the articulator of dysarthric speakers. Thus, a discriminative training method is proposed for improving the performance of such resource-limited dysarthric speech recognition. The proposed method is applied to subspace Gaussian mixture modeling by incorporating pronunciation variations into a conventional minimum phone error discriminative training method.

引用

页数：5

共 50 条

[1] Automatic Speech Recognition and Pronunciation Training
Xiao, Wenqi
[J]. PROCEEDINGS OF THE 2018 2ND INTERNATIONAL CONFERENCE ON EDUCATION, ECONOMICS AND MANAGEMENT RESEARCH (ICEEMR 2018), 2018, 182 : 466 - 468
[2] Discriminative Training for Automatic Speech Recognition
Heigold, Georg
Ney, Hermann
Schlueter, Ralf
Wiesler, Simon
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 58 - 69
[3] A Survey of Automatic Speech Recognition for Dysarthric Speech
Qian, Zhaopeng
Xiao, Kejing
[J]. ELECTRONICS, 2023, 12 (20)
[4] Discriminative training of HMMs for automatic speech recognition: A survey
Jiang, Hui
[J]. COMPUTER SPEECH AND LANGUAGE, 2010, 24 (04): : 589 - 608
[5] Multi-Stage DNN Training for Automatic Recognition of Dysarthric Speech
Yilmaz, Emre
Ganzeboom, Mario
Cucchiarini, Catia
Strik, Helmer
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2685 - 2689
[6] Automatic recognition of Arabic dysarthric speech
Tolba, Hesham M.
El-Torgoman, Ahmed S.
[J]. AEJ - Alexandria Engineering Journal, 2010, 49 (02): : 131 - 138
[7] Discriminative pronunciation modeling for dialectal speech recognition
Lehr, Maider
Gorman, Kyle
Shafran, Izhak
[J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1458 - 1462
[8] Evaluation of an Automatic Speech Recognition Platform for Dysarthric Speech
Calvo, Irene
Tropea, Peppino
Vigano, Mauro
Scialla, Maria
Cavalcante, Agnieszka B.
Grajzer, Monika
Gilardone, Marco
Corbo, Massimo
[J]. FOLIA PHONIATRICA ET LOGOPAEDICA, 2021, 73 (05) : 432 - 441
[9] A survey of technologies for automatic Dysarthric speech recognition
Qian, Zhaopeng
Xiao, Kejing
Yu, Chongchong
[J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)
[10] A survey of technologies for automatic Dysarthric speech recognition
Zhaopeng Qian
Kejing Xiao
Chongchong Yu
[J]. EURASIP Journal on Audio, Speech, and Music Processing, 2023

← 1 2 3 4 5 →