Unsupervised pronunciation grammar generation for non-native speech recognition

被引：0

作者：

Huang, Chien-Lin ^{[1
]}

Wu, Chung-Hsien ^{[1
]}

Chen, Yi ^{[1
]}

Hsu, Chin-Shun ^{[2
]}

Lee, Kuei-Ming ^{[2
]}

机构：

[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan

[2] Inst Informat Ind, Tainan, Taiwan

来源：

TENCON 2007 - 2007 IEEE REGION 10 CONFERENCE, VOLS 1-3 | 2007年

关键词：

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

This study presents a novel approach to unsupervised pronunciation grammar generation for non-native speech recognition. Unsupervised pronunciation grammar generation includes pronunciation variation graph construction, stochastic Markov search and grammar selection. Context-dependent relation and phone broad class information are used for variation graph construction. Confidence measure and co-occurrence frequency are used to select the variants of pronunciation grammar for non-native speech modeling. Experiments show that unsupervised pronunciation grammar generation is suitable for the improvement of non-native speech recognition.

引用

页码：452 / +

页数：2

共 50 条

[21] Investigating automatic recognition of non-native arabic speech
Selouani, Sid-Ahmed
Alotaibi, Yousef Ajami
[J]. 2007 INNOVATIONS IN INFORMATION TECHNOLOGIES, VOLS 1 AND 2, 2007, : 204 - +
[22] Dual supervised learning for non-native speech recognition
Radzikowski, Kacper
Nowak, Robert
Wang, Le
Yoshie, Osamu
[J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2019, 2019 (1)
[23] Audio style transfer for non-native speech recognition
Radzikowski, Kacper
[J]. PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2018, 2018, 10808
[24] Non-native pronunciation variants of city names as a problem for speech technology applications
Schaden, S
[J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2003, 2807 : 229 - 236
[25] Dual supervised learning for non-native speech recognition
Kacper Radzikowski
Robert Nowak
Le Wang
Osamu Yoshie
[J]. EURASIP Journal on Audio, Speech, and Music Processing, 2019
[26] Optimizing non-native speech recognition for CALL applications
van Doremalen, Joost
Strik, Helmer
Cucchiarini, Catia
[J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 588 - 591
[27] Multilingual Weighted Codebooks for Non-native Speech Recognition
Raab, Martin
Gruhn, Rainer
Noeth, Elmar
[J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2008, 5246 : 485 - +
[28] Influence of native and non-native multitalker babble on speech recognition in noise
Jain, Chandni
Konadath, Sreeraj
Vimal, Bharathi M.
Suresh, Vidhya
[J]. AUDIOLOGY RESEARCH, 2014, 4 (01) : 9 - 13
[29] NON-NATIVE SPEECH CORPORA FOR THE DEVELOPMENT OF COMPUTER ASSISTED PRONUNCIATION TRAINING SYSTEMS
Carranza, M.
Cucchiarini, C.
Burgos, P.
Strik, H.
[J]. EDULEARN14: 6TH INTERNATIONAL CONFERENCE ON EDUCATION AND NEW LEARNING TECHNOLOGIES, 2014, : 3624 - 3633
[30] Predicting Word Accuracy for the Automatic Speech Recognition of Non-Native Speech
Yoon, Su-Youn
Chen, Lei
Zechner, Klaus
[J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 773 - 776

← 1 2 3 4 5 →