Unsupervised pronunciation grammar generation for non-native speech recognition

被引:0
|
作者
Huang, Chien-Lin [1 ]
Wu, Chung-Hsien [1 ]
Chen, Yi [1 ]
Hsu, Chin-Shun [2 ]
Lee, Kuei-Ming [2 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
[2] Inst Informat Ind, Tainan, Taiwan
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This study presents a novel approach to unsupervised pronunciation grammar generation for non-native speech recognition. Unsupervised pronunciation grammar generation includes pronunciation variation graph construction, stochastic Markov search and grammar selection. Context-dependent relation and phone broad class information are used for variation graph construction. Confidence measure and co-occurrence frequency are used to select the variants of pronunciation grammar for non-native speech modeling. Experiments show that unsupervised pronunciation grammar generation is suitable for the improvement of non-native speech recognition.
引用
收藏
页码:452 / +
页数:2
相关论文
共 50 条
  • [21] Investigating automatic recognition of non-native arabic speech
    Selouani, Sid-Ahmed
    Alotaibi, Yousef Ajami
    [J]. 2007 INNOVATIONS IN INFORMATION TECHNOLOGIES, VOLS 1 AND 2, 2007, : 204 - +
  • [22] Dual supervised learning for non-native speech recognition
    Radzikowski, Kacper
    Nowak, Robert
    Wang, Le
    Yoshie, Osamu
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2019, 2019 (1)
  • [23] Audio style transfer for non-native speech recognition
    Radzikowski, Kacper
    [J]. PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2018, 2018, 10808
  • [24] Non-native pronunciation variants of city names as a problem for speech technology applications
    Schaden, S
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2003, 2807 : 229 - 236
  • [25] Dual supervised learning for non-native speech recognition
    Kacper Radzikowski
    Robert Nowak
    Le Wang
    Osamu Yoshie
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2019
  • [26] Optimizing non-native speech recognition for CALL applications
    van Doremalen, Joost
    Strik, Helmer
    Cucchiarini, Catia
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 588 - 591
  • [27] Multilingual Weighted Codebooks for Non-native Speech Recognition
    Raab, Martin
    Gruhn, Rainer
    Noeth, Elmar
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2008, 5246 : 485 - +
  • [28] Influence of native and non-native multitalker babble on speech recognition in noise
    Jain, Chandni
    Konadath, Sreeraj
    Vimal, Bharathi M.
    Suresh, Vidhya
    [J]. AUDIOLOGY RESEARCH, 2014, 4 (01) : 9 - 13
  • [29] NON-NATIVE SPEECH CORPORA FOR THE DEVELOPMENT OF COMPUTER ASSISTED PRONUNCIATION TRAINING SYSTEMS
    Carranza, M.
    Cucchiarini, C.
    Burgos, P.
    Strik, H.
    [J]. EDULEARN14: 6TH INTERNATIONAL CONFERENCE ON EDUCATION AND NEW LEARNING TECHNOLOGIES, 2014, : 3624 - 3633
  • [30] Predicting Word Accuracy for the Automatic Speech Recognition of Non-Native Speech
    Yoon, Su-Youn
    Chen, Lei
    Zechner, Klaus
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 773 - 776